AP's industry-leading taxonomy and tagging provide accurate, comprehensive, richly detailed descriptive metadata, designed specifically for use by news publishers, and tuned to be search- and SEO-friendly to support digital news products. AP Metadata Services APIs give you direct access to the same classification system that supports AP's award-winning global news operation.
Good metadata offers a variety of benefits, opening up new possibilities for connecting with readers and managing content:
Overview
Get a quick introduction to the AP Metadata Services (PDF).
Developer's Guide
Review the complete documentation of API methods (PDF).
Taxonomy Release Notes
Learn what's changed in the latest AP News Taxonomy release (PDF).
Tagging Release Notes
See what's new in the latest release of the AP Tagging service (PDF).
GitHub
View code samples and ontology files.
View taxonomy data samples: RDF/XML, RDF/TTL, JSON-LD, HTML
View tagging service data samples: RDF/XML, RDF/TTL, n-triples, JSON-LD, Simple XML
Integrate the News Taxonomy into your publishing systems to support manual tagging, or use it in conjunction with the AP Tagging service to apply the taxonomy to your content.
In addition to standardized terminology and unique IDs, the taxonomy stores a variety of details about the people, places, and topics it contains.
This information can power enhanced search experiences, browsing and discovery, or informational displays; for example:
The AP News Taxonomy has five main areas of coverage:
The IAB Tech Lab Content Taxonomy is the Interactive Advertising Bureau's standard collection of content classification categories. The taxonomy service gives you access to mappings between the AP News Taxonomy's Subject terms and the IAB's classification categories.
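As a rough illustration of how such mappings might be used once retrieved, the sketch below looks up the IAB categories for an AP Subject term. The dictionary layout, the placeholder term IDs and labels, and the function name `iab_categories_for` are all assumptions made for illustration; the actual response format is described in the Developer's Guide.

```python
# Hypothetical, hand-written mapping data: AP Subject term ID -> IAB categories.
# The IDs and labels are placeholders, not real taxonomy values.
AP_TO_IAB = {
    "ap-subject-001": ["IAB category A"],
    "ap-subject-002": ["IAB category B", "IAB category C"],
}


def iab_categories_for(ap_term_id, mappings=AP_TO_IAB):
    """Return the IAB categories mapped to an AP Subject term, or an empty list."""
    return list(mappings.get(ap_term_id, []))
```

Returning an empty list for an unmapped term keeps calling code simple, since a Subject term may not yet have a completed mapping.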
The AP News Taxonomy is constantly being updated to capture the latest news and the biggest newsmakers. Whether it’s this week’s IPOs or the new crop of college athletes, AP’s taxonomy developers are always working to keep the vocabularies current and relevant.
Subscribers are kept up-to-date in real time – as soon as a change is published, the new version becomes available in the AP News Taxonomy. A detailed log of all changes is accessible through a separate API. You can keep track of all changes, or just the ones you care most about.
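To make the idea of tracking "just the ones you care most about" concrete, here is a minimal sketch of filtering change-log entries on the client side. The `ChangeEntry` record, its field names, and the example values are assumptions for illustration and do not reflect the actual change-log API.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ChangeEntry:
    """One hypothetical change-log record; the field names are assumptions."""
    term_id: str
    vocabulary: str   # e.g. "Subject" or "People" (assumed labels)
    change_type: str  # e.g. "added" or "deprecated" (assumed values)


def filter_changes(entries, vocabularies=None, change_types=None):
    """Keep entries matching the given vocabularies and change types.

    Passing None for a filter means "do not filter on this field".
    """
    kept = []
    for entry in entries:
        if vocabularies is not None and entry.vocabulary not in vocabularies:
            continue
        if change_types is not None and entry.change_type not in change_types:
            continue
        kept.append(entry)
    return kept
```

Calling `filter_changes(log)` with no filters returns every entry, while `filter_changes(log, vocabularies={"Subject"}, change_types={"deprecated"})` narrows the log to deprecated Subject terms.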
Taxonomy mappings are updated as external taxonomy updates become available and as new mappings are completed.
Subscribers can access the taxonomy by making calls to an API to request the full set of terms in a given vocabulary, a subset of terms, or information about a particular term. Calls are also available for retrieving deprecated terms, term change logs, and additional information about the structure of the taxonomy. Taxonomy data can be returned in Semantic Web-compatible formats, such as RDF/XML, RDF/TTL or JSON-LD. Information about a particular term can also be returned as HTML.
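Once a JSON-LD response is in hand, it can be processed with ordinary JSON tooling. The following sketch uses only Python's standard library to build an ID-to-label lookup. The sample document, its `@graph` layout, and the `prefLabel` key are assumptions made for illustration and may not match what the taxonomy API actually returns.

```python
import json

# Hand-written JSON-LD sample; the keys and values are assumptions for
# illustration and may not match the real taxonomy output.
SAMPLE = """
{
  "@graph": [
    {"@id": "http://example.org/term/1", "prefLabel": "Example term one"},
    {"@id": "http://example.org/term/2", "prefLabel": "Example term two"}
  ]
}
"""


def labels_by_id(jsonld_text):
    """Map each node's @id to its label, skipping nodes that lack either key."""
    document = json.loads(jsonld_text)
    nodes = document.get("@graph", [])
    return {
        node["@id"]: node["prefLabel"]
        for node in nodes
        if "@id" in node and "prefLabel" in node
    }
```

A lookup like this is one way to support manual tagging tools, which typically need to show a readable label for each unique ID.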
To learn more, refer to the Developer's Guide.
The AP Tagging service receives your English-language news content and automatically returns relevant metadata, using standardized terminology from the AP News Taxonomy or the IAB Tech Lab Content Taxonomy.
The service goes well beyond mere text extraction; it uses a hybrid approach to concept and entity tagging, leveraging a deep knowledge graph for better domain targeting, as well as human-created context-aware semantic rules that understand your content and identify the most pertinent entities and topics.
Each tag applied to your content has a relevance score indicating how pertinent it is to your submitted text. Only the tags that truly describe your content will be returned.
Human-created rules allow for more precise control over the service's performance.
The system recognizes and returns specific entities that it finds in the submitted content, and it also uses human-created semantic rules to identify topics that may not be explicitly mentioned in the text at all.
For example, a story about a particular country music star can trigger the "Country music" subject, even if the word "music" does not appear in the story.
The tagging service analyzes each piece of submitted content and returns standardized names, relevance scores, IDs and other properties for all the relevant taxonomy terms that triggered a semantic rule. You can specify which taxonomy you want applied (AP's or IAB's) and which of the five areas of the AP News Taxonomy's coverage the service should return.
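On the receiving side, the returned tags can be narrowed further by relevance score and coverage area. The sketch below is one way to do that. The tag dictionaries, the key names (`name`, `id`, `area`, `relevance`), the area labels, and the numeric scale of the scores are all assumptions for illustration, not the service's actual output format.

```python
# Hypothetical tags, shaped loosely after the properties described above.
SAMPLE_TAGS = [
    {"name": "Country music", "id": "id-1", "area": "Subject", "relevance": 88},
    {"name": "Example Person", "id": "id-2", "area": "People", "relevance": 95},
    {"name": "Example Place", "id": "id-3", "area": "Geography", "relevance": 40},
]


def select_tags(tags, min_relevance=0, areas=None):
    """Return tags at or above min_relevance, optionally limited to some areas,
    sorted from most to least relevant."""
    chosen = [
        tag for tag in tags
        if tag["relevance"] >= min_relevance
        and (areas is None or tag["area"] in areas)
    ]
    return sorted(chosen, key=lambda tag: tag["relevance"], reverse=True)
```

For instance, `select_tags(SAMPLE_TAGS, min_relevance=50)` keeps the two higher-scoring tags, and adding `areas={"Subject"}` keeps only "Country music".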
The metadata output is enhanced with additional data properties. For example, companies are given a ticker, athletes are associated with their teams, and geographic locations get latitude and longitude data. You can also access the AP News Taxonomy or the IAB Tech Lab Content Taxonomy mapping for additional information about any given tag.
Additional tagging features will be added on an ongoing basis; please check back here for updates.
Subscribers can access the tagging service through an API, either by making synchronous calls or by submitting multiple documents at once and retrieving the tags at a later time. Content may be submitted as plain text or XML.
Tagging service data can be returned in Semantic Web-compatible formats, such as RDF/XML, RDF/TTL, n-triples or JSON-LD. It can also be returned in Simple XML format.
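The submit-now, retrieve-later workflow can be pictured with a small local stand-in. This is not the AP API: the class `InMemoryBatchTagger`, its `submit` and `retrieve` methods, the integer job IDs, and the toy tagging rule are all hypothetical, and nothing here makes a network call.

```python
import itertools


class InMemoryBatchTagger:
    """Local stand-in for a submit-now, retrieve-later workflow (hypothetical)."""

    def __init__(self, tag_function):
        self._tag_function = tag_function
        self._job_ids = itertools.count(1)
        self._pending = {}

    def submit(self, documents):
        """Store a batch of documents and return a job ID for later retrieval."""
        job_id = next(self._job_ids)
        self._pending[job_id] = list(documents)
        return job_id

    def retrieve(self, job_id):
        """Tag the stored batch and return one result per document, in order."""
        documents = self._pending.pop(job_id)
        return [self._tag_function(text) for text in documents]


def toy_tagger(text):
    """Toy rule for illustration only: tag any text that mentions a guitar."""
    return ["Music"] if "guitar" in text.lower() else []
```

A caller would hold on to the job ID returned by `submit` and pass it to `retrieve` later. The synchronous style corresponds to calling the tagging function directly on a single document.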
To learn more, refer to the Developer's Guide.