|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Create and Manage Profile
What is a profile?A content ingest profile specifies the settings for downloading content. When you create a profile, you select entitlements for which you wish to download content and choose content file types and formats; for example, stories in NITF format or specific video renditions. You can also edit other settings; for example, the directory and folder structure for saving content. How do I create a profile?
How do I select content?
Can I select individual products included in a package?Yes. On the Select Content tab, click the Packages tab, and then click See Products in Package next to the package name to view all products included in a package. To select individual products in a package, select the check box next to each product that you want to download. How do I select content file types and formats?Click the Content Options tab and select the options under Files to choose the file types and formats for downloading content; for example, specific video renditions or NITF-formatted stories. In addition to saving content files, you can also choose to download metadata for individual content items or the entire feed.
MetadataDownloading Metadata Files
Content metadata is returned in the JSON format and/or as XML in the NewsML-G2 format. For more information, see Content Metadata Fields. Downloading metadata files is optional. Typically, you will need to write a custom script to post-process the downloaded metadata files.
JSON: Feed FilesSelect this option to save feed files in JSON format (multiple content items are included in each feed file). By default, feed files are saved in the <Agent
Directory>/feeds folder. The file name format is feed_<yyyyMMdd_HHmmss>_seq_<sequenceNumber>.json
where JSON: Item FilesSelect this option to save item metadata files in JSON format (one content item per file). By default, the item metadata files are saved below the <Agent Directory>/content directory. The default file name format is <ItemID>--<Version>--<ContentID>-item.json. If you are downloading video files and select the <OriginalFileName>--<PartialContentID>.<ext> option on the Content Save Options tab in conjunction with the JSON: Item Files option, the original file name (if available for the video rendition) will be applied to the JSON item metadata files. For example, if the originalfilename value for the video rendition is 4329116_Portugal Virus UK Travel_0_Preview.mp4, the JSON item metadata file name would be 4329116_Portugal Virus UK Travel_0-item.json. JSON: Embed NITF TextIf you are running the latest version of the agent and have selected to download JSON item files, you can select the JSON:Embed NITF Text option to embed the following in the NITF format at the end of the JSON item files:
{// JSON Example: Embedded NITF Caption "nitf_caption": "<?xml version=\"1.0\" encoding=\"utf-8\"?><nitf version=\"-//IPTC//DTD NITF 3.4//EN\" change.date=\"October 18, 2006\" change.time=\"19:30\"><head><docdata><doc-id regsrc=\"AP\"/><date.issue norm=\"20200211T161026Z\"/><\/docdata><\/head> <body><body.head><hedline><hl1 id=\"headline\">United Nations Palestinians<\/hl1><\/hedline><distributor>The Associated Press<\/distributor><\/body.head><body.content><block id=\"Caption\"><p>Palestinian President Mahmoud Abbas speaks during a Security Council meeting at United Nations headquarters, Tuesday, Feb. 11, 2020. (AP Photo/Seth Wenig)<\/p><\/block><\/body.content><\/body><\/nitf>"} XML: Item Files (NewsML-G2)If you are running the latest version of the agent, you can select this option to save item metadata files as XML in the NewsML-G2 format (one content item per file). By default, the item metadata files are saved below the <Agent Directory>/content directory. The default file name format is <ItemID>--<Version>--<ContentID>-item.xml. If you are downloading video files and select the <OriginalFileName>--<PartialContentID>.<ext> option on the Content Save Options tab in conjunction with the XML: Item Files (NewsML-G2) option, the original file name (if available for the video rendition) will be applied to the NewsML-G2 item metadata files. For example, if the originalfilename value for the video rendition is 4329116_Portugal Virus UK Travel_0_Preview.mp4, the NewsML-G2 item metadata file name would be 4329116_Portugal Virus UK Travel_0-item.xml. XML: Embed NITF TextIf you are running the latest version of the agent and have selected to download NewsML-G2 XML item files, you can also select the XML: Embed NITF Text option to embed the following NITF-formatted renditions in the <inlineXML> elements in the NewsML-G2 item files: story body (for text), captions (for pictures, graphics and video) and scripts and/or shotlists (for video). The <inlineXML> elements are embedded in addition to the <remoteContent> elements that contain links to these renditions.
NewsML-G2 Example: Embedded NITF Caption <inlineXML contenttype="application/nitf+xml" wordcount="202"> <nitf version="-//IPTC//DTD NITF 3.4//EN" xmlns="http://iptc.org/std/NITF/2006-10-18/"><body> <body.head> <headline> <hl1>Police in Cincinnati are asking the public for help in identifying the man who poured red paint on the block-long “Black Lives Matter” mural in front of city hall</hl1> </headline> </body.head> <body.content> <block> <p>CINCINNATI (AP) — Police in Cincinnati are asking the public for help in identifying the man who poured red paint on the block-long “Black Lives Matter” mural in front of city hall.</p> ... </block> </body.content> </body> </nitf> </inlineXML> Associated Item FilesSelect this option if you are interested in downloading metadata for any of the following associated items:
The format of the associated item files (JSON and/or NewsML-G2 XML) will match the selected format of the metadata files. For example, if you select JSON: Item Files and Associated Item Files, the associated item files will be downloaded in the JSON format, and if you select XML: Item Files (NewsML-G2) and Associated Item Files, the associated item files will be downloaded in the NewsML-G2 XML format. By default, the associated item metadata files are saved below the <Agent Directory>/content directory. The file name format of the JSON associated item files is <ItemID>--<Version>--<ContentID>-associated_item.json. The file name format of the NewsML-G2 XML associated item files <ItemID>--<Version>--<ContentID>-associated_item.xml. TextText renditions (renditions of textual content) are the NITF version of a story and textual descriptions of the media (for example, an image caption or a video script). Learn more >> NITF: Story BodySelect this option to save NITF-formatted stories. If available, inline links are always included in NITF-formatted stories; for example:
NITF: CaptionSelect this option to save captions for pictures, graphics and video, including linked media captions. The default file name format is <ItemID>--<Version>--<ContentID>_Caption.xml for captions of the pictures, graphics and video included in the feed and <ItemID>--<Version>--<ContentID>_Caption-associated_item.xml for linked media captions. NITF: ScriptSelect this option to save video scripts, including the scripts of linked video. The default file name format is <ItemID>--<Version>--<ContentID>_Script.xml for scripts of the video included in the feed and <ItemID>--<Version>--<ContentID>_Script-associated_item.xml for linked video scripts. NITF: ShotlistSelect this option to save video shotlists, including the shotlists of linked video.
When a separate shotlist is available, the default file name format is <ItemID>--<Version>--<ContentID>_Shotlist.xml for shotlists of the video included in the feed and <ItemID>--<Version>--<ContentID>_Shotlist-associated_item.xml for linked video shotlists. Media RenditionsMedia renditions (renditions of non-textual content) are alternative versions of the same content item based on the format, size and/or other content characteristics, such as video frame rate and aspect ratio. For example, media renditions of a picture or graphic are the high-resolution, preview and thumbnail sizes of an image. Learn More >> Select the check boxes for the media renditions (pictures, graphics, audio and video) that you want to download.
Are all media renditions available for all entitlements?No, all renditions may not be available for certain entitlements. Although the AP offers a wide variety of potential media renditions, their availability for any given entitlement or content item is not guaranteed. Therefore, selecting certain renditions does not guarantee that you receive all of them; it only ensures that if the rendition is available, it is not filtered out. For example, you can select all video renditions, but the available video for a given entitlement or feed may include only MP4 files. What are Content Save Options?Video Tutorial
Agent DirectoryBy default, the agent saves content in the <Agent Directory>/content folder (<Agent Directory> is the folder where the agent jar file is saved); for example: C:\Agent\content. To save files in a different folder:
Content File Naming Format
Content Folder StructureOne folder for all contentAll content is saved to the folder used for saving downloaded content (for example, to C:), and no subfolders are created: One folder per entitlementOne folder is created per product, package or Followed Topic, but no subfolders are created for individual content item versions; for example: One folder per entitlement and one subfolder per version
The agent creates one folder per entitlement below the folder used for saving downloaded content (for example, under C:) and subfolders for individual versions in each entitlement folder; for example:
For AP Top Headlines, the agent creates one folder per entitlement (for example, AP Online Top General Headlines) below the folder used for saving downloaded content (for example, under C:) and subfolders for individual versions of each Top Headline story in each entitlement folder:
Duplicate SettingsWhat are duplicates?Duplicate content is content that has been ingested more than once within a 24-hour period (the standard news cycle). Duplicate content may be delivered for a variety of reasons; for example:
How do I filter out duplicates?By default, the agent does not ingest duplicate content. How can I save duplicates?On the Content Save Options tab, scroll down to the Duplicate Settings section and select one of these options:
Content DeletionDeleting downloaded content files automatically after a certain time from the directory where they are saved cleans up older files, preventing them from exhausting disk space. When content file deletion is enabled, the agent deletes content, log, item metadata and JSON feed files according to the specified settings.
Does the agent automatically delete older files?No. The agent does not delete older files by default. How can I enable content file deletion?On the Content Save Options tab, select one of these options under Content Deletion :
Customize Entitlement SettingsHow do I edit entitlement settings?You can edit settings for one or more individual entitlements that you selected on the Select Content tab.
How do I restore default settings?To restore the default values for all settings in this entitlement, click Restore Default Settings at the bottom of the Entitlement Settings page. Will editing default content settings affect customized entitlements?No. If you later edit the default content settings, the customized entitlements will not be affected. The changes will apply only to entitlements that you have not customized. How do I know which entitlements I have customized?On the Select Content tab, the entitlements for which the settings have been modified are marked by a red warning icon next to the Edit Settings link: Manage ProfilesHow do I edit a profile?
How do I delete a profile?
Export and Import ProfilesCan I export a profile?Yes. If you would like to save a copy of your configuration profile to your machine (for example, as a backup copy), you can export the profile configuration settings as a JSON file. To export a profile:
How do I import a profile?If you have previously exported a configuration profile from the dashboard to a JSON file and saved it to your machine, you can import it to the dashboard. To import a profile:
Default ProfileHow do I set a default profile?You can set a default profile to have all newly installed agents start ingesting content automatically with the same settings, which are specified in that default profile.
To set a default profile:
How can I unassign a default profile?
To unassign a default profile:
|