Media Track Conversion is a division of Media Track dedicated to converting printed media content into structured, searchable text files. In most cases, the content arrives as PDF files, physical papers or microfilmed documents, provided by the client or obtained under agreements with publishing houses or libraries. As one of the fastest providers in the media industry, we ensure that your media files are converted quickly, allowing you to meet the demands and expectations of your clients.
- A unique, state-of-the-art software platform handles the entire end-to-end process and ensures top-quality output, from the article level down to the individual printed characters.
- File compression and Optical Character Recognition (OCR)
- An extensive dictionary-based text recognition system pinpoints potential problems, which are then checked manually.
- Advanced system logic and random selection are used to pull files for further manual inspection and correction.
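The dictionary check and random selection described above can be illustrated with a short sketch. This is not Media Track's actual platform: the function names, the `max_unknown` threshold and the `sample_rate` parameter are all assumptions made for illustration, and the "system logic" is reduced here to a simple count of words missing from a dictionary.

```python
import random


def flag_unknown_words(text, dictionary):
    """Return the words in OCR output that are not found in the dictionary."""
    words = [w.strip(".,;:!?\"'()").lower() for w in text.split()]
    return [w for w in words if w and w not in dictionary]


def select_for_inspection(files, dictionary, max_unknown=0, sample_rate=0.1, seed=None):
    """Choose files for manual review (hypothetical selection rule).

    A file is flagged when it has more than `max_unknown` unknown words.
    From the remaining files, a random sample is drawn as a spot check.
    """
    rng = random.Random(seed)
    flagged = [
        name for name, text in files.items()
        if len(flag_unknown_words(text, dictionary)) > max_unknown
    ]
    rest = [name for name in files if name not in flagged]
    # Sample at least one unflagged file whenever any exist.
    sample_size = min(len(rest), max(1, round(len(rest) * sample_rate))) if rest else 0
    sampled = rng.sample(rest, sample_size)
    return flagged, sampled


if __name__ == "__main__":
    dictionary = {"the", "quick", "brown", "fox"}
    files = {
        "a.txt": "The quick fox",
        "b.txt": "The qu1ck fox",  # simulated OCR error
        "c.txt": "brown fox",
    }
    flagged, sampled = select_for_inspection(files, dictionary, seed=0)
    print("flagged:", flagged)
    print("random spot check:", sampled)
```

A real pipeline would likely use per-language dictionaries and OCR confidence scores rather than a plain word lookup.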
We help you increase the value of your content by digitizing each article, appropriately tagging its different elements and delivering the result as an XML file. In addition, our XML files can contain supporting information about the article context, such as media source information, page number and language.
- The choice is yours: our format is 100% customizable.
|Language||Language of media source|
|Name||Name of media source|
|ID||6-digit identifier for the media source|
|Doc Date||Media source publishing date|
|Physical Position||Page number of article|
|Logical Position||Section name|
|Sub-headline||Smaller headline-like text, typically positioned below the headline|
|Pre-introduction||Text preceding the introduction that acts as a topic or key term|
|Story||Article body text|
|Blurb||Short text set apart from the normal layout, intended to catch the reader's attention, e.g. encircled text next to an image, typically part of the tabloid layout style|
|Quote||Quote pulled out from the story text, typically in large font size|
|Fact box||Text box containing supplementary information, e.g. background information|
|Teaser||Page reference, typically used on the front page|
|Byline||Journalist name, email address, phone number|
|Image||Image caption, photographer name, email address, phone number, image rights holder|
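As an illustration of how the fields above might map to an XML file, here is a minimal sketch using Python's standard `xml.etree.ElementTree` module. The element names (`article`, `source`, `doc_date` and so on) and all sample values are hypothetical; the delivered format is customizable and its actual schema is not specified here.

```python
import xml.etree.ElementTree as ET


def build_article_xml(source, article):
    """Build an XML string with source metadata and tagged article elements.

    `source` and `article` are dicts mapping hypothetical element names
    to their text content.
    """
    root = ET.Element("article")
    src = ET.SubElement(root, "source")
    for tag, value in source.items():
        ET.SubElement(src, tag).text = value
    for tag, value in article.items():
        ET.SubElement(root, tag).text = value
    return ET.tostring(root, encoding="unicode")


if __name__ == "__main__":
    # All values below are made-up placeholders.
    source = {
        "language": "en",
        "name": "Example Daily",
        "id": "123456",
        "doc_date": "2020-01-01",
        "physical_position": "4",
        "logical_position": "Business",
    }
    article = {
        "sub_headline": "Placeholder sub-headline",
        "story": "Placeholder article body text.",
        "byline": "Placeholder Journalist",
    }
    print(build_article_xml(source, article))
```

Grouping the media source fields under one parent element is only one possible layout; a flat structure or attributes on the root element would work equally well.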