skip to Main Content

Media Track Conversion is a division of Media Track dedicated to convert printed media contents such as PDF files, Physical Papers and Microfilmed Documents into structured and searchable text files.

dots Media Conversion

Media Track conversion is a division of Media Track dedicated to convert printed media contents into structured and searchable text files. In most cases, the contents arrive as PDF files, Physical Papers or Microfilmed Documents provided by the client or according to agreements with publishing houses or libraries. Being one of the fastest in the media industry, we assure that your media files are converted quickly allowing you to meet the demands and expectations of your clients.

  • Unique, state of art software platform handles the entire end to end process and ensures a top of the lot quality product from the article level to that of the individual printed characters.
  • File compression and Optical Character Recognition (OCR)
  • Extensive dictionary based, text recognition system which can pinpoint potential problems; after which will be manually checked.
  • Advanced system logic as well as random selection is used to take out files for further manual inspection and correction.

dots Tagsonomy

We help you increase your content value by digitizing and appropriately tagging different article elements in each article and deliver as XML file.  In addition, our XML files can contain supporting information about the article context in terms of media source information, page number, language etc.
– The Choice is Yours as our format is 100% Customizable.

XML TagDescription
LanguageLanguage of media source
NameName of media source
ID6 digit identifier for the media source
Doc DateMedia source publishing date
Physical PositionPage number of article
Logical PositionSection name
HeadlineArticle headline
Sub-headlineSmaller headline like text, typically positioned below headline
Pre-introductionText to precede introduction, text that acts as a topic or key terminology
IntroductionArticle introduction
StoryArticle body text
BlurbShort text that is different from the normal layout, intended to catch the reader
I.e. Encircled text next to an image, typically part of the tabloid layout style
QuoteQuote pulled out from the story text, typically in large font size
Fact boxText box containing information Eg. Background Information
TeaserPage reference, typically used in front page
BylineJournalist name, email address, phone number
ImageImage caption, Photographer name, email, phone number, image rights, holder
Back To Top