XML / SGML

Extensible markup language (XML) is the universal format for structured as well as unstructured content on the Web and is a subset of SGML . We can convert media rich, unstructured paper, microfilm or scanned documents to powerful XML files. We use and modify a variety of third party encoding standards such as TEI (Text Encoding Initiative) or Docbooks and develop other proprietary content management tools to create XML database. We have extensive experience in encoding characters sets in many languages.

Our services include:

  • Analyzing Your data.
  • Developing/modifying DTD/Schema
  • Writing conversion specifications
  • Pre - Migration data clean-up.
  • OCR / Data Capture
  • SGML/XML encoding
  • Proofing,
  • Validation and QC Audits
  • XML FO transformation
  •