OUR SERVICES
Convert and transform information into structured formats: XML, S1000D, DITA, SPL, proprietary schemas, and others.
Independent review of previously converted content. Provide third-party validation and peace of mind.
Enrich content with new or inferred metadata to improve the utility, discovery, and interoperability of content.
Analyze large document collections to identify content reuse across multiple documents and source formats.
Extract free-form text from textual and form-based documents, PDFs, and other formats, then generate target XML schema.
Submit structured content to platforms such as PubMed, Silverchair, HighWire, and more.
Website harvesting and AI transformations that scan HTML, PDF, Excel, and other formats, then deliver structured data to your systems.
DCL can automate the creation of training sets and structure data sets to support your AI and machine learning projects.

THE LATEST FROM DCL
The Legal Fine Lines of Fair Use and Generative AI
Training sets for large language models (LLMs) are changing how we think about copyright, but the law hasn’t quite caught up yet. In this sharp, timely webinar, legal experts unpack the evolving landscape of fair use as it applies to generative AI: What counts as truly “transformative”?
The Logic Behind the Labeling: Answers to Your SPL and SPM Questions
SPL, SPM… two acronyms that carry a lot of weight in the world of regulatory compliance. Whether you're just getting started or have experience navigating these structures, this webinar is packed with insights, clarifications, and expert answers to both common and uncommon questions.
From Submission to Structured XML: Streamlining Editorial Efficiencies at The BMJ
BMJ Group has long been a leader in medical publishing, known for rigorous peer review, global reach, and trusted reputation built on exceptional publishing expertise. The BMJ is ranked among the top medical journals globally and is valued for its editorial integrity and innovation. DCL recently announced that The BMJ has implemented Content Crystallizer, a solution designed to transform manuscripts from Word documents into structured XML through a configurable, automated, and...
"Control Your Data Before It Controls You": A Tagline for the Times
It was 2007. The iPhone had just been released, Facebook was still mostly for college students, and the term “cloud computing” was just starting to gain traction.
At Data Conversion Laboratory (DCL), we decided to have a little fun with a serious purpose: an internal contest to come up with a new company tagline. Our team members scribbled ideas on notepads, dropped slips into a box, and debated the merits of each over coffee.
The winning entry came from Nir Dayan and the tagline was...
Your Alt Text Strategy Is About to Be Tested—Is It Scalable?
The clock is ticking for publishers across the European Union. By June 28, 2025, the European Accessibility Act (EAA) will require digital content (ebooks, online publications, educational materials, and more) to meet stringent accessibility standards. At the heart of these requirements is a deceptively complex task: providing accurate, meaningful alt text for images.















