Can it convert documents automatically from Word/ODT to XML and if so, what sort of heuristics are used?

Yes. Automatic, hands-free conversion is what Lemon8-XML is designed for. The approach is loosely based on looking for visual “markers” within a document, for example a section title that is larger than the surrounding text and bolded, a list of references at the end of the document, or a caption immediately before or after an embedded figure. Although the parsers are far from perfect, they have been developed against dozens of scholarly articles and continue to be improved.
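
To make the idea of visual markers concrete, here is a minimal Python sketch of that kind of heuristic. It is not Lemon8-XML's actual code. The `Paragraph` type, the `classify` function, the label names, the list of reference headings, and the caption pattern are all assumptions made up for illustration, and a real parser would first have to extract font sizes and figure positions from the Word/ODT file.

```python
import re
from dataclasses import dataclass
from statistics import median


@dataclass
class Paragraph:
    """Hypothetical, already-extracted paragraph with basic formatting."""
    text: str
    font_size: float
    bold: bool = False
    is_figure: bool = False  # True for an embedded image placeholder


# Assumed marker vocabularies; a real parser would need far more variants.
REFERENCE_HEADINGS = {"references", "bibliography", "works cited"}
CAPTION_PATTERN = re.compile(r"^(figure|fig\.|table)\s+\d+", re.IGNORECASE)


def classify(paragraphs):
    """Label each paragraph using simple visual markers."""
    # Treat the median font size of the text paragraphs as the body size.
    sizes = [p.font_size for p in paragraphs if not p.is_figure]
    body_size = median(sizes) if sizes else 0.0

    labels = []
    in_references = False
    for i, p in enumerate(paragraphs):
        if p.is_figure:
            labels.append("figure")
            continue

        # Marker: the paragraph sits directly before or after a figure.
        near_figure = any(
            0 <= j < len(paragraphs) and paragraphs[j].is_figure
            for j in (i - 1, i + 1)
        )
        # Marker: larger than the surrounding text and bolded.
        is_heading = p.bold and p.font_size > body_size

        if is_heading:
            # A heading such as "References" opens the reference list.
            in_references = p.text.strip().lower() in REFERENCE_HEADINGS
            labels.append("section-title")
        elif near_figure and CAPTION_PATTERN.match(p.text.strip()):
            labels.append("caption")
        elif in_references:
            labels.append("reference")
        else:
            labels.append("body")
    return labels
```

Each rule here is a guess based on formatting alone, so a bold, oversized pull quote would be mislabeled as a section title. That kind of error is why heuristic parsers like this remain imperfect and need tuning against real articles.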
