Does PDFNet support extracting table and list info from PDFs?

April 26, 2017

Tags: extracting, info, list, PDFNet, PDFs, support, Table

1 Answer

PDFNet supports extraction of all content available in a PDF document. However, the PDF standard does not directly represent abstract constructs such as paragraphs, columns, tables, or lists. Because this logical structure is usually missing from the document, the target application has to analyze the underlying content exposed by PDFNet (for example, text fragments and their positions) and build the logical structure itself. Note that the PDF standard does support marked content and so-called "tagged PDF", and PDFNet can be used to extract marked content and any logical structure that is present. Unfortunately, many PDF files lack tags and logical structure.
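
To illustrate the kind of analysis the application would have to do on its own, here is a minimal Python sketch that rebuilds a simple table from positioned text fragments. It does not use the PDFNet API: the `Word` type, the function names, and the tolerance values are all hypothetical, and the sketch assumes the fragments and their coordinates have already been extracted by some other means. It also assumes each cell is a single left-aligned fragment, which real documents often violate.

```python
from dataclasses import dataclass


@dataclass
class Word:
    """Hypothetical positioned text fragment (not a PDFNet type)."""
    text: str
    x: float  # left edge, in page units
    y: float  # baseline, in page units (PDF y grows upward)


def group_into_rows(words, y_tolerance=2.0):
    """Cluster words whose baselines lie close together into rows."""
    rows = []
    # Visit words top to bottom (larger y first).
    for word in sorted(words, key=lambda w: -w.y):
        if rows and abs(rows[-1][0].y - word.y) <= y_tolerance:
            rows[-1].append(word)
        else:
            rows.append([word])
    # Order each row left to right.
    return [sorted(row, key=lambda w: w.x) for row in rows]


def infer_columns(words, x_tolerance=5.0):
    """Guess column start positions from clusters of left edges."""
    starts = []
    for x in sorted(w.x for w in words):
        if not starts or x - starts[-1] > x_tolerance:
            starts.append(x)
    return starts


def build_table(words, y_tolerance=2.0, x_tolerance=5.0):
    """Return a list of rows, each a list of cell strings."""
    columns = infer_columns(words, x_tolerance)
    table = []
    for row in group_into_rows(words, y_tolerance):
        cells = [""] * len(columns)
        for word in row:
            # Put the word in the column whose start is nearest its left edge.
            index = min(range(len(columns)), key=lambda i: abs(columns[i] - word.x))
            cells[index] = (cells[index] + " " + word.text).strip()
        table.append(cells)
    return table


# Made-up sample data standing in for extracted fragments.
sample = [
    Word("Name", 10, 100), Word("Qty", 80, 100.5),
    Word("Apple", 10, 88), Word("3", 81, 88),
]
print(build_table(sample))  # [['Name', 'Qty'], ['Apple', '3']]
```

A heuristic like this only works for clean, grid-like layouts. When a file is tagged, reading the existing logical structure is generally more reliable than inferring it from positions.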