Important Notice: Our web hosting provider recently started charging us for additional visits, which was unexpected. In response, we're seeking donations. Depending on the situation, we may explore different monetization options for our Community and Expert Contributors. It's crucial to provide more returns for their expertise and offer more Expert Validated Answers or AI Validated Answers. Learn more about our hosting issue here.

Is it possible to parse an existing PDF-document and convert it to another format (HTML, DOC, EXCEL)?

April 26, 2017convert DOC Excel existing format html parse possible

0

Posted

Is it possible to parse an existing PDF-document and convert it to another format (HTML, DOC, EXCEL)?

2 Answers

0

Alex Krenvalk Posted

To my mind one of the best solutions for this issue might be tool which I found out at the Inet and it worked out my similar problems quite simply – pdf recovery.

0

Posted

No, the pdf format is just a canvas where text and graphics are placed without any structure information. As such there aren’t any ‘iText-objects’ in a PDF file. For instance: you can’t retrieve a table object from a PDF file. Tables are formed by placing text and lines at selected places. So Im still looking for some way to maybe read a line in a PDF doc, and parse it. Any sugestions to what other PDF parsing packages I could use, would be very apreciated.