How does the program detect and analyse tables?
Here’s an overview of how the software works, which should give you a flavour of the complexity of the issues that need to be addressed. The software first looks for preformatted regions of text. It does this by spotting lines that are clearly formatted, looking for large amounts of white space and table-like characters such as ‘|’ and ‘+’. It may also look for code-like and diagram-like lines, according to the policies set. Each time a heavily formatted line is encountered, an attempt is made to extend the preformatted region by “rolling it out” to adjacent, less clearly formatted lines. This “roll out” process stops whenever it encounters a line that is clearly not part of the formatted region, such as a section heading or a run of multiple blank lines (the default is 2). Once a preformatted region is identified, it is analysed to determine whether it is a table, diagram, code sample or something else.
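The detection and "roll out" steps described above can be illustrated with a small Python sketch. This is not the program's actual code: the function names, the "three or more spaces" test for large white space, and the numbered-heading pattern are all assumptions made for the example, and the later step of deciding whether a region is a table, diagram or code sample is not covered.

```python
import re

# Assumptions for illustration only: what counts as "large white space",
# which characters are "table-like", and what a section heading looks like.
TABLE_CHARS = "|+"
WIDE_GAP = re.compile(r"\S {3,}\S")         # 3+ spaces between two words
HEADING = re.compile(r"^\d+(\.\d+)*\s+\S")  # e.g. "2.1 Results"


def is_clearly_formatted(line):
    """True for lines with table-like characters or a wide internal gap."""
    return any(ch in line for ch in TABLE_CHARS) or bool(WIDE_GAP.search(line))


def is_hard_stop(line):
    """True for lines that clearly end a region (here: a numbered heading)."""
    return bool(HEADING.match(line)) and not is_clearly_formatted(line)


def roll_out(lines, seed, step, lo, hi, max_blank):
    """Walk from seed in direction step (+1 or -1), staying within lo..hi.

    Returns the index of the furthest non-blank line absorbed into the region.
    """
    edge, blanks, i = seed, 0, seed + step
    while lo <= i <= hi:
        line = lines[i]
        if not line.strip():
            blanks += 1
            if blanks >= max_blank:  # a run of blank lines ends the region
                break
        elif is_hard_stop(line):
            break
        else:
            blanks = 0
            edge = i                 # absorb a less clearly formatted line
        i += step
    return edge


def find_preformatted_regions(lines, max_blank=2):
    """Return (start, end) index pairs, inclusive, of candidate regions."""
    regions = []
    next_free = 0  # first line not already claimed by an earlier region
    for i, line in enumerate(lines):
        if i < next_free or not is_clearly_formatted(line):
            continue
        last = len(lines) - 1
        start = roll_out(lines, i, -1, next_free, last, max_blank)
        end = roll_out(lines, i, +1, next_free, last, max_blank)
        regions.append((start, end))
        next_free = end + 1
    return regions


sample = [
    "1 Introduction",
    "",
    "",
    "Name      Qty",
    "----------------",
    "Apples      3",
    "Pears       7",
    "",
    "",
    "Closing prose here.",
]
print(find_preformatted_regions(sample))  # prints [(3, 6)]
```

In this sample the ruled line of dashes is not clearly formatted by the sketch's own tests, but it is absorbed because it sits between clearly formatted lines. The two blank lines on either side stop the roll out. Note that a sketch this simple would also absorb ordinary prose separated from a table by a single blank line; real software needs further checks to avoid that.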