in reply to Re: How to Extract PDF tables using Perl
in thread How to Extract PDF tables using Perl

See my post here for an example that uses the pdftohtml.exe program LanX is referring to

One caveat though: as LanX mentioned in his link, pdftohtml, under certain circumstances, may not break a tabular line up into its individual columns. Unfortunately this sort of thing is really dependent on the internal structure, version, content, and layout of the PDF. The perils of using a display format as data...

  • Comment on Re^2: How to Extract PDF tables using Perl

Replies are listed 'Best First'.
Re^3: How to Extract PDF tables using Perl
by LanX (Saint) on May 11, 2016 at 15:41 UTC
    Another point is that lines for borders will not be represented by pdftohtml, you have to go by text position only.

    Cheers Rolf
    (addicted to the Perl Programming Language and ☆☆☆☆ :)
    Je suis Charlie!