I like this approach because it gives me a bunch of text box strings with their bounding box coordinates, which I then sort by location. This is important for me because the documents that I parse tend to have an irregular 'document order.'
I have also found pdf tips and tricks on the mostly commercial http://www.pdfzone.com site.
In reply to Re: PDF Modules Seeking Recommendations
by toma
in thread PDF Modules Seeking Recommendations
by mitd
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |