in reply to PDF Modules Seeking Recommendations
I like this approach because it gives me a bunch of text box strings with their bounding box coordinates, which I then sort by location. This is important for me because the documents that I parse tend to have an irregular 'document order.'
I have also found pdf tips and tricks on the mostly commercial http://www.pdfzone.com site.
|
|---|