in reply to PDF::API2 to search for text and place hyperlinks in PDF file
I have used a freeware tool called "a-pdf text extractor" from a-pdf.com. It extracts the text together with its position.
Then I slurp in the resulting text file, remove duplicate entries, and extract the lines I'm interested in. Using a more advanced version of the code above, I can put links on some 50% of the pages in the PDF file. To get close to 100%, I had to patch one line of PDF::API2's source code.
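For anyone wanting to try the same approach, here is a rough sketch of the "remove duplicates and keep the interesting lines" step. It is not my actual code. The input layout (page, x, y, text separated by tabs) is purely an assumption for illustration, since the real extractor output may look different, and `parse_positions` is a made-up helper name. It reads line by line rather than slurping, which gives the same result for this purpose.

```perl
use strict;
use warnings;

# ASSUMPTION: each line of the extractor's output looks like
#   page<TAB>x<TAB>y<TAB>text
# The real layout may differ; adjust the split accordingly.
sub parse_positions {
    my ($fh, $wanted) = @_;    # $wanted: compiled regex selecting lines of interest
    my (%seen, @hits);
    while (my $line = <$fh>) {
        chomp $line;
        next if $line =~ /^\s*$/;    # skip blank lines
        next if $seen{$line}++;      # drop duplicate entries
        my ($page, $x, $y, $text) = split /\t/, $line, 4;
        next unless defined $text && $text =~ $wanted;
        push @hits, { page => $page, x => $x, y => $y, text => $text };
    }
    return @hits;
}

# Tiny in-memory sample standing in for the extracted text file.
my $sample = join "\n",
    "1\t72\t700\tSee section 2.1",
    "1\t72\t700\tSee section 2.1",    # duplicate, will be dropped
    "2\t90\t650\tIntroduction",
    "3\t72\t500\tSee section 4.3",
    "";

open my $fh, '<', \$sample or die "cannot open sample: $!";
my @hits = parse_positions($fh, qr/^See section/);
close $fh;

printf "page %d at (%s, %s): %s\n", @{$_}{qw(page x y text)} for @hits;
```

Each surviving record carries a page number and a position, which is what is needed to work out where a link should go on that page.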
Re^2: PDF::API2 to search for text in PDF file
by Anonymous Monk on May 19, 2016 at 21:31 UTC