in reply to How to read the contents of PDF

I recommend CAM::PDF, you can see an example of text extraction in this script.
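
For anyone who just wants the gist, here is a minimal sketch of page-by-page text extraction with CAM::PDF. It is not necessarily how the linked script does it. The sub name pdf_page_texts and the command-line handling are my own illustrative choices, not part of the module; only new, numPages, getPageText and $CAM::PDF::errstr come from CAM::PDF itself.

```perl
use strict;
use warnings;
use CAM::PDF;

# Hypothetical helper (not part of CAM::PDF): returns the text of each
# page of $file as a list, one string per page.
sub pdf_page_texts {
    my ($file) = @_;

    # new() returns undef on failure and sets $CAM::PDF::errstr.
    my $pdf = CAM::PDF->new($file)
        or die "Could not open $file: $CAM::PDF::errstr\n";

    my @texts;
    for my $page (1 .. $pdf->numPages()) {    # pages are numbered from 1
        my $text = $pdf->getPageText($page);
        push @texts, defined $text ? $text : '';
    }
    return @texts;
}

# Command-line use: perl thisscript.pl file.pdf
if (@ARGV) {
    my $n = 0;
    for my $text (pdf_page_texts($ARGV[0])) {
        $n++;
        print "--- page $n ---\n$text\n";
    }
}
```

How good the extracted text is depends a lot on how the PDF was produced, so treat the output as a starting point.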

-stvn

Re^2: How to read the contents of PDF
by Anonymous Monk on Aug 29, 2013 at 08:46 UTC
    The link is not working.

      The link is broken because it points to one specific release, CAM::PDF 1.08. The current version (at the time of writing) is 1.60. A working link is not hard to find.

      Here is my attempt at a "permalink" that always points to the latest version:

      http://search.cpan.org/dist/CAM-PDF/bin/getpdftext.pl links to the version of the script in the latest distribution.