in reply to how to extract text from PDF

Hi,

Do a Super Search of this topic, it has been covered before.
pdftotext is a non Perl way to extract text from PDF files which you could call from a Perl script.
Have a look at the modules on Cpan and see if any of them fit your requirements.

Martin