in reply to extract text from pdf

CAM::PDF was recommended in the earlier thread How to read the contents of PDF

Update: I tried a few things with this module. It works well with some PDF files but fails in various ways on others. In particular, I couldn't get it to work with a few simple PDF files I created and exported from OpenOffice. The module comes with a small script named getpdftext.pl that may help you. Cheers.
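
If you'd rather call the module directly than go through getpdftext.pl, something along these lines might be a starting point. It is only a minimal sketch: the script layout and the placeholder message for pages that give no text are my own, and as noted above it may well produce nothing useful for some files.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use CAM::PDF;

# The PDF file name comes from the command line (an assumption of this sketch).
my $file = shift @ARGV or die "usage: $0 file.pdf\n";

# new() returns undef on failure and leaves a message in $CAM::PDF::errstr.
my $pdf = CAM::PDF->new($file)
    or die "Could not open $file: $CAM::PDF::errstr\n";

# Pages are numbered starting at 1.
for my $page (1 .. $pdf->numPages()) {
    # Wrap the call in eval so one bad page does not abort the whole run.
    my $text = eval { $pdf->getPageText($page) };
    if (defined $text && length $text) {
        print "--- page $page ---\n$text\n";
    }
    else {
        print "--- page $page: no text extracted ---\n";
    }
}
```

Run it as `perl script.pl file.pdf`. If a page comes back empty, that matches the kind of failure I saw with the OpenOffice exports, so try another file before assuming the script is wrong.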