As an alternative to parsing the PDF file, if you just need some values from the document, and don't need to modify the original PDF, try running 'pdf2ascii' on the file, and then parsing the resulting plain text.
Most pdf files have the actual text, not an image of the text as suggested above. (I'm not saying it isn't possible, just unlikely.)