in reply to Parse PDF to text

My experience has been that when you need to parse the document, the pdftotext utility does the best job of preserving the layout of the original. YMMV.

Update: I have not tried "poppler" mentioned below. I downloaded it, tried to compile it (and failed), and don't have time ATM to mess with compiling issues :-(