in reply to how to extract text from PDF
I've used it since late 2002, and the only problems I've had arise from one particular organization (a government agency, go figure) making changes that were not obvious, and doing odd, possibly nonstandard things with their formatting.
pdftotext works well, but you have to watch your source of the data, particularly if that source isn't trustworthy. Although, come to think of it, that's true in all areas of my job.
--
tbone1, YAPS (Yet Another Perl Schlub)
And remember, if he succeeds, so what.
- Chick McGee
|
|---|