hungrystarfish has asked for the wisdom of the Perl Monks concerning the following question:
I'm trying to find a way to extract the body text from PDF documents. The modules on CPAN (specifically PDF) only seem to be able to get the document meta data not the actual text itself. Is this correct? Has anyone actually used it? Are there any other modules that can do what I want?
Any help you can give this Perl novice would be gratefully received.
Cheers!
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re: PDF Parser
by Courage (Parson) on Jul 04, 2002 at 17:17 UTC | |
|
Re: PDF Parser
by amphiplex (Monk) on Jul 04, 2002 at 16:48 UTC | |
|
Re: PDF Parser
by traveler (Parson) on Jul 04, 2002 at 18:25 UTC | |
|
Re: PDF Parser
by hungrystarfish (Initiate) on Jul 05, 2002 at 08:25 UTC |