| [reply] |
| [reply] |
I haven't tried them yet but PDF and PDF::Parse APIs do not have the capability to search within a PDF file. The documentation shows that only the PDF doc properties can be retrived. I plan to try it out tonite anyways.
Thanks for your help.
-Sagar
| [reply] |
Have you tried grep? Strings, if I remember correctly, are stored as plain-text within the PDF format ...
Being right, does not endow the right to be rude; politeness costs nothing. Being unknowing, is not the same as being stupid. Expressing a contrary opinion, whether to the individual or the group, is more often a sign of deeper thought than of cantankerous belligerence. Do not mistake your goals as the only goals; your opinion as the only opinion; your confidence as correctness. Saying you know better is not the same as explaining you know better.
| [reply] |
Much of what I see in PDF files is enclosed in 'stream' blocks, which appears to be a compression encoding. grep won't do it. When I am forced to do this myself, I sure hope one of the above mentioned modules or other will take care of pulling out the text I need to look at. (Oh, I'm not looking forward to this!)
| [reply] |
Could you update us on your progress?
| [reply] |