Hello Monks,
I have a bunch of PDFs that were originally created in MS Word, printed, scanned and saved in PDF format. Now I need to run through those files, parse their text and single out all those files that fit some regexp. What is the best way to do it?
Thanks for your help,