karavay has asked for the wisdom of the Perl Monks concerning the following question:

Dear monks I am writing a short script, which will perform a search in a bunch of a doc (MS Office) files and output the results - the question is , which module will best suited this purpose ...my guess it will be Win32::OLE are there any alternatives or OLE is my tool? Thanks...

Replies are listed 'Best First'.
Re: Doc file manipulation with perl
by erroneousBollock (Curate) on Sep 09, 2007 at 19:59 UTC
    Will there be a lot of documents? Will the search need to be fast? If so, you might consider KinoSearch or Lucene.

    Both systems build efficient searchable text-based indexes of the content you wish to search, and allow you to later perform searches on that index.

    In both systems you'll still need to have a way to export the documents as ascii text, so unless you can pre-arrange that ahead of time, perhaps you might use Win32::OLE to drive a "batch" export of the ascii versions.

    -David

      yes there will be alot of documents - I'll try out your suggestion, thanks...