karavay has asked for the wisdom of the Perl Monks concerning the following question:

Dear monks, Came back, to seek for your wisdom :) , I am currently writing an application which will extract acii from doc files and save it in a separate text files through, which the user will be able to search. i've got as far as doc->txt (with win32::ole) and got stuck with txt->searchable interface could you advise, which module/s or technique will be the most suitable to perform this task (there are loads of files to process ,so speed is an issue) I've looked through Kinosearch, Serach-Indexer etc ... but could not find an "escape route". Platform – win32 Thanks,

Replies are listed 'Best First'.
Re: Doc->txt->searchable interface
by erroneousBollock (Curate) on Sep 16, 2007 at 04:02 UTC
    I've looked through Kinosearch, Serach-Indexer etc ... but could not find an "escape route".
    KinoSearch or Lucene sounds like exactly what you need.

    What exactly do you mean by an "escape route"? Have you tried anything since your previous request?

    You'll likely get a better response if you post the code you've tried so far. Please remember to follow the writeup formating tips (use <code> and <readmore> tags).

    -David