Anonymous Monk has asked for the wisdom of the Perl Monks concerning the following question:

Sir,

Can anyone tell me how to read the contents of the following file types and extract information from them with a regex?
1. PDF file
2. Word file
3. Excel file

Thanks

Re: How to extract (regex) data from pdf, word, excel files
by kennethk (Abbot) on Feb 05, 2009 at 17:51 UTC
Re: How to extract (regex) data from pdf, word, excel files
by eff_i_g (Curate) on Feb 05, 2009 at 18:20 UTC
    For Excel, I like Spreadsheet::ParseExcel. For Word, I believe your only options are OLE or saving the document in another format and parsing that.
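
    A minimal sketch of the Spreadsheet::ParseExcel route, assuming the CPAN module is installed and the input is an older binary .xls file (the module does not read .xlsx). The helper names extract_matches and excel_cell_values, the script name, and the date pattern are made up for illustration; only the parser calls come from the module itself.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Return every match of $regex found in the given strings. If the regex
# has capture groups, the captures are returned instead of whole matches.
sub extract_matches {
    my ($regex, @texts) = @_;
    my @found;
    for my $text (@texts) {
        next unless defined $text;          # skip missing values
        push @found, $text =~ /$regex/g;    # list context: all matches
    }
    return @found;
}

# Collect the formatted value of every non-empty cell in an .xls workbook.
sub excel_cell_values {
    my ($path) = @_;
    require Spreadsheet::ParseExcel;    # CPAN module, loaded only when needed

    my $parser   = Spreadsheet::ParseExcel->new();
    my $workbook = $parser->parse($path)
        or die "Cannot parse $path: ", $parser->error(), "\n";

    my @values;
    for my $worksheet ($workbook->worksheets()) {
        my ($row_min, $row_max) = $worksheet->row_range();
        my ($col_min, $col_max) = $worksheet->col_range();
        for my $row ($row_min .. $row_max) {
            for my $col ($col_min .. $col_max) {
                my $cell = $worksheet->get_cell($row, $col) or next;
                push @values, $cell->value();
            }
        }
    }
    return @values;
}

# Hypothetical usage: perl extract.pl report.xls
if (@ARGV) {
    my $date_re = qr/\b(\d{4}-\d{2}-\d{2})\b/;    # illustrative pattern: ISO dates
    print "$_\n" for extract_matches($date_re, excel_cell_values($ARGV[0]));
}
```

    Keeping the regex step separate from the file reading means the same extract_matches call could be reused on text pulled from a PDF or Word document by whatever route you end up choosing.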
Re: How to extract (regex) data from pdf, word, excel files
by Anonymous Monk on Feb 06, 2009 at 09:32 UTC
    All,

    Thanks for the help.

    Regards