in reply to Extracting text from pptx

What part of the linked program do you have problems with?

Replies are listed 'Best First'.
Re^2: Extracting text from pptx
by welle (Beadle) on Mar 08, 2013 at 18:46 UTC

    Please, consider that I'm a novice with Perl

    I just can't understand the whole script: where is the unzipping part? If I run the script (on Windows) I simply get an error message (System not able to find the directory + Failed to extract required information from <file>). It must - I guess - with the script setting:

    my $unzip = "/usr/bin/unzip"; to do

      So, what have you done to find out where $unzip is then used?

      Also, the error message would suggest to me that somewhere, the program expects some other program, possibly unzip.exe to exist. What have you done to find out whether that is really the case?

      Likely an unzip utility can be found in the unxutils package.

      that looks like an Unix comand!
      (ok, that IS an unix path to a program, you'll want to find the call for something like 7zip or the likes and replace  "/usr/bin/unzip/ it with that).
      J -

        Here's a one-liner for *nix

        unzip -lp <filename>.pptx ppt/slides/* ppt/notesSlides/* | perl -wne' +while ( /<a:t>(.*?)<\/a:t>.*?(?=(<a:t>|<\/p:txBody>))/g ) {print "$1" + and print "\n" if $2 eq "<\/p:txBody>"}'