in reply to Re^3: Extracting text from pptx
in thread Extracting text from pptx

Here's a one-liner for *nix

unzip -lp <filename>.pptx ppt/slides/* ppt/notesSlides/* | perl -wne' +while ( /<a:t>(.*?)<\/a:t>.*?(?=(<a:t>|<\/p:txBody>))/g ) {print "$1" + and print "\n" if $2 eq "<\/p:txBody>"}'