In reply to Re^2: Extracting text from pptx, in thread Extracting text from pptx
Here's a one-liner for *nix that prints the text of every slide and notes page:
unzip -p <filename>.pptx 'ppt/slides/*' 'ppt/notesSlides/*' | perl -wne 'while ( /<a:t>(.*?)<\/a:t>.*?(?=(<a:t>|<\/p:txBody>))/g ) { print $1; print "\n" if $2 eq "</p:txBody>" }'