in reply to To Read and Edit docx files in Windows 7

I didn't find much in the way of Perl modules for working with MS Word files.

Depending on just what needs to be done, maybe you could extract text from the document with Text::Extract::Word, edit the resulting text, then convert the text to HTML using HTML::FromText, saving the resulting HTML with a .doc extension. MS Word will read that without complaint.* Of course, the original formatting will be lost. (Though any formatting your Perl program puts in the HTML will be accepted by MS Word.)
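
The round trip described above can be sketched roughly as follows. This is a minimal sketch, not a tested solution: it assumes both CPAN modules are installed, that the input is a legacy .doc file, and that the edit is a simple substitution. The edit_text helper, the "ACME Corp" replacement, and the file names in the usage note are hypothetical placeholders.

```perl
#!/usr/bin/perl
use strict;
use warnings;
use Text::Extract::Word;   # CPAN module: extracts text from legacy .doc files
use HTML::FromText;        # CPAN module: exports text2html() by default

# Hypothetical edit step; replace the substitution with the change you need.
sub edit_text {
    my ($text) = @_;
    $text =~ s/\bACME Corp\b/Example Ltd/g;
    return $text;
}

# Extract the text of $in, edit it, and write the result to $out as HTML.
sub convert {
    my ($in, $out) = @_;

    my $text = Text::Extract::Word->new($in)->get_text();

    # paras => 1 wraps blank-line-separated blocks in <p> tags;
    # HTML metacharacters are escaped by default.
    my $html = text2html(edit_text($text), paras => 1);

    open my $fh, '>:encoding(UTF-8)', $out
        or die "Cannot write $out: $!";
    print {$fh} qq{<html><head><meta charset="utf-8"></head><body>\n},
                $html,
                qq{\n</body></html>\n};
    close $fh or die "Cannot close $out: $!";
    return;
}

# The output file contains HTML even though its name ends in .doc.
convert(@ARGV) if @ARGV == 2;
```

Saved as, say, doc2html.pl, it would be run as `perl doc2html.pl original.doc edited.doc`. Any markup you want Word to show (headings, bold, tables) has to be added to the HTML yourself, since only the plain text survives the extraction.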

(There is also Win32::Word::Writer, but it would be harder to use than what I suggested above.)

Alternatively, there are tools for converting docx files to ODF files, and several Perl modules for working with ODF files. Your program can then modify the ODF file, and MS Word can read the result. (Supposedly, MS Word can also export to ODF, assuming you can convince your users to do that.)

---

* I know this because, when I had a website, resume.doc was just a symbolic link to resume.html.

Re^2: To Read and Edit docx files in Windows 7
by DVCHAL (Novice) on Dec 11, 2014 at 07:07 UTC
    The module you referred to, Text::Extract::Word, only reads ".doc" files and does not work for ".docx" files. Are there any similar modules to convert docx to text?