Did you try googling "html to word" perl to see what fits your needs?
IMHO there is no trivial answer to such a general question!
Word-doc is effectively a print format like PDF, while HTML is a multi-device format. That means you have to decide what the print version has to look like: where the page breaks are, the font sizes ... and so on.
Furthermore, word-doc is (was???) a closed proprietary format; converting to RTF is much easier and better supported.
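For the RTF route, here is a minimal sketch, assuming the CPAN module HTML::FormatRTF (from the HTML-Formatter distribution) is installed. The sub name html_file_to_rtf and the font choice are just illustrative. Expect only basic structure (headings, paragraphs, emphasis) to survive, not CSS layout.

```perl
use strict;
use warnings;

# Convert an HTML file to RTF with HTML::FormatRTF.
# Assumption: the module is installed from CPAN; it is not core Perl.
sub html_file_to_rtf {
    my ($html_path, $rtf_path) = @_;

    require HTML::FormatRTF;    # loaded lazily so the script still compiles without it

    # format_file parses the HTML and returns the RTF as a string
    my $rtf = HTML::FormatRTF->format_file(
        $html_path,
        fontname_headings => 'Verdana',    # illustrative option, not required
    );

    open my $out, '>', $rtf_path or die "Cannot write $rtf_path: $!";
    print {$out} $rtf;
    close $out or die "Cannot close $rtf_path: $!";

    return $rtf_path;
}

# Usage: perl html2rtf.pl input.html output.rtf
html_file_to_rtf(@ARGV[0, 1]) if @ARGV == 2;
```

MS-Word opens the resulting .rtf directly, so this may already be enough if you don't need a real .doc.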
If you just want the default formatting MS-Word produces, you should simply use its API and script the load and save-as.
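Scripting the load and save-as could look roughly like this. It's a sketch, assuming Windows with MS-Word installed and the Win32::OLE module available. The sub names doc_path_for and html_to_doc are made up for illustration, and 0 is the value of Word's wdFormatDocument constant (the .doc format).

```perl
use strict;
use warnings;
use File::Spec;

# Value of Word's wdFormatDocument constant (save as binary .doc)
use constant WD_FORMAT_DOCUMENT => 0;

# Derive the absolute output path: page.html -> page.doc
sub doc_path_for {
    my ($html_path) = @_;
    my $doc_path = File::Spec->rel2abs($html_path);
    $doc_path =~ s/\.html?\z//i;    # drop .htm/.html if present
    return "$doc_path.doc";
}

# Let Word itself open the HTML file and save it as a .doc
sub html_to_doc {
    my ($html_path) = @_;

    require Win32::OLE;    # Windows only; loaded lazily

    # 'Quit' is called on the Word object when it goes out of scope
    my $word = Win32::OLE->new('Word.Application', 'Quit')
        or die "Cannot start Word: " . Win32::OLE->LastError;
    $word->{Visible} = 0;

    # Word wants absolute paths
    my $doc = $word->Documents->Open(File::Spec->rel2abs($html_path))
        or die "Cannot open $html_path: " . Win32::OLE->LastError;

    my $out = doc_path_for($html_path);
    $doc->SaveAs($out, WD_FORMAT_DOCUMENT);
    $doc->Close(0);        # 0 = wdDoNotSaveChanges

    return $out;
}

# Usage: perl html2doc.pl page.html
html_to_doc($ARGV[0]) if @ARGV;
```

All layout decisions (page breaks, fonts) are then whatever Word's HTML import picks by default.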
UPDATE: IIRC word-doc can embed HTML objects. And I wouldn't be surprised if IE had a feature to export word-doc, so scripting that might be another approach...