in reply to Re: Converting HTML to DOC
in thread Converting HTML to DOC

Did you try to google, to see what fits your needs "html to word" perl?

IMHO there is no trivial answer to such a general question!

Word-doc is effectively a print format like PDF, HTML is a multi device format, that means you have to decide what the print-version has to look like, where the page breaks are, font-sizes ... and so on.

Furthermore word-doc is (was???) a closed proprietary format, converting to RTF is much easier and better supported.

If you just want the default formating MS-Word produces you should simply use it's API and script the load and save-as.

Cheers Rolf

UPDATE: IIRC word-doc can embed HTML-Objects. And I wouldn't be surprised if the IE has a feature to export word-doc, so maybe another approach to script it...