in reply to Publish or Polish

Also, can't newer versions of Word export as XML?

Replies are listed 'Best First'.
Re^2: Publish or Polish
by GrandFather (Saint) on Jun 22, 2005 at 21:38 UTC

    Maybe, but that is a whole different thing.

    I don't see XML options for saving with Word 2002 and even if there were I'd think the issues are unlikely to be solved that way.

    When starting this project I considered parsing .rtf or HTML and decided that the HTML tools for Perl were probably more mature so I went that way.


    Perl is Huffman encoded by design.
Re^2: Publish or Polish
by willyyam (Priest) on Jun 23, 2005 at 13:13 UTC

    Future versions of Word will publish in an XML "Open" format (it's not open, and is in fact a weapon MS is using against Open Source Software), but for now, no. From the looks of it the XML export option has a lot in common with Word-generated HTML. Ick.

    More importantly, tools are needed to get documents out of proprietary, vendor-locked formats into something you have a hope of opening in ten years.