in reply to Publish or Polish

Semi-OT: Word-generated HTML is, in a word, "YUCKY!" ...and increasingly so with each version. Word 97 output was somewhat sane; output from more recent versions is not.

So, to address "polishing" issues, you may want to look at demoronizer, which is rather limited but eminently extensible. Extending it is a project which has been (mostly) sitting on my ToDo shelf for far too long.

But as to your basic question, I can only echo the advice: publish and update; don't delay.

Re^2: Publish or Polish
by GrandFather (Saint) on Jun 22, 2005 at 22:20 UTC

    I've looked at both demoronizer and tidy. Tidy strips out stuff that is useful (like <span> tags). I glanced at demoronizer, but decided I wouldn't gain much by using it as a pre-pass over the HTML.

    It's easier to use HTML::TreeBuilder to suck in the lot, then pull out the elements that I'm interested in. Mostly works pretty well. I get headings, tables, some character styles (like <code>) and anchors.
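
    For anyone wanting to try that approach, here is a minimal sketch of pulling headings, anchors, <code> runs and table cells out of a parsed tree. The sample markup, the variable names and the example.com URL are made up for illustration; this is not the script described above, just one way the idea might look with HTML::TreeBuilder's look_down.

```perl
use strict;
use warnings;
use HTML::TreeBuilder;

# Hypothetical sample standing in for Word-exported HTML.
my $html = <<'HTML';
<html><body>
<h1 class="MsoTitle">Report</h1>
<p class="MsoNormal"><span style="color:red">See</span>
<a href="http://example.com/">the site</a> and run <code>perl -v</code>.</p>
<table><tr><td>a</td><td>b</td></tr></table>
</body></html>
HTML

# Suck in the whole document.
my $tree = HTML::TreeBuilder->new_from_content($html);

# Pull out only the elements of interest; everything else is ignored.
my @headings = map { $_->as_text } $tree->look_down(_tag => qr/^h[1-6]$/);

# Anchors: keep only <a> elements that actually carry an href.
my @links = map { $_->attr('href') }
    $tree->look_down(_tag => 'a', sub { defined $_[0]->attr('href') });

my @code = map { $_->as_text } $tree->look_down(_tag => 'code');

# Tables: one array ref of cell text per row.
my @rows = map {
    [ map { $_->as_text } $_->look_down(_tag => qr/^t[dh]$/) ]
} $tree->look_down(_tag => 'tr');

print "heading: $_\n" for @headings;
print "link:    $_\n" for @links;
print "code:    $_\n" for @code;
print "row:     @$_\n" for @rows;

# Free the tree explicitly when done with it.
$tree->delete;
```

    Note that the row loop above also collects cells from nested tables, so real Word output may need a stricter match.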


    Perl is Huffman encoded by design.