in reply to Re: A copyeditor needs help to get started with a Perl project
in thread A copyeditor needs help to get started with a Perl project
Word reads its own HTML very well
That is a very good thought. As horrid as Word HTML is to the naked eye HTML parser should let you whip through it with ease, editing the text but leaving the puke vomit markup formatting. Then as you say let Word convert its own excreta back into native format. This conversion is essentially just padding with huge numbers of null bytes for every real character, thus 'Hello World!' as a text file is 13 bytes but in .DOC format it needs a mere 19,456 :-)
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^3: A copyeditor needs help to get started with a Perl project
by gaal (Parson) on Nov 04, 2004 at 11:06 UTC | |
by tachyon (Chancellor) on Nov 04, 2004 at 15:30 UTC | |
by gaal (Parson) on Nov 04, 2004 at 15:51 UTC | |
|
Re^3: A copyeditor needs help to get started with a Perl project
by ww (Archbishop) on Nov 04, 2004 at 14:55 UTC |