in reply to changing format of the first word of every line in an HTML doc

A good general approach would be to save the document as .DOCX, which is a compressed-XML format.   Then, use an XML parsing package.

In particular, you want to look at “XPath expressions,” which allows you to traverse the entire XML data-structure looking for what you need to find ... without writing code to do so.

Replies are listed 'Best First'.
Re^2: changing format of the first word of every line in an HTML doc
by sharkyflip (Initiate) on Jan 13, 2011 at 12:03 UTC

    Thanks for this suggestion, sundial. =) This approach looks promising. I've done what you suggested, by saving the document to .DOCX, and extracted the document.xml from the compressed file-directory by using ZIP-extraction tool. By making a quick analysis of the xml file, I can identify where the first words of ever line are. Can you recommend a (free) software that would allow me to examine and modify the node-structure, attributes and elements of an XML-file?

    Thanks so much!