in reply to html parsing
HTML::FormatText::Html2text is designed to do precisely that (as its name might suggest): it converts HTML to plain text by handing the markup to the external html2text program.
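
A minimal sketch of how it might be used, assuming the module is installed from CPAN (it ships in the HTML-FormatExternal distribution) and that the html2text program is on your PATH. The helper name html_to_text and the sample markup are invented for illustration:

```perl
use strict;
use warnings;
use HTML::FormatText::Html2text;

# Hypothetical helper (not part of the module): HTML string in, plain text out.
sub html_to_text {
    my ($html) = @_;
    # format_string() is a class method; it passes the markup to the
    # external html2text program and returns that program's output.
    return HTML::FormatText::Html2text->format_string($html);
}

# Sample markup, made up for this example.
my $html = '<html><body><h1>Hello</h1><p>Some <b>bold</b> text.</p></body></html>';
print html_to_text($html);
```

If the HTML lives in a file rather than a string, format_file($filename) is called the same way. The exact layout of the output depends on which html2text version is installed, so treat the result as approximate.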