in reply to Parsing with HTML::Parser

The above answers are excellent, but if you are looking to present this text in a clean formatted way, you might want to look at some of the html2txt programs that are floating around on the internet. Debian has one prepackaged that I have used and works quite well.

Of course this is useless if you just want the data, and don't care about the formatting.