in reply to Parsing with HTML::Parser
The above answers are excellent, but if you are looking to present this text in a clean formatted way, you might want to look at some of the html2txt programs that are floating around on the internet. Debian has one prepackaged that I have used and works quite well.
Of course this is useless if you just want the data, and don't care about the formatting.
|
|---|