And if you're going to want to parse the document, take a look
at HTML::Parser on CPAN. It's the best way to parse
HTML and make sure that you get it right.
I definetly agree with this. If you attempt to parse HTML yourself, you will end up with all kinds of problems (e.g. angle brackets embedded in quotes will probably trip up your code etc.) Leave it to the experts!