in reply to [untitled node, ID 186662]

use HTML::Parser;

It handles maniacal markup you'll never think of in your homerolled regexen

Update: ++mkmcconn suggested I add HTML::TokeParser to the recommendation, and I agree (I knew I was forgetting a good one)

After Compline,
Zaxo