in reply to Parsing HTML

Non-Perl, but what about lynx -dump? Granted, you have to fiddle with the output a tad, but that gives you nicely rendered text without any HTML.
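
For what it's worth, here is a rough sketch of that idea, in Python rather than Perl since this is the non-Perl route anyway. It assumes lynx is installed and on your PATH. The helper names lynx_dump and tidy are made up for illustration, and -nolist is my addition (it suppresses the numbered link list that lynx otherwise appends to a dump). The "fiddling" shown is just one guess at what you might want: trimming trailing whitespace and collapsing runs of blank lines.

```python
import re
import subprocess


def lynx_dump(url, lynx="lynx"):
    """Return the rendered text of `url` as produced by `lynx -dump`.

    Assumes the lynx binary is installed; `-nolist` drops the
    trailing list of link references from the dump.
    """
    result = subprocess.run(
        [lynx, "-dump", "-nolist", url],
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if lynx fails
    )
    return result.stdout


def tidy(text):
    """Light cleanup of dumped text (hypothetical helper).

    Strips trailing whitespace from each line, collapses three or
    more consecutive newlines into a single blank line, and trims
    leading/trailing whitespace from the text as a whole.
    """
    lines = [line.rstrip() for line in text.splitlines()]
    cleaned = "\n".join(lines)
    cleaned = re.sub(r"\n{3,}", "\n\n", cleaned)
    return cleaned.strip()
```

You would then call something like tidy(lynx_dump("http://example.com/")) and carry on with plain text. How much more cleanup you need depends on the pages you are feeding it.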