I need to parse some HTML pages, but these pages also have HTML::Mason markup in them. The <% %> tags in particular are challenging since they may have perl code containing > and < characters which may confuse strict HTML parsers.
Is is possible to make HTML::Parser (or some other module) correctly handle the extra markup? Any advice on how to go about it would also be appreciated.