Another alternative is HTML::TokeParser::Simple. Which HTML parsing module you end up choosing is up to you. The main point is that you should definitely use one of them rather than trying to write custom regexps.
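To give a rough idea of what that looks like, here is a minimal sketch that pulls the href values out of a snippet with HTML::TokeParser::Simple. The sample markup and the @links variable are made up for illustration, and it assumes a version of the module recent enough to accept the string => ... constructor form.

```perl
use strict;
use warnings;
use HTML::TokeParser::Simple;

# Hypothetical sample markup. Attribute order and quoting style differ
# between the two links, which is the kind of variation that tends to
# break hand-written regexps.
my $html = <<'HTML';
<p>See <a href="http://example.com/one">one</a> and
<a class="ext" href='http://example.com/two'>two</a>.</p>
HTML

my $parser = HTML::TokeParser::Simple->new(string => $html);

my @links;
while (my $token = $parser->get_token) {
    # Skip everything except opening <a> tags.
    next unless $token->is_start_tag('a');

    # get_attr returns undef when the attribute is missing.
    my $href = $token->get_attr('href');
    push @links, $href if defined $href;
}

print "$_\n" for @links;
```

The parser deals with quoting and attribute order, so the loop only has to say which tokens it cares about.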