I personally like
Web::Scraper, but this talk from TPC in the Cloud this year shows some compelling alternative approaches. Parsing HTML is not the main point of the talk, but he does give it a good treatment:
Bruce Gray - Refactoring and Readability: Crouching Regex, Hidden Structures. IMO easily one of the top 3 talks of the conference, if not the best.