in reply to HTML Search Engine/Parser

Why not check cpan.org? You may want to look at: HTML::Parser and/or HTML::TreeBuilder