in reply to Extracting similar data from html
I think you will find that using CPAN's HTML modules will be your best bet, rather than using regular expressions to parse out the sample HTML you posted.