Yes, I think that regexp's are for the simplest digs from html or xml only. Of course, you can write very sophisticated regexp, but this way is, imho, read only and more painfull.
So I suggest some html parser, especially, if you _really_ cannot get better data sources than html.