in reply to regex hangs forever

Unless i'm missing something, you never tell us what you're using for your web bot. I'd assume some version of LWP, but i could be wrong. May I suggest something that's not reinventing the wheel (if that's the case), such as WWW::Robot? You pretty much just plug in whatever web page you want retrieved, and it does all the work for you. Then you can move on to HTML::TokeParser to manage your HTML. Hope this helps.

meh.