in reply to Regex bafflement
I'd also note that the HTML you've shown us isn't necessarily invalid, if we assume a "transitional" doctype and a suitable container so that the dangling paragraph gets auto-closed. If you haven't yet actually seen what an HTML parser will do with it, using one might still be an option.
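
To make the auto-closing idea concrete, here is a rough sketch in Python using the standard library's `html.parser`. The sample string is made up (it is not the HTML from the question), and the `ParagraphCloser` class and `normalize` function are hypothetical names of mine. It only handles the simplest case, where an open `<p>` is ended by another `<p>`, by a `<div>`, or by the closing `</div>` of its container; a real HTML parser knows many more such rules.

```python
from html import escape
from html.parser import HTMLParser

# Hypothetical input: two paragraphs, neither explicitly closed, inside a <div>.
SAMPLE = "<div><p>first paragraph<p>second paragraph</div>"


class ParagraphCloser(HTMLParser):
    """Re-emit markup, adding the </p> tags that the author left out."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.out = []        # pieces of the rewritten markup
        self.p_open = False  # is there a <p> waiting for its end tag?

    def _close_p(self):
        if self.p_open:
            self.out.append("</p>")
            self.p_open = False

    def handle_starttag(self, tag, attrs):
        # A new <p> or <div> implicitly ends any paragraph still open.
        if tag in ("p", "div"):
            self._close_p()
        self.out.append(self.get_starttag_text())
        if tag == "p":
            self.p_open = True

    def handle_endtag(self, tag):
        if tag == "p":
            # Explicit </p>: emit it only if a paragraph is actually open.
            # (A stray </p> is simply dropped in this sketch.)
            self._close_p()
            return
        if tag == "div":
            # Closing the container also ends a dangling paragraph.
            self._close_p()
        self.out.append(f"</{tag}>")

    def handle_data(self, data):
        # Character references were decoded by the parser, so re-escape text.
        self.out.append(escape(data, quote=False))


def normalize(markup):
    parser = ParagraphCloser()
    parser.feed(markup)
    parser.close()     # flush any buffered text
    parser._close_p()  # a paragraph still open at end of input
    return "".join(parser.out)


print(normalize(SAMPLE))
```

The point is only that the parser copes with the missing end tags without complaint, so it is worth feeding your real input to a proper HTML parser before reaching for a regex.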