in reply to Problem with parsing HTML with Regex's

None of the proposed regex solutions properly handle:

<img alt=">click here<" src="/images/button.gif" NAME=">click<">

Yet Another Reason to use an HTML parser.
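
To make the failure concrete, here is a minimal sketch in Python rather than Perl. The pattern `<[^>]*>` is only an assumed stand-in for the kind of tag-stripping regex under discussion, not a quote of any particular reply, and the names `naive_strip`, `TagCollector` and `parse_tags` are made up for illustration. The parser is the standard library's `html.parser`.

```python
import re
from html.parser import HTMLParser

# The example tag from the post: '>' and '<' appear inside quoted attribute values.
TAG = '<img alt=">click here<" src="/images/button.gif" NAME=">click<">'


def naive_strip(html):
    """Strip 'tags' with a naive pattern that stops at the first '>' it sees."""
    return re.sub(r'<[^>]*>', '', html)


class TagCollector(HTMLParser):
    """Record each start tag together with its attributes (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.tags = []

    def handle_starttag(self, tag, attrs):
        # attrs arrives as a list of (name, value) pairs; names are lowercased.
        self.tags.append((tag, dict(attrs)))


def parse_tags(html):
    collector = TagCollector()
    collector.feed(html)
    collector.close()
    return collector.tags


# The naive regex ends the tag at the '>' inside alt="..." and leaks attribute text.
print(repr(naive_strip(TAG)))
# The parser sees one img element with three attributes.
print(parse_tags(TAG))
```

The regex leaves the attribute text `click hereclick` behind, where correct tag stripping would leave nothing at all. The parser reports a single `img` tag with `alt`, `src` and `name` intact.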

Update:
Make that "none of the proposed purely regex solutions ..." Happy now? ;-)

Yes, the given regexes can all be modified to work with this (psychotic) example. The point is that parsing HTML is difficult, and that this wheel has already been invented a few times.
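
As a rough sketch of what such a modification might look like (again in Python, and again an assumption rather than any poster's actual regex), the pattern below skips over quoted attribute values before looking for the closing `>`. The names `QUOTE_AWARE_TAG` and `strip_tags` are hypothetical.

```python
import re

# Inside a tag, consume either an ordinary character or a whole quoted string,
# so a '>' inside "..." or '...' no longer ends the tag.
QUOTE_AWARE_TAG = re.compile(r'''<(?:[^>"']|"[^"]*"|'[^']*')*>''')


def strip_tags(html):
    """Remove tags, treating quoted attribute values as opaque."""
    return QUOTE_AWARE_TAG.sub('', html)


TAG = '<img alt=">click here<" src="/images/button.gif" NAME=">click<">'
print(repr(strip_tags(TAG)))

# Still not a parser: a '>' inside a comment ends the "tag" too early.
print(repr(strip_tags('<!-- a > b -->text')))
```

This handles the example tag, but the comment case shows the next hole: the pattern stops at the `>` inside the comment and leaves ` b -->text` behind. Each patch like this reinvents a little more of the wheel an HTML parser already provides.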


My parents just came back from a planet where the dominant life form had no
bilateral symmetry, and all I got was this stupid F-Shirt.

Re: Re: Problem with parsing HTML with Regex's
by Anonymous Monk on Nov 10, 2003 at 16:13 UTC
    *cough*cough* YAPE::HTML is a regex solution ;)