in reply to regex and HTML

You can simplify your regexes a lot, by deleting all the HTML tags. Thereafter, you have to look at the structure of the remaining text and whitespace. You could try:
undef $/; my $html = <DATA>; $html =~ s/<.+?>//sg; #strips all HTML, in a quick'n'dirty way $html =~ s/\s+/ /sg; #normalizes whitespace
The final match is left as an exercise...

Jeroen
"We are not alone"(FZ)

Replies are listed 'Best First'.
Re: Re: regex and HTML
by Beatnik (Parson) on Apr 09, 2001 at 20:14 UTC
Re: Re: regex and HTML
by toadi (Chaplain) on Apr 09, 2001 at 23:49 UTC
    lol,
    Got same idea as you. Just stripped the HTML. But your regex is prettier then mine so I won't post it :P

    --
    My opinions may have changed,
    but not the fact that I am right