in reply to Matching UTF8 Regexps

Your regex source variable is double-quoted. This leads me to believe you'll have problems with backslashes and other regex-specific symbols that are incorrectly interpreted in a double-quoted string. Try using qr// to contain your regex instead:
my $regexp_source = qr/... regexp string .../;
_____________________________________________________
Jeff japhy Pinyan, P.L., P.M., P.O.D, X.S.: Perl, regex, and perl hacker
How can we ever be the sold short or the cheated, we who for every service have long ago been overpaid? ~~ Meister Eckhart

Replies are listed 'Best First'.
Re^2: Matching UTF8 Regexps
by lestrrat (Deacon) on Mar 07, 2005 at 09:23 UTC

    Thanks. Tried it, but no luck (same problem). Your suggestion has a valid point, but if that were the source of the problem, I don't see why encoding the content of the page makes a difference. (And in this particular case, I had no escapes in my regexp string -- which was just by pure luck)

    I'm guessing it has more to do with UTF8-funkiness than regular expression per say...