in reply to Comparing utf8 strings

Perl 5.8+ has UNICODE support integrated so that eq, regular expressions, etc all work without issue.

However, you need to decode the encoded bytes into UNICODE text strings. There are a few ways of doing that:

Don't forget that you need to do the inverse — encode the characters into UTF-8 bytes — when you output.

If your source code is UTF-8 encoded, add use utf8; at the top of your file.

Replies are listed 'Best First'.
Re^2: Comparing utf8 strings
by Anonymous Monk on Apr 16, 2008 at 00:53 UTC
    Of course, you'll need to find a way to deal with precomposed and combined characters. The easiest thing to do is to ignore it completely...