![]() |
|
more useful options | |
PerlMonks |
Re: Modern best practices for multilingual regexp alphabetical character matching?by ikegami (Patriarch) |
on Jan 12, 2009 at 21:40 UTC ( #735814=note: print w/replies, xml ) | Need Help?? |
Make sure it's decoded using one or more of the following
And make sure the string us stored internally as UTF-8.
(No need to do use utf8; to use utf8:: functions. use utf8; means the source is in UTF-8.) If you do those two things, regexp will use UNICODE semantics, so \w and character classes will match accented letters, etc.
In Section
Seekers of Perl Wisdom
|
|