hotshot has asked for the wisdom of the Perl Monks concerning the following question:
I'm checking the overhead of supporting unicode in my Perl project, as I managed to see till now, without using any unicode module (utf8), Perl just "gives what she gets", for example when I used opendir to get dirs list under a given directory and I have there dirs opened in korean or german language (in utf8), perl receives it and displays it properly.
The problem starts when I try to manipulate the directory with a regular expression. does it mean I'll have to change all my regexps (endless regexps) to support unicode (using IsAlnum and '-' for \w for example), the regexps will be much complicated (long), and won't have all the power of old ones?
HotshotEdited: ~Wed Oct 30 16:38:08 2002 (GMT) by footpad: Retitled (was Unicode), added <P> tags, and fixed minor spelling errors - per Consideration
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re: Unicode and regexes
by dakkar (Hermit) on Oct 30, 2002 at 17:51 UTC | |
by hotshot (Prior) on Oct 31, 2002 at 07:54 UTC |