DrWhy has asked for the wisdom of the Perl Monks concerning the following question:
I wonder if any of you know where I could get my hands on a perlio layer that does Unicode normalization. I have data that is (supposed to be) in UTF-8. I am writing code that uses the :encoding(utf8) layer to validate that it is in fact good UTF-8/Unicode data, but to work with the data I'd like to have it in normalized form (NFKC, to be specific). I'd really like to have that done in a layer on top of :encoding(utf8) so that I can read the data in blocks and not have to worry about the block boundaries falling between a base character and following combining characters.
Thanks,
--DrWhy
"If God had meant for us to think for ourselves he would have given us brains. Oh, wait..."
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re: unicode normalization layer
by graff (Chancellor) on Sep 16, 2009 at 03:44 UTC | |
by ikegami (Patriarch) on Sep 16, 2009 at 14:41 UTC | |
by DrWhy (Chaplain) on Sep 17, 2009 at 05:18 UTC | |
|
Re: unicode normalization layer
by ikegami (Patriarch) on Sep 15, 2009 at 21:39 UTC | |
by DrWhy (Chaplain) on Sep 15, 2009 at 22:24 UTC |