I wonder if any of you know where I could get my hands on a PerlIO layer that does Unicode normalization. I have data that is (supposed to be) in UTF-8. I am writing code that uses the :encoding(utf8) layer to validate that it is in fact good UTF-8/Unicode data, but to work with the data I'd like to have it in normalized form (NFKC, to be specific). I'd really like to have that done in a layer on top of :encoding(utf8), so that I can read the data in blocks and not have to worry about a block boundary falling between a base character and the combining characters that follow it.
Thanks,
--DrWhy
"If God had meant for us to think for ourselves he would have given us brains. Oh, wait..."
In reply to unicode normalization layer by DrWhy