Right, thanks again! I hadn't thought about code points vs. characters, but I'll keep this in mind; combining accents and other diacritics in particular are something I might well encounter.
Searching CPAN shows that there's a module for this, Unicode::Normalize, which I'll look into.
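For anyone following along, here is a rough sketch of the kind of thing I expect to try. It assumes that composing with NFC from Unicode::Normalize, or counting grapheme clusters with \X, is what I'll end up wanting; the variable names are just for illustration.

```perl
use strict;
use warnings;
use Unicode::Normalize qw(NFC);

# "e" followed by COMBINING ACUTE ACCENT (U+0301): one visible
# character, but two code points.
my $decomposed = "e\x{301}";

# NFC composes the pair into the single code point U+00E9 where a
# precomposed form exists.
my $composed = NFC($decomposed);

# \X matches one extended grapheme cluster, so counting its matches
# counts user-perceived characters whether or not a precomposed form exists.
my $graphemes = () = $decomposed =~ /\X/g;

printf "code points before NFC: %d\n", length $decomposed;
printf "code points after NFC:  %d\n", length $composed;
printf "grapheme clusters:      %d\n", $graphemes;
```

Normalizing won't help for combinations that have no precomposed form, so counting \X matches is probably the safer of the two if the goal is "characters as a reader sees them".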
In reply to Re^4: length() miscounting UTF8 characters?
by AppleFritter
in thread length() miscounting UTF8 characters?
by Anonymous Monk