in reply to Conversion of combined into pre-combined Unicode characters
Unicode::Normalize's NFC
use charnames ':full'; use Unicode::Normalize qw( NFC ); sub dump_str { print(charnames::viacode(ord($_)), "\n") for split //, $_[0]; } $_ = "\N{LATIN SMALL LETTER A}\N{COMBINING DIAERESIS}"; dump_str($_); print("--\n"); $_ = NFC($_); dump_str($_);
LATIN SMALL LETTER A COMBINING DIAERESIS -- LATIN SMALL LETTER A WITH DIAERESIS
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^2: Conversion of combined into pre-combined Unicode characters
by Locutus (Beadle) on Mar 25, 2010 at 16:44 UTC |