G'day tos,
Adding some additional code to print values and improve output, and a -C (see "perlrun: -C") so I don't see garbled output, I get:
$ perl -C -wE 'use Unicode::GCString;use Unicode::Normalize;$t="Hütte" +; say "\$t[$t]"; print length("$t"), "\n";$g=Unicode::GCString->new(" +$t"); say "\$g[$g]"; print $g->columns, "\n"; say $g->chars;'
$t[Hütte] 6 $g[Hütte] 6 6
If I then tell Perl that the source code is written in UTF-8 (use utf8;):
$ perl -C -wE 'use utf8; use Unicode::GCString;use Unicode::Normalize; +$t="Hütte"; say "\$t[$t]"; print length("$t"), "\n";$g=Unicode::GCStr +ing->new("$t"); say "\$g[$g]"; print $g->columns, "\n"; say $g->chars +;'
$t[Hütte] 5 $g[Hütte] 5 5
Both of those outcomes seem reasonable to me. Does that help you at all? If not, please explain why you were expecting a 6 then a 5.
— Ken
In reply to Re: incorrect length of strings with diphthongs
by kcott
in thread incorrect length of strings with diphthongs
by tos
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |