in reply to Strange Unicode normalization question
but I have no evidence of NonspacingMark ever being in the normalized string.
There are three in the example you gave:
use strict;
use warnings qw( all );
use feature qw( say );
use utf8;
use open ':std', ':encoding(UTF-8)';
use charnames qw( );
use Unicode::Normalize qw( NFKD );
my $html = "Lubomír,Bartoňová";
my $decomposedHtml = NFKD( $html );
say charnames::viacode(ord($_))
for $decomposedHtml =~ /(\p{NonspacingMark})/g;
Output:
COMBINING ACUTE ACCENT COMBINING CARON COMBINING ACUTE ACCENT
The code you posted is a hack to find an ASCII "equivalent" to the input.
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^2: Strange Unicode normalization question
by mje (Curate) on Aug 16, 2018 at 18:10 UTC | |
|
Re^2: Strange Unicode normalization question
by Veltro (Hermit) on Aug 16, 2018 at 11:43 UTC | |
by ikegami (Patriarch) on Aug 16, 2018 at 14:43 UTC | |
|
Re^2: Strange Unicode normalization question
by mje (Curate) on Aug 16, 2018 at 17:35 UTC | |
by Your Mother (Archbishop) on Aug 16, 2018 at 17:46 UTC |