
You can use the Japanese Wikipedia page on Perl as test input. Perl 5.8.3, which I have at work, produces different output files for
tr/ / /s; tr/\t/ /s;
and

s/ +/ /g; s/\t+/ /g;

I tested by comparing each output against the original with diff -w, i.e. ignoring whitespace differences.

Calling utf8::upgrade, whether before or after the substitution/transliteration, didn't change anything.
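
For anyone who wants to reproduce this without downloading the page, here is a minimal sketch of the comparison. The sample string and the helper names squeeze_tr and squeeze_s are made up for illustration; the two pairs of statements are the ones quoted above. The two approaches should be equivalent in principle, so any difference in the result points at the Perl build rather than at the code.

```perl
use strict;
use warnings;
use Encode qw(decode);

# Hypothetical sample (not the Wikipedia page): UTF-8 encoded Japanese text
# with runs of spaces and tabs, decoded into a character string.
my $bytes = "\xe6\x97\xa5\xe6\x9c\xac  \t\t\xe8\xaa\x9e   Perl";
my $text  = decode('UTF-8', $bytes);

# Transliteration variant: squeeze runs of spaces, then turn tab runs into one space.
sub squeeze_tr {
    my ($s) = @_;
    $s =~ tr/ / /s;
    $s =~ tr/\t/ /s;
    return $s;
}

# Substitution variant of the same two steps.
sub squeeze_s {
    my ($s) = @_;
    $s =~ s/ +/ /g;
    $s =~ s/\t+/ /g;
    return $s;
}

my $via_tr = squeeze_tr($text);
my $via_s  = squeeze_s($text);

# Same input again, but with utf8::upgrade applied first (it works in place).
my $upgraded = $text;
utf8::upgrade($upgraded);
my $via_tr_upgraded = squeeze_tr($upgraded);

print $via_tr eq $via_s           ? "tr and s agree\n"          : "tr and s differ\n";
print $via_tr eq $via_tr_upgraded ? "upgrade changes nothing\n" : "upgrade changes result\n";
```

If the first line reports a difference on a given Perl, the file-level diff is not needed to demonstrate the problem.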

لսႽ† ᥲᥒ⚪⟊Ⴙᘓᖇ Ꮅᘓᖇ⎱ Ⴙᥲ𝇋ƙᘓᖇ