# your sample string
my $orig = "\x{2be}\x{e0}-da-um-=T\x{da}G-:2 1 AKTUM-=T\x{da}G";
my $s = $orig;
$s =~ s/(\p{Ll}+)/<i>$1<\/i>/g; # lower --> italic
$s =~ s/(\p{Lu}+)/lc($1)/ge; # upper --> lower
open my $fh, ">:utf8", "sample.html" or die $!;
print $fh qq|<html>
<header>
<meta http-equiv="content-type" content="text/html; charset=UTF-8">
</header>
<body>
$orig<br />
$s
</body>
</html>
|;
close $fh;
Then load the sample.html in your browser; the second line should
be the modified string. Except for the ʾ, it appears to work. I'm
not sure what the ʾ (\x{2be}) is. It doesn't seem to be treated as
a lowercase character (the Unicode database lists it among "spacing
modifying letters")... I'm afraid you'll have to figure that one out
yourself :)
|