in reply to (OT) How to deal with non-ascii names
sub clean { my $s = unidecode(shift); $s =~ s/[^A-Za-z0-9'-]+/ /g; return $s; }
Text::Unidecode's unidecode will strip accents, while the substitution will get rid of leftover weird characters. unidecode will do a lot more than just that, though, so it might be a viable solution for you.
The main problem either approach present is when you have to deal with oriental names ( Korean, Japanese, Chinese, etc )
Perhaps you should ask for the name as it appears on the passport. They are romanised.
|
|---|