in reply to Convert international characters to plain ASCII
Once you have the string in that form, you get rid of the diacritic marks (leaving the letters in place) as follows:
    s/\pM+//g;

(See the description of the "\p" regex options in perlunicode, perluniprops and perlre.)
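For context, here is a minimal end-to-end sketch. It assumes "that form" means a canonically decomposed string (NFD), which it gets from the core Unicode::Normalize module. The helper name strip_diacritics is made up for illustration.

```perl
use strict;
use warnings;
use Unicode::Normalize qw(NFD);

# Hypothetical helper: decompose, then drop the combining marks.
sub strip_diacritics {
    my ($text) = @_;
    my $decomposed = NFD($text);    # e.g. U+00E9 becomes "e" + U+0301
    $decomposed =~ s/\pM+//g;       # remove every run of combining marks
    return $decomposed;
}

# Input is written with \x{...} escapes so the source file stays pure ASCII:
# "caf<e-acute> na<i-diaeresis>ve <A-ring>ngstr<o-diaeresis>m"
my $plain = strip_diacritics("caf\x{e9} na\x{ef}ve \x{c5}ngstr\x{f6}m");
print "$plain\n";                   # prints "cafe naive Angstrom"
```

The substitution only does anything useful after decomposition: in a precomposed (NFC) string, U+00E9 is a single letter with no separate mark for \pM to match.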
Update: I forgot to mention -- even after taking care of the diacritic marks, you are likely to still have some non-ASCII characters left behind (i.e. things that are not an ASCII letter plus a diacritic mark, but are letters or punctuation that fall outside the ASCII range). You might need to tailor some ad-hoc replacements for those if you really need the data to be coherent in an ASCII-only environment.
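One possible shape for those ad-hoc replacements is a lookup table applied after the marks are stripped. This is only a sketch: the table %ascii_for, the helper to_ascii, and the choice of "?" for anything unmapped are all assumptions for illustration, and the table is deliberately incomplete.

```perl
use strict;
use warnings;
use Unicode::Normalize qw(NFD);

# Hypothetical, incomplete table of characters that have no
# "ASCII letter + diacritic" decomposition. Extend it for your own data.
my %ascii_for = (
    "\x{00DF}" => 'ss',   # LATIN SMALL LETTER SHARP S
    "\x{00E6}" => 'ae',   # LATIN SMALL LETTER AE
    "\x{00F8}" => 'o',    # LATIN SMALL LETTER O WITH STROKE
    "\x{0142}" => 'l',    # LATIN SMALL LETTER L WITH STROKE
    "\x{2018}" => "'",    # LEFT SINGLE QUOTATION MARK
    "\x{2019}" => "'",    # RIGHT SINGLE QUOTATION MARK
    "\x{201C}" => '"',    # LEFT DOUBLE QUOTATION MARK
    "\x{201D}" => '"',    # RIGHT DOUBLE QUOTATION MARK
    "\x{2013}" => '-',    # EN DASH
);

sub to_ascii {
    my ($text) = @_;
    $text = NFD($text);
    $text =~ s/\pM+//g;   # strip diacritic marks first
    # Map what is left; anything not in the table becomes "?" (an arbitrary choice).
    $text =~ s/([^\x00-\x7F])/exists $ascii_for{$1} ? $ascii_for{$1} : '?'/ge;
    return $text;
}

# "Stra<sharp-s>e, S<o-with-stroke>ren"
print to_ascii("Stra\x{df}e, S\x{f8}ren"), "\n";   # prints "Strasse, Soren"
```

Whether "ss" or "o" is the right stand-in depends on the language and on what the data is for, which is why this part has to be tailored by hand.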