O masters of international encoding:

I'm a Perl veteran but a Unicode n00b. Say I have a title in a foreign language, such as "Vals På Vinkelgränd", and I need to convert it to "ASCII": that is, I want to get "Vals Pa Vinkelgrand" out of it. What Perl module can do this? (Note: this is a personal script that will always run on Perl 5.14.2 or higher, so I'm not worried about Unicode bugs or odd implementations in older Perls.)

For the curious, here's the application: I have a bunch of MP3 files, and many of them have these international characters (the example title is from the Movits! album Äppelknyckarjazz). I want to script a way to turn the song titles into filenames. Unfortunately, I'm using Dropbox, and apparently its filesystem doesn't cope with Unicode characters in filenames (or, for that matter, with filenames that differ only by case; perhaps they have a giant DOS server farm :-/ ). Every time I store a file with a Unicode character in its name in my Dropbox folder, the name comes out garbled on the other side. So I figured the simplest thing would be to give up: leave the full Unicode strings in the MP3 tags, but turn the filenames into "plain ASCII." And I'd like to script this solution.
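To make the goal concrete, here is a rough sketch of the behaviour I'm after. It assumes that decomposing each character with the core Unicode::Normalize module and then throwing away the combining marks is good enough; `to_ascii` is just a name I made up for illustration, and replacing leftovers with an underscore is my own arbitrary choice.

```perl
use strict;
use warnings;
use utf8;                          # this source file contains UTF-8 literals
use Unicode::Normalize qw(NFD);

# Hypothetical helper: reduce a decoded Unicode string to plain ASCII.
sub to_ascii {
    my ($title) = @_;
    my $decomposed = NFD($title);        # e.g. "å" becomes "a" + U+030A
    $decomposed =~ s/\p{Mn}//g;          # drop the combining marks
    $decomposed =~ s/[^\x00-\x7F]/_/g;   # assumption: anything still non-ASCII becomes "_"
    return $decomposed;
}

print to_ascii("Vals På Vinkelgränd"), "\n";   # prints "Vals Pa Vinkelgrand"
```

Letters with no canonical decomposition (ø or ß, for example) would just turn into underscores with this approach, so a module that transliterates them properly, possibly something like Text::Unidecode from CPAN, may well be the better answer.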

I humbly await enlightenment.


In reply to Convert international characters to plain ASCII by Oberon
