Unicode is not an area that I have had occasion to delve into in any great depth. But I have read a few resources on the subject and can recommend them as a good place to start.

You can start with perluniintro from the perldocs. I think it was introduced with Perl version 5.8.0, so you'll have to look at a recent release of Perl to find that POD. perluniintro seems to be a user-friendly introduction to Unicode as it pertains to Perl.

Once you've digested that, you can go straight to the meat and potatos: perlunicode, also from the perldocs.

As for modules, there are a plethora of them on CPAN. However, Unicode support is native to Perl in the more recent releases such as 5.8.0. You don't need a module to take advantage of the built-in support. However, as always, the modules on CPAN may serve to make your life easier as you identify specific needs. A quick search of http://search.cpan.org for unicode turned up a number of them under the Unicode:: namespace. Scan through the PODs for those modules and you'll have a pretty good idea of whether any of them might be of use to your endeavor.

The official unicode home page is http://www.unicode.org. It has a lot of good articles on using unicode encoding.

Good luck!


Dave


"If I had my life to do over again, I'd be a plumber." -- Albert Einstein

In reply to Re: Japanese characters in web application by davido
in thread Japanese characters in web application by ezekiel

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.