That's how I understand it, but it doesn't always work. I've brought it up in other newsgroups and am told that strings will eventually be stored "transparently" but will still handle binary data OK, though I can't really get a better answer than that.
Currently you need to have use utf8 in the scope of the regex to enable certain behavior.
There is also the fact that any particular string may be byte or character encoded, but there is no function to tell which. A regex on a byte string will work with binary data; no special pragma is needed.
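To illustrate the byte-string case, here is a minimal sketch. The helper name find_signature and the sample data (a PNG file signature preceded by two junk bytes) are my own assumptions for the example, not anything from the docs. It only shows that a plain regex with no pragma in scope matches raw byte values.

```perl
use strict;
use warnings;

# find_signature is a hypothetical helper made up for this example.
# Returns the byte offset of the PNG signature in $data, or -1 if absent.
sub find_signature {
    my ($data) = @_;
    # No utf8 or bytes pragma in scope: \x89 and \x1A match raw byte values.
    return $data =~ /\x89PNG\r\n\x1A\n/ ? $-[0] : -1;
}

# Two junk bytes (a NUL and 0xFF) followed by the signature.
my $blob = "\x00\xFF\x89PNG\r\n\x1A\n";
my $offset = find_signature($blob);
print "$offset\n";    # prints 2
```

Every character in $blob is below 256, so the match here is byte for byte. This says nothing about what happens once character data gets mixed in, which is where the trouble starts.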
However, use bytes and use utf8 are not simple opposites as I had thought from reading those docs. They mean different things, and the real behavior is different from what p5p people tell me.
So, always test it and try it. Unicode support is still "experimental" according to the docs.