perl has to assume a character encoding.

Not at all. If it must assume an encoding, and that encoding is iso-8859-1 for

"\x{E4}" =~ /\w/

then what encoding is assumed for the following?

"\x{2660}" =~ /\w/

It never deals with any encoding. It always deals with string elements (characters). And those string elements (characters) are assumedrequired to be Unicode code points.

It's entirely up to you to create a string with the right elements, which may or may not involve character encodings.

Or what do you think it is, if not ISO-8859-1?

A Unicode code point, regardless of the state of the UTF8 flag.


In short, you're over complicating things. It's NOT:

Each character is expected to be an iso-8859-1 byte if UTF8=0 or a Unicode code point if UTF8=1.

It's simply:

Each character is expected to be a Unicode code point.


In reply to Re^4: How does the built-in function length work? by ikegami
in thread How does the built-in function length work? by PerlOnTheWay

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.