Perl has two ways to represent strings,
- without UTF-8 flag as "octet streams" i.e. a list of bytes
- with UTF-8 flag as "characters" in the internal representation°
\x{FEFF} represents the unicode character with the code-point #FEFF, since
Devel::Peek shows that the flag is missing, this character can't be found in the octet stream while replacing.
You need to tell Perl how to interpret the read data, the fact that it's "bytewise utf-8" alone doesn't help to see it as list of characters.
The use utf8; in my example just told Perl to read the script's source and all embedded literal strings as utf8.
see Encode for more.
°) which is almost UTF-8, hence the flag is - for historical reasons - a bit of a misnomer
Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
Read Where should I post X? if you're not absolutely sure you're posting in the right place.
Please read these before you post! —
Posts may use any of the Perl Monks Approved HTML tags:
- a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
| |
For: |
|
Use: |
| & | | & |
| < | | < |
| > | | > |
| [ | | [ |
| ] | | ] |
Link using PerlMonks shortcuts! What shortcuts can I use for linking?
See Writeup Formatting Tips and other pages linked from there for more info.