Here's the offending thing (as displayed in a text editor)
The Widget&reg is the perfect...
What is this? I know & is the ampersand, and ® is the (R) symbol -- is this semi-mangled markup, with the amp 'double encoded'? I'm trying the
my $octets = encode("utf8", $x, Encode::FB_DEFAULT); $x = decode("utf8", $x, Encode::FB_DEFAULT);
suggestion and the character is still there. Should I be using  use bytes; as well?

All are welcome to downvote this node and just tell me to "RTFM", but I've been trying to make sense of the docs, and golly, UTF is hard to understand. For me, at least. Mea culpa.

I just want to strip this stuff and make it go away from my files.

utf befuddled --

water


In reply to Re: UTF-8 Malformed Char Error -- how to find and remove bad chars by water
in thread UTF-8 Malformed Char Error -- how to find and remove bad chars by water

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.