    The Widget&amp;reg is the perfect...
What is this? I know &amp; is the ampersand, and &reg; is the (R) symbol -- is this semi-mangled markup, with the amp 'double encoded'? I'm trying the
    my $octets = encode("utf8", $x, Encode::FB_DEFAULT);
    $x = decode("utf8", $x, Encode::FB_DEFAULT);
suggestion and the character is still there. Should I be using use bytes; as well?
All are welcome to downvote this node and just tell me to "RTFM", but I've been trying to make sense of the docs, and golly, UTF is hard to understand. For me, at least. Mea culpa.
I just want to strip this stuff and make it go away from my files.
utf befuddled --
In reply to Re: UTF-8 Malformed Char Error -- how to find and remove bad chars
by water
in thread UTF-8 Malformed Char Error -- how to find and remove bad chars
by water