in reply to UTF-8 Malformed Char Error -- how to find and remove bad chars
What is this?

    The Widget&reg is the perfect...

I know & is the ampersand, and ® is the (R) symbol -- is this semi-mangled markup, with the amp 'double encoded'? I'm trying the

    my $octets = encode("utf8", $x, Encode::FB_DEFAULT);
    $x = decode("utf8", $x, Encode::FB_DEFAULT);

suggestion and the character is still there. Should I be adding "use bytes;" as well?
All are welcome to downvote this node and just tell me to "RTFM", but I've been trying to make sense of the docs, and golly, UTF is hard to understand. For me, at least. Mea culpa.
I just want to strip this stuff and make it go away from my files.
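For reference, here is a minimal sketch of one way the stripping could look. It assumes the data arrives as raw bytes, and the helper name clean_line and the entity pattern are made up for illustration, not taken from the thread. One thing it tries to show: "&reg" is plain ASCII, so an Encode round trip would not be expected to touch it, and it needs its own substitution.

```perl
use strict;
use warnings;
use Encode qw(decode);

# Hypothetical helper (not from the thread): clean one line of raw bytes.
sub clean_line {
    my ($octets) = @_;

    # Decode the raw bytes as UTF-8. With FB_DEFAULT, each malformed
    # sequence is replaced by U+FFFD instead of raising an error.
    my $text = decode('UTF-8', $octets, Encode::FB_DEFAULT);

    # Drop the substitution characters left behind by the decode.
    $text =~ s/\x{FFFD}//g;

    # Entity residue such as "&reg" or "&amp;reg;" is plain ASCII, so the
    # decode step leaves it alone. This pattern is an assumption about
    # what the residue looks like; adjust it to the real data.
    $text =~ s/&(?:amp;)?reg\b;?//g;

    return $text;
}

# Sample input: the line from the post plus one stray, invalid byte.
my $dirty = "The Widget&reg is the perfect \xE9 gift";
print clean_line($dirty), "\n";
```

If the (R) symbol should be kept rather than dropped, the last substitution could replace the match with "\x{AE}" instead of the empty string; the output would then need to be encoded back to UTF-8 before printing.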
utf befuddled --
Replies are listed 'Best First'.
Re^2: UTF-8 Malformed Char Error -- how to find and remove bad chars
by iburrell (Chaplain) on Jun 23, 2004 at 16:19 UTC
by water (Deacon) on Jun 23, 2004 at 16:56 UTC
by iburrell (Chaplain) on Jun 23, 2004 at 18:18 UTC