in reply to Re: can't get rid of BOM from UTF-8 webpage
in thread can't get rid of BOM from UTF-8 webpage

Thanks to both of you for your responses, that helps clarify things. Looks like it's a CMD issue which can be solved in Perl with a little manual work. Fortunately the display isn't critical to my project.

Since you asked, I found the suggestion for changing the code page to 65001 here:
http://stackoverflow.com/questions/379240/is-there-a-windows-command-shell-that-will-display-unicode-characters
This was also a helpful walkthrough:
http://stackoverflow.com/questions/1259084/what-encoding-code-page-is-cmd-exe-using

For the encoding/decoding suggestions, my main source was just the Perl Unicode Tutorial.

  • Comment on Re^2: can't get rid of BOM from UTF-8 webpage