in reply to Re^3: convert a string(which contains the contents of a file) into UTF-8 encoding
in thread convert a string(which contains the contents of a file) into UTF-8 encoding

Hi, Actually the above code works fine in Perl 5.6, but in Perl 5.8 it fails to convert the copyright and trademark symbols into UTF-8. Please advise. Regards, kamalakar

Re^5: convert a string(which contains the contents of a file) into UTF-8 encoding
by ikegami (Patriarch) on Oct 20, 2009 at 01:23 UTC

    While iso-latin-1 includes the Copyright symbol (©, U+00A9), it doesn't include the Trademark symbol (™, U+2122). Since the Trademark symbol can't be represented in iso-latin-1, it can't be converted from iso-latin-1 to UTF-8. Maybe you are using Microsoft's derivative of iso-latin-1, cp1252?

    Update: I initially stated the Copyright symbol wasn't in iso-latin-1 either. Fixed.
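
    To check whether the data is cp1252 rather than iso-latin-1, one option is to decode the same bytes both ways and compare the resulting UTF-8. The following is only a sketch: it assumes the core Encode module (shipped with Perl 5.8), and both the sample bytes and the helper name to_utf8_hex are made up for illustration, since the original code isn't shown here.

```perl
use strict;
use warnings;
use Encode qw(decode encode);

# Hypothetical helper: decode $octets from $encoding, re-encode as UTF-8,
# and return the resulting bytes as space-separated hex for inspection.
sub to_utf8_hex {
    my ($encoding, $octets) = @_;
    my $utf8 = encode('UTF-8', decode($encoding, $octets));
    return join ' ', map { sprintf '%02X', ord($_) } split //, $utf8;
}

# Assumed sample input: in cp1252, 0xA9 is the copyright sign and
# 0x99 is the trademark sign.
my $bytes = "\xA9 and \x99";

# Treated as cp1252, 0x99 becomes U+2122 (UTF-8: E2 84 A2).
print "cp1252:     ", to_utf8_hex('cp1252', $bytes), "\n";

# Treated as iso-8859-1, 0x99 is a C1 control character (U+0099),
# so the output ends in C2 99 instead of the trademark symbol.
print "iso-8859-1: ", to_utf8_hex('iso-8859-1', $bytes), "\n";
```

    If the cp1252 line ends in E2 84 A2 while the iso-8859-1 line ends in C2 99, the input is most likely cp1252, and decoding it as cp1252 should give the trademark symbol you expect.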