I have some image files which have unicode caption.
Unicode is not a character encoding. If ExifTool doesn't decode the strings for you, you have to do it yourself, and to do that you first have to know which encoding they are in. There's no way around that.
However, I don't know how to output them to HTML file in their native encoding.
In which "native encoding"? That of the HTML files? Which encoding is that?
Let me get this straight: when you want to change the encoding of something, Encode (or the IO layers) is the way to go, but you have to know both the source and the destination encoding.
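A minimal sketch of that two-step conversion with Encode follows. The sample bytes and both encoding names are assumptions chosen for illustration (UTF-8 as the source, ISO-8859-1 as the destination); substitute whatever your captions and your HTML files really use.

```perl
use strict;
use warnings;
use Encode qw(decode encode);

# Hypothetical input: the raw bytes of a caption as read from an image file.
# Assumption: they are UTF-8. Replace this with the real source encoding.
my $raw_bytes = "Gr\xC3\xBC\xC3\x9Fe";    # "Grüße" encoded as UTF-8

# Step 1: decode the bytes into a Perl character string.
# FB_CROAK makes decode die on malformed input instead of silently
# substituting characters; LEAVE_SRC keeps $raw_bytes unmodified.
my $caption = decode('UTF-8', $raw_bytes,
                     Encode::FB_CROAK | Encode::LEAVE_SRC);

# Step 2: encode the character string into the destination encoding.
# Assumption: the HTML file is supposed to be ISO-8859-1.
my $out_bytes = encode('ISO-8859-1', $caption,
                       Encode::FB_CROAK | Encode::LEAVE_SRC);

# Show the resulting bytes in hex so they can be checked against a table.
my $hex = join ' ', unpack('(H2)*', $out_bytes);
print "$hex\n";    # prints: 47 72 fc df 65
```

With FB_CROAK, a caption containing a character that the destination encoding cannot represent makes encode die rather than write a wrong byte, so the failure shows up immediately.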
Also make sure to test as early as possible, and always with reliable tools. hexdump in conjunction with an encoding table is reliable. Browsers, which often try to guess an encoding and sometimes fail, are not.
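If no hexdump tool is at hand, the same check can be sketched in Perl itself: write the string through an IO layer, then read the file back as raw bytes and print them in hex. The caption, the temporary file, and the choice of UTF-8 as target encoding are all assumptions made for this example.

```perl
use strict;
use warnings;
use File::Temp qw(tempfile);

# Hypothetical caption, already decoded into a character string.
my $caption = "Gr\x{FC}\x{DF}e";    # "Grüße"

# Write it through an encoding layer.
# Assumption: the target encoding is UTF-8.
my ($fh, $path) = tempfile(UNLINK => 1);
binmode $fh, ':encoding(UTF-8)' or die "binmode failed: $!";
print {$fh} $caption;
close $fh or die "close failed: $!";

# Read the file back as raw bytes, with no decoding at all.
open my $in, '<:raw', $path or die "Cannot open $path: $!";
my $bytes = do { local $/; <$in> };
close $in;

# Dump the bytes in hex and compare them with an encoding table.
my $hex = join ' ', unpack('(H2)*', $bytes);
print "$hex\n";    # prints: 47 72 c3 bc c3 9f 65
```

Here c3 bc and c3 9f are the UTF-8 byte sequences for ü and ß, which is exactly what an encoding table lists for U+00FC and U+00DF.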
In reply to Re: How to encode for non-unicode output
by moritz
in thread How to encode for non-unicode output
by cheerful