That works fine for XML since XML must specify its encoding within the document (binary format), but not so much with HTML where the encoding is specified outside of the document (text format).
I don't see any way of specifying the encoding of an HTML document, which is weird because XML::LibXML supposedly handles HTML.
XML::LibXML handles UTF-16 just fine.
In reply to Re^2: HTML parsing module handles known and unknown encoding
by ikegami
in thread HTML parsing module handles known and unknown encoding
by ambrus
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |