in reply to Re: HTML parsing module handles known and unknown encoding
in thread HTML parsing module handles known and unknown encoding

This extracts the content-type from the meta tag, which is a good start, but it's not a complete solution. Here are some additional parts a complete solution will need.