I am trying to process texts (actually, those are HTML pages) that contain text in either or both languages: {English, Hebrew}. The Hebrew text is written either in
CP1255 or in ISO-8859-8 or in UTF-8.
My questions: How can I detect which encoding is used in the texts that I process?