A few weeks ago, I found some odd text in a <code> section on a node, and figured out that it was UTF-8 bytes being presented as Latin-1. I examined the source for the page, and did not see any mention of "charset", as in a "charset=something" or "charset: something".
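
For anyone who wants to reproduce the symptom, here is a minimal sketch of what "UTF-8 bytes presented as Latin-1" looks like. The sample string and the function names are made up for illustration; they are not taken from the node in question.

```python
def misread_as_latin1(text: str) -> str:
    """Encode text as UTF-8, then decode those bytes as if they were Latin-1."""
    return text.encode("utf-8").decode("latin-1")


def repair(garbled: str) -> str:
    """Undo the misreading: recover the raw bytes, then decode them as UTF-8."""
    return garbled.encode("latin-1").decode("utf-8")


if __name__ == "__main__":
    sample = "caf\u00e9"               # hypothetical sample: "cafe" with e-acute
    garbled = misread_as_latin1(sample)
    print(garbled)                     # the e-acute shows up as two characters
    print(repair(garbled) == sample)   # the round trip restores the original
```

Each non-ASCII character becomes two or more Latin-1 characters, which is the usual giveaway. The repair only works if nothing in between has altered the bytes.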

Someone else mentioned that this site is "in" Latin-1. Is that marked somewhere in the HTML where a plain text search would miss it, such as in a LINK of some kind, or is it just a default?

Before I continue with some musings on the subject, I'd like to get all my facts straight.

—John

Re: Text encoding on this site's HTML
by FamousLongAgo (Friar) on Dec 23, 2002 at 05:14 UTC
    Encoding is often set by an HTTP header. When I telnet to perlmonks on port 80 and ask for the headers, I get this:
    Content-Type: text/html; charset=iso-8859-1
    Does that help?
      Yes, thanks. Now I know that it is specifically stated somewhere.
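
The header check described above can also be done without telnet. This is a rough sketch using only the Python standard library. The function names are hypothetical, and `fetch_charset` assumes the server answers a HEAD request with a Content-Type header.

```python
from email.message import Message
import urllib.request


def charset_from_content_type(value, default=None):
    """Pull the charset parameter out of a Content-Type header value."""
    msg = Message()
    msg["Content-Type"] = value
    # get_content_charset() returns the charset lowercased, or `default`
    # when the header carries no charset parameter.
    return msg.get_content_charset(default)


def fetch_charset(url):
    """Hypothetical helper: send a HEAD request and report the declared charset."""
    request = urllib.request.Request(url, method="HEAD")
    with urllib.request.urlopen(request) as response:
        return response.headers.get_content_charset()


if __name__ == "__main__":
    # The header value quoted in the reply above.
    print(charset_from_content_type("text/html; charset=iso-8859-1"))
```

A result of `None` means the header did not name a charset, in which case a META tag in the page or the browser's default decides the encoding.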