in reply to Re^2: UTF-8 entities in XML/HTML?
in thread UTF-8 entities in XML/HTML?

Hi, Jot -

I was working with the SALSA corpus of syntactically and semantically annotated German newspaper sentences. The corpus follows the TIGER annotation standards.

In the corpus, a UTF-8 encoded lowercase German a-umlaut ('ä'), e.g., would be rendered in ISO-8859-1 as ä. I am not sure, however, which encoding variant of those you mention this corresponds to.

Hope this helps anyway.

Pat