in reply to Guess between UTF8 and Latin1/ISO-8859-1
If all else fails, you might take a look at--and get a good laugh from--Verifying Unicode (The mother of all regex)..
Quick it isn't--but it is thorough and entirely correct according to the XML spec.
|
|---|