When downloading chess games with international players I have to decode utf8 coded names. This works as far as the names include only simple acute, umlaut etc. characters. In this case both "
" do the job. On the other hand especially with russian players where the name includes cyrillic characters the names seem to be coded in a more complicated way. In the latter case utf8 decoding results in a string which seems to be utf8 coded itself. This gave me the idea to apply decoding twice. However this only works with "
" (it reports an error for the second time - string not being an utf8 string).
My questions: how do I know that I have to apply decoding more than once and why different utf8 decoders behave differently in perl?