"+¦" is suppose to represent "ö"? So at least 10 bytes? No one encoding would produce that. Even in UTF-8, ö only takes two bytes. Repeatedly encoding using UTF-8 doesn't result in anything like what you have either. (There would be repetition.)