Спасибо, анонимный монах. I try to run all the posted source on threads where I'm OP, and I was very pleased to run yours and have an iconv command that worked 100 percent. The command gave me a lot of partial credit for failed attempts, which helped diagnose the way. I sense that you are experienced with cyrillic encodings, so I'm very happy to have your attention to my issues, which must seem parochial by your standards.
$ printf '\xf0\xd2\xc9\xd7\xc5\xd4\n' > 1.file $ iconv -f koi8-r 1.file Привет $ iconv -f koi8-r 1.file -o 1.prubyet $ file 1.prubyet 1.prubyet: UTF-8 Unicode text $ cat 1.prubyet Привет $ cat 1.file ������ $ file 1.file 1.file: ISO-8859 text
I know how these look to in the terminal and in my editor. 3.file shows the cyrillic greeting. 1.prubyet has six diamonds with question marks in the middle.
I wondered what diff would think of them:
$ echo Привет >3.file $ diff 1.file 3.file 1c1 < ������ --- > Привет $
I'm looking at 1.file and 3.file in the hex editor. 1.file was exactly what I expected, but 3.file has one value more than the 12 I expected. (?)
D0 9F D1 80 D0 B8 D0 B2 D0 B5 D1 82 0AI'd hoped this renders faithfully with monastery code tags. Do I gather that code tags unravel things that aren't us-ascii? Has anyone ever suggested having a form of code tag that did not do this?
In reply to Re^2: create clone script for utf8 encoding
by Aldebaran
in thread create clone script for utf8 encoding
by Aldebaran
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |