in reply to getting Unicode character names from string
Feel free to use any code from my uchar-nopro script that I use on a daily basis in order to analyze encodings in undocumented data files:
$ uchar-nopro -v λαάὰὸς
λ U003bb \N{GREEK SMALL LETTER LAMDA}
α U003b1 \N{GREEK SMALL LETTER ALPHA}
ά U01f71 \N{GREEK SMALL LETTER ALPHA WITH OXIA}
ὰ U01f70 \N{GREEK SMALL LETTER ALPHA WITH VARIA}
ὸ U01f78 \N{GREEK SMALL LETTER OMICRON WITH VARIA}
ς U003c2 \N{GREEK SMALL LETTER FINAL SIGMA}
|
|---|