Sure, but the Han script is probably about 40000 characters big: no way to write a list by hand.
That's why my example queries each character for the Unicode property \p{Han}, ie if the character is in that script block.
For a better description of Unicode properties and script blocks in Regexes I recommend "Mastering Regular Expressions" by Jeffrey Friedl, pages 121pp.
In reply to Re^3: The “real length" of UTF8 strings
by moritz
in thread The “real length" of UTF8 strings
by Anonymous Monk
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |