in reply to Regular expression for chinese character

First, learn how Perl regular expressions work; perlretut is a good place to start. For example, /\d+/ matches a run of one or more digits (0 through 9, plus other Unicode digits unless you add the /a modifier). In the same way, you can match a run of Han (Chinese) characters, as opposed to ASCII or other Latin, Greek, etc. characters.

Perl has built-in Unicode character properties, including the "Han" script (written \p{Han} in a pattern), which another poster illustrated.

So, use a pattern that finds all occurrences of Han characters within your mixed text.
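
A minimal sketch of that idea, assuming the text is already a decoded Perl character string (here a UTF-8 literal under "use utf8"). The sample string and the variable names are made up for illustration; if you read from a file or STDIN you would need to decode the input first.

```perl
use strict;
use warnings;
use utf8;                               # this source file contains UTF-8 literals
binmode(STDOUT, ':encoding(UTF-8)');    # encode output so the characters print cleanly

# Hypothetical sample of mixed Chinese and English text.
my $text = 'Hello 你好, this is 中文 mixed with English.';

# \p{Han} matches a single Han character; the + groups adjacent ones into a run.
# In list context with /g, the match returns every captured run.
my @runs = $text =~ /(\p{Han}+)/g;

print "$_\n" for @runs;
```

Dropping the + would give one list element per character instead of one per run.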