in reply to How to remove wide chars (ex: 年 or 或) from a text file?
(Note that the number of hex digits per character may vary.) In the case of the OP data sample, that regex leaves 7 lines that are either empty or contain just hyphens and/or spaces.s/\&#x\w+;//g;
|
|---|