If your string is properly decoded, \w and \p{Letter} match non-ASCII word characters too.
So one way to match that word is \w+. Without knowing what the regex should not match, it's hard to give more specific advice.
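To illustrate why decoding matters, here is a minimal sketch. The sample word "Grüße" and the assumption that the input arrives as UTF-8 bytes are mine, not from the original question; adjust the encoding name to whatever your data really uses.

```perl
use strict;
use warnings;
use Encode qw(decode);

# Hypothetical sample input: the raw UTF-8 bytes of "Grüße".
my $bytes = "Gr\xC3\xBC\xC3\x9Fe";

# Turn the bytes into a Perl character string.
my $text = decode('UTF-8', $bytes);

# On the undecoded byte string, \w stops at the first non-ASCII byte.
my ($undecoded) = $bytes =~ /(\w+)/;

# On the decoded string, \w covers the whole word.
my ($decoded) = $text =~ /(\w+)/;

binmode STDOUT, ':encoding(UTF-8)';
print "undecoded match: ", length($undecoded), " characters\n";
print "decoded match:   ", length($decoded), " characters\n";
print "all letters\n" if $text =~ /^\p{Letter}+$/;
```

If the data comes from a file or STDIN, an `:encoding(UTF-8)` layer on the filehandle should do the decoding for you, so the explicit decode() call is only needed for strings you receive as raw bytes.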
See also: Encodings and Unicode in Perl.
In reply to Re: Question On Unicode characters
by moritz
in thread Question On Unicode characters
by kprasanna_79