in reply to Re: UTF8 versus \w in pattern matching
in thread UTF8 versus \w in pattern matching

Is your source file encoded as UTF-8?

Yes, I am reading many UTF-8 files. As part of an earlier project, I have ensured that the input really is UTF-8. However, on two different systems, I get the problem that \w does not match any non-ASCII letters.