in reply to Re^3: split text into words -- Unicode problem (I guess)
in thread split text into words -- Unicode problem (I guess)

If your script is written in utf8, use utf8 is needed to tell Perl about it. See more in utf8.