in reply to Re: \b in Unicode regex
in thread \b in Unicode regex
The actual strings are quite a mess. I just wanted to know whether there's some issue with \b in Unicode. If you insist, then $string is something like
8^1589-20170113-102647-ויחי-דב +12;י_הספד_על_הר +ב_משה_שפירא.mp3 +^עברית^הרב מ +504;שה גולד^ויח +י-דברי הספד  +506;ל הרב משה שפ +;ירא, טו' טבת, ת +;שע'ז^שיעורי +501; בתנ"ך ובפרש +;ת השבוע|שיע +493;רים בפרשת ה +שבוע|שיעור• +7;ם קודמים|בר&# +1488;שית|ויחי
and $_ is just
שפירא
(it's hebrew, and I'm afraid your broweser might mess up the right-to-left presentation, or even just show the Unicode numbers instead of the characters themselves. My browser makes a mess here. That's why I didn't think posting the strings would help).
In Section
Seekers of Perl Wisdom