The actual strings are quite a mess. I just wanted to know whether there's some issue with \b in Unicode. If you insist, then $string is something like
8^1589-20170113-102647-ויחי-דב +12;י_הספד_על_הר +ב_משה_שפירא.mp3 +^עברית^הרב מ +504;שה גולד^ויח +י-דברי הספד  +506;ל הרב משה שפ +;ירא, טו' טבת, ת +;שע'ז^שיעורי +501; בתנ"ך ובפרש +;ת השבוע|שיע +493;רים בפרשת ה +שבוע|שיעור• +7;ם קודמים|בר&# +1488;שית|ויחי
and $_ is just
שפירא
(It's Hebrew, and I'm afraid your browser might mess up the right-to-left presentation, or even just show the Unicode numbers instead of the characters themselves. My browser makes a mess here. That's why I didn't think posting the strings would help.)
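For what it's worth, here is a minimal sketch of the kind of check I have in mind. It assumes the question comes down to whether the strings are decoded characters or raw UTF-8 bytes when the regex runs. The sample `$string` is a shortened, hypothetical stand-in for the real one, and `$word` stands in for `$_`. This is not my actual code.

```perl
use strict;
use warnings;
use utf8;                  # the Hebrew literals below are UTF-8 source text
use Encode qw(encode);

# Hypothetical, shortened stand-ins for $string and $_ from the post.
my $string = 'דברי הספד על הרב משה שפירא, טבת';
my $word   = 'שפירא';

# Decoded character strings: Hebrew letters count as \w,
# so \b can see the edges of the word.
my $chars_match = $string =~ /\b\Q$word\E\b/ ? 1 : 0;

# The same data as raw UTF-8 bytes: each Hebrew letter becomes two bytes
# >= 0x80, none of which is \w, so \b finds no boundary around the word.
my $string_bytes = encode('UTF-8', $string);
my $word_bytes   = encode('UTF-8', $word);
my $bytes_match  = $string_bytes =~ /\b\Q$word_bytes\E\b/ ? 1 : 0;

print "decoded characters: $chars_match\n";
print "raw bytes: $bytes_match\n";
```

If that assumption holds, the first test should match and the second should not, which would suggest decoding the input (for example with `Encode::decode`) before matching rather than any problem with `\b` itself.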
In reply to Re^2: \b in Unicode regex
by Arik123
in thread \b in Unicode regex
by Arik123