in reply to Re^2: Any good ways to handle NARROW NO-BREAK SPACE characters in regex in newer versions of Perl?
in thread Any good ways to handle NARROW NO-BREAK SPACE characters in regex in newer versions of Perl?

The #8239 popped in after submitting this post. It's not actually in the code.

Yeah, PerlMonks does that to Unicode characters in <code> blocks - see my node here.

In that case, I would suspect an encoding error - see ikegami's reply and my node here.

  • Comment on Re^3: Any good ways to handle NARROW NO-BREAK SPACE characters in regex in newer versions of Perl?
  • Download Code

Replies are listed 'Best First'.
Re^4: Any good ways to handle NARROW NO-BREAK SPACE characters in regex in newer versions of Perl?
by nysus (Parson) on Aug 13, 2024 at 16:03 UTC

    I copy and pasted the file name into a file and did a hex dump:

    00 01 02 03 04 05 06 07 - 08 09 0A 0B 0C 0D 0E 0F 012345678 +9ABCDEF 00000000 53 63 72 65 65 6E 73 68 - 6F 74 2D 32 30 32 34 2D Screensho +t-2024- 00000010 30 32 2D 32 33 2D 61 74 - 2D 31 2E 30 35 2E 31 34 02-23-at- +1.05.14 00000020 E2 80 AF 41 4D 2D 31 30 - 32 34 78 36 39 38 2E 70 ...AM-102 +4x698.p 00000030 6E 67 0A ng.

    E2 80 AF

    is the UTF8. I wonder if the acting of cutting and pasting is modifying the string. I'm using tmux.

    $PM = "Perl Monk's";
    $MC = "Most Clueless Friar Abbot Bishop Pontiff Deacon Curate Priest Vicar Parson";
    $nysus = $PM . ' ' . $MC;
    Click here if you love Perl Monks