in reply to What bytes are in character class \p{Pe} ClosePunctuation?

Look in .../unicore/lib/gc_sc/Pe.pl of your Perl distribution  (the exact path may vary slightly depending on Perl version).

Replies are listed 'Best First'.
Re^2: What bytes are in character class \p{Pe} ClosePunctuation?
by Anonymous Monk on Mar 16, 2010 at 08:08 UTC
    Thanks, that looks like what I want, but what is the format? Is 301E    301F a range, or two entries?
      The syntax of the format is described in the perlunicode manual page.

      By looking at the first entry in Ll.pl, which is 0061    007A, I would guess it's a range.

      --
       David Serrano
       (Please treat my english text just like Perl code, i.e. feel free to notify me of any syntax, grammar, style and/or spelling error. Thank you!).

        I would guess it's a range

        ...which is also confirmed by this piece of code from mktables (which creates those tables):

        sub Table::Write { ... for my $set (@$Table) { my $start = $set->[RANGE_START]; my $end = $set->[RANGE_END]; my $name = $set->[RANGE_NAME]; if ($start == $end) { push @OUT, sprintf "%04X\t\t%s\n", $start, $name; } else { push @OUT, sprintf "%04X\t%04X\t%s\n", $start, $end, $name +; } }