http://qs1969.pair.com?node_id=11155690


in reply to uparse - Parse Unicode strings

See also: unichars and uniprops from Unicode::Tussle.

$ unichars '\p{Emoji}' | wc -l
178

$ unichars '\p{Emoji}' | head -n 30 | tail -n 20
 8  U+0038 DIGIT EIGHT
‭ 9  U+0039 DIGIT NINE
‭   U+00A9 COPYRIGHT SIGN
‭   U+00AE REGISTERED SIGN
‭ ‼  U+203C DOUBLE EXCLAMATION MARK
‭ ⁉  U+2049 EXCLAMATION QUESTION MARK
‭   U+2122 TRADE MARK SIGN
‭ ℹ  U+2139 INFORMATION SOURCE
‭ ↔  U+2194 LEFT RIGHT ARROW
‭ ↕  U+2195 UP DOWN ARROW
‭ ↖  U+2196 NORTH WEST ARROW
‭ ↗  U+2197 NORTH EAST ARROW
‭ ↘  U+2198 SOUTH EAST ARROW
‭ ↙  U+2199 SOUTH WEST ARROW
‭ ↩  U+21A9 LEFTWARDS ARROW WITH HOOK
‭ ↪  U+21AA RIGHTWARDS ARROW WITH HOOK
‭ ⌚ U+231A WATCH
‭ ⌛ U+231B HOURGLASS
‭ ⌨  U+2328 KEYBOARD
‭ ⏏  U+23CF EJECT SYMBOL

$ uniprops 🧑
U+1F9D1 🧑 \N{ADULT}
    \pS \p{So}
    All Any Assigned Common Zyyy EBase Emoji_Modifier_Base Emoji Emoji_Presentation EPres Extended_Pictographic ExtPict
       So S Gr_Base Grapheme_Base Graph X_POSIX_Graph GrBase Other_Symbol Print X_POSIX_Print Symbol
       Sup_Symbols_And_Pictographs Supplemental_Symbols_And_Pictographs InSupSymbolsAndPictographs Unicode

$ uniprops U+1F9D1
U+1F9D1 🧑 \N{ADULT}
    \pS \p{So}
    All Any Assigned Common Zyyy EBase Emoji_Modifier_Base Emoji Emoji_Presentation EPres Extended_Pictographic ExtPict
       So S Gr_Base Grapheme_Base Graph X_POSIX_Graph GrBase Other_Symbol Print X_POSIX_Print Symbol
       Sup_Symbols_And_Pictographs Supplemental_Symbols_And_Pictographs InSupSymbolsAndPictographs Unicode

Replies are listed 'Best First'.
Re^2: uparse - Parse Unicode strings
by kcott (Archbishop) on Nov 19, 2023 at 11:21 UTC

    Thanks for that. There's a huge amount of documentation to go through. I've had a brief look and it seems like there are a number of very useful tools.

    — Ken

Re^2: uparse - Parse Unicode strings (uparse/uchar/unichars/uniprops)
by eyepopslikeamosquito (Archbishop) on Nov 21, 2023 at 00:55 UTC