in reply to uparse - Parse Unicode strings
See also: unichars and uniprops from Unicode::Tussle.
$ unichars '\p{Emoji}' | wc -l
178
$ unichars '\p{Emoji}' | head -n 30 | tail -n 20
8 U+0038 DIGIT EIGHT
9 U+0039 DIGIT NINE
© U+00A9 COPYRIGHT SIGN
® U+00AE REGISTERED SIGN
‼ U+203C DOUBLE EXCLAMATION MARK
⁉ U+2049 EXCLAMATION QUESTION MARK
™ U+2122 TRADE MARK SIGN
ℹ U+2139 INFORMATION SOURCE
↔ U+2194 LEFT RIGHT ARROW
↕ U+2195 UP DOWN ARROW
↖ U+2196 NORTH WEST ARROW
↗ U+2197 NORTH EAST ARROW
↘ U+2198 SOUTH EAST ARROW
↙ U+2199 SOUTH WEST ARROW
↩ U+21A9 LEFTWARDS ARROW WITH HOOK
↪ U+21AA RIGHTWARDS ARROW WITH HOOK
⌚ U+231A WATCH
⌛ U+231B HOURGLASS
⌨ U+2328 KEYBOARD
⏏ U+23CF EJECT SYMBOL
$ uniprops 🧑
U+1F9D1 ‹🧑› \N{ADULT}
\pS \p{So}
All Any Assigned Common Zyyy EBase Emoji_Modifier_Base Emoji Emoji_Presentation EPres Extended_Pictographic ExtPict
So S Gr_Base Grapheme_Base Graph X_POSIX_Graph GrBase Other_Symbol Print X_POSIX_Print Symbol
Sup_Symbols_And_Pictographs Supplemental_Symbols_And_Pictographs InSupSymbolsAndPictographs Unicode
$ uniprops U+1F9D1
U+1F9D1 ‹🧑› \N{ADULT}
\pS \p{So}
All Any Assigned Common Zyyy EBase Emoji_Modifier_Base Emoji Emoji_Presentation EPres Extended_Pictographic ExtPict
So S Gr_Base Grapheme_Base Graph X_POSIX_Graph GrBase Other_Symbol Print X_POSIX_Print Symbol
Sup_Symbols_And_Pictographs Supplemental_Symbols_And_Pictographs InSupSymbolsAndPictographs Unicode
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^2: uparse - Parse Unicode strings
by kcott (Archbishop) on Nov 19, 2023 at 11:21 UTC | |
|
Re^2: uparse - Parse Unicode strings (uparse/uchar/unichars/uniprops)
by eyepopslikeamosquito (Archbishop) on Nov 21, 2023 at 00:55 UTC |