in reply to Re^4: regexing for non-standard characters...
in thread regexing for non-standard characters...

It took me a while to get around to it, but I have updated unichist -- count/summarize characters in data so that it uses the version of "Blocks.txt" that comes with Perl, so the next time someone asks "what version of Unicode does the tool use", it will be correct to say "the same version used by Perl (whatever Perl version you happen to be using)".

I checked on the "Scripts.txt" file, but I didn't see a good enough reason for incorporating it in addition to "Blocks.txt" -- the latter is sufficient for what "unichist" was meant to do.

  • Comment on Re^5: regexing for non-standard characters...