in reply to length() miscounting UTF8 characters?

Would be kind of handy to have a few of those words you are reading in to test against... :-)

...the majority is always wrong, and always the last to know about it...
Insanity: Doing the same thing over and over again and expecting different results...
  • Comment on Re: length() miscounting UTF8 characters?

Replies are listed 'Best First'.
Re^2: length() miscounting UTF8 characters?
by AppleFritter (Vicar) on Apr 27, 2014 at 21:21 UTC

    Certainly! Here's an excerpt:

    æ æð æða æðaber æðahnútur æðakölkun æðardúnn æðarfugl æðarkolla æðarkóngur æðarvarp æði æðimargur æðisgenginn æðiskast æðislegur æðrast æðri æðrulaus æðruleysi æðruorð æðrutónn æðstur æður æfa

    Which produces the following output:

    2 æ 4 æð 4 æð 4 æð 6 æðú 6 æðö 6 æðú 4 æð 4 æð 6 æðó 4 æð 4 æð 4 æð 4 æð 4 æð 4 æð 4 æð 4 æð 4 æð 4 æð 6 æðð 6 æðó 4 æð 4 æð 2 æ