
Re^4: Length and Chomp ??

by afoken (Chancellor)
on Aug 23, 2009 at 19:22 UTC (#790684=note)

in reply to Re^3: Length and Chomp ??
in thread Length and Chomp ??

Some bean counting:

With a Unicode argument, length returns the number of characters in the argument. Unicode has the (not so) new / unusual / odd property that a character may be represented by more than one byte.

With a non-Unicode / pre-Unicode / legacy encoding argument, length still returns the number of characters in the argument. Those legacy encodings have the old / usual / familiar property that a character is represented by exactly one byte.

So, there is no need to remember any special cases. length always returns the character count.

Before Unicode support was added to Perl, there was no need to distinguish between byte and character, because both were equal. And as long as you don't work with Unicode, they still are. The quote from perlfunc, "if the EXPR is in Unicode, you will get the number of characters, not the number of bytes", is a hint that bytes and characters are different things when you work with Unicode, nothing more, nothing less.
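
To make the byte / character distinction concrete, here is a minimal sketch. It assumes the core Encode module is available and uses a made-up sample string ("café" written out as UTF-8 bytes); the variable names are only for illustration.

```perl
use strict;
use warnings;
use Encode qw(decode encode);

# Sample data (hypothetical): "café" as raw UTF-8 bytes.
# The "é" occupies two bytes, 0xC3 0xA9.
my $bytes = "caf\xC3\xA9";

# Decoding turns the byte string into a character string.
my $chars = decode('UTF-8', $bytes);

# Undecoded: every byte is one character, so length counts 5.
my $len_bytes = length $bytes;

# Decoded: length counts 4 characters, not 5 bytes.
my $len_chars = length $chars;

# Encoding back to UTF-8 yields a byte string again, so 5 once more.
my $len_reencoded = length encode('UTF-8', $chars);

print "undecoded:  $len_bytes\n";
print "decoded:    $len_chars\n";
print "re-encoded: $len_reencoded\n";
```

In all three cases length does the same thing: it counts characters. What changes is whether Perl has been told that the data is UTF-8, or is simply treating each byte as one character.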


Today I will gladly share my knowledge and experience, for there are no sweeter words than "I told you so". ;-)
