in reply to Unicode::UTF8 and perl Unicode compatibility

Problem that I don't really understand how it would be compatible with perl own unicode implementation

Don't worry about it

What will happen if some Unicode 6 characters recognized by this module, but then misinterpreted by perl?

This wouldn't happen. Once octets/bytes are decoded into characters, they're characters (codepoints)

possible insignificant edge case (including but not limited to security issues), for all perl 5.8.8+. But I am not sure where to start.

If I were you I wouldn't even start :) Why ? Because starting is starting to sound more and more like reinventing-Unicode::UTF8, or tracking-perl-bugs-since-2006-six-decades-ago

I would stick with 5.18.x

  • Comment on Re: Unicode::UTF8 and perl Unicode compatibility

Replies are listed 'Best First'.
Re^2: Unicode::UTF8 and perl Unicode compatibility
by vsespb (Chaplain) on Aug 31, 2013 at 15:31 UTC
    This wouldn't happen. Once octets/bytes are decoded into characters, they're characters (codepoints)
    So, you think, once it decoded to characters, it will work perfectly without breaking anything?

    What about utf8::valid(). Will it pass? Will it affect anything?

    What if I try encoding back to bytes with Encode::encode("UTF-8" .. ?
    I would stick with 5.18.x
    No, I specified in OP that I need compatibility with any version starting from perl5.8.8.

      :)

      So, you think, once it decoded to characters, it will work perfectly without breaking anything?

      Yes, in so much as once Unicode::UTF8 does its thing its done, your perl takes over (with all that entails)

      What about utf8::valid(). Will it pass? Will it affect anything?

      I think it will "pass" and will not "affect anything", but I don't see how it matters -- by using Unicode::UTF8 you're saying the hell with Encode.pm / utf8.pm , i'll let Unicode::UTF8 take care of everything , so there should be no reason to consult utf8 or Encode

      What if I try encoding back to bytes with Encode::encode("UTF-8" .. ?

      That ought to work fine as well (call me optimistic)

      No, I specified in OP that I need compatibility with any version starting from perl5.8.8.

      Yes, I've read this, I understand, and its why I didn't make jokes :) food for thought: Re: Why upgrade perl?, Re: perldeltas - every perl*delta in one file (pod.lst)

        Ok, thanks.