Re: Base64 and byte arrays

Strings are internally stored "C" byte arrays. You can even do some traditionally numerical operations on them.

use MIME::Base64 qw( decode_base64 );

my $scrambled = decode_base64($_);
my $orig      = substr($scrambled, 0, -1) ^ substr($scrambled, 1);
[download]

But if you prefer to deal with an array of numbers, no problem! ord converts allows a character to be treated as a number, and chr does the inverse.

use MIME::Base64 qw( decode_base64 );

my $scrambled = decode_base64($_);
my @scrambled = map ord, split //, $scrambled;
my @orig      = map { $scrambled[$_-1] ^ $scrambled[$_] } 1..$#scrambl
+ed;
my $orig      = join '', map chr, @orig;
[download]

Comment on Re: Base64 and byte arrays Select or Download Code

Replies are listed 'Best First'.
Re^2: Base64 and byte arrays by joec_ (Scribe) on Feb 05, 2009 at 00:50 UTC
thank you thank you thank you! your first example did precisely what i needed!	[reply]
Re^2: Base64 and byte arrays by jwkrahn (Abbot) on Feb 05, 2009 at 02:52 UTC
No need to split and then rejoin the string: `use MIME::Base64 qw( decode_base64 ); ( my $orig = decode_base64( $_ ) ) =~ s/(.)(?=(.))/ $1 ^ $2 /seg;` [download]	[reply] [d/l]
Re^3: Base64 and byte arrays by ikegami (Patriarch) on Feb 05, 2009 at 03:18 UTC
If you're going to use string xor, why do it a character at a time? The point of the second snippet was to show how to get an array of bytes for future reference. And your solution doesn't work. It appends a junk character.	[reply]
Re^4: Base64 and byte arrays by jwkrahn (Abbot) on Feb 05, 2009 at 04:38 UTC
If you're going to use string xor, why do it a character at a time? Why indeed? `use MIME::Base64 qw( decode_base64 ); my $orig = decode_base64( $_ ); $orig ^= substr $orig, 1;` [download] And your solution doesn't work. It appends a junk character. joec_'s explaination didn't say what to do with the last byte so I left it in.	[reply] [d/l]
Re^5: Base64 and byte arrays by ikegami (Patriarch) on Feb 05, 2009 at 05:01 UTC
Re^2: Base64 and byte arrays by gone2015 (Deacon) on Feb 05, 2009 at 16:36 UTC
Or, so that they don't feel left out, `pack` and `unpack`: `use MIME::Base64 qw( decode_base64 ); my $scrambled = decode_base64($_); my @scrambled = unpack('C', $scrambled) ; my @orig = map { $scrambled[$_-1] ^ $scrambled[$_] } 1..$#scram +bled; my $orig = pack('C', @orig) ;` [download] In this case the magic byte-wise string xor (`^`) is clearly the best bet. But in general I'd use `pack`/`unpack` to do string <-> byte array -- and, of course, so much more. However, watch out for utf8 strings... `unpack('C', ..)` will unpack a utf8 string as characters, so may return values > 0xFF -- so: `my $b = "123\xC4" ; my $u = "a\xC4\x{107}\x{1C4}" ; show(unpack('C', $b)) ; # 0x31, 0x32, 0x33, 0xC4 show(unpack('C', $u)) ; # 0x61, 0xC4, 0x107, 0x1C4 { use bytes ; show(unpack('C', $b)) ; # 0x31, 0x32, 0x33, 0xC4 show(unpack('C', $u)) ; # 0x61, 0xC3, 0x84, 0xC4, 0x87, 0xC7, 0x +84 } ; sub show { print join(", ", map(sprintf("0x%02X", $_), @_)), "\n" ; } ;` [download] as shown, `use bytes` causes the `unpack` to operate on the bytes of encoded form of the utf8 string. Note that with utf8 strings, `pack('C', ...` and `unpack('C', ...` are not symmetrical, observe: `my $u = "a\xC4\x{107}\x{1C4}" ; my @u = unpack('C', $u) ; show(@u) ; # 0x61, 0xC4, 0x107, 0x1C4 bytes(pack('C',@u)) ; # Character in 'C' format wrapped in pac +k ... # Character in 'C' format wrapped in pac +k ... # 61:C4:07:C4 -- byte sub show { print join(", ", map(sprintf("0x%02X", $_), @_)), "\n" ; } ; sub bytes { my ($s) = @_ ; my $w = utf8::is_utf8($s) ? "utf8" : "byte" ; use bytes ; print join(":", map(sprintf("%02X", $_), unpack('C', $s))), " -- +$w\n" ; } ;` [download] but this is wandering off topic into a significantly murky area, of possibly marginal utility...	[reply] [d/l] [select]