Hi monks,
i have alot of DNA strings with only characters of A, C, G or T. This string is now 32 bits, while I only need two bits for all the characters;
A = 00
C = 01
G = 10
T = 11
How can i convert, for example, 'AGTCACA' to a more compact string with less bits?
Basically, I want to store this string to a hash and be able to compare them, and see if substrings are availabe. If so, it needs to be printed.
Cheers,
Marten