in reply to convert ambiguous letters in a DNA sequence to all possible non-ambiguous letters
For strings with upto say a half a dozen or so wildcards, glob does this quite efficiently:
$in = 'ARGBCTC';; $in =~ s[[^ACGT]][{A,C,G,T}]g;; print $in;; A{A,C,G,T}G{A,C,G,T}CTC print for glob $in;; AAGACTC AAGCCTC AAGGCTC AAGTCTC ACGACTC ACGCCTC ACGGCTC ACGTCTC AGGACTC AGGCCTC AGGGCTC AGGTCTC ATGACTC ATGCCTC ATGGCTC ATGTCTC
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re^2: convert ambiguous letters in a DNA sequence to all possible non-ambiguous letters
by 7stud (Deacon) on Feb 16, 2013 at 02:55 UTC | |
|
Re^2: convert ambiguous letters in a DNA sequence to all possible non-ambiguous letters
by lrl1997 (Novice) on Feb 18, 2013 at 23:49 UTC |