Anonymous Monk has asked for the wisdom of the Perl Monks concerning the following question:
where 'ATG' and 'TAA' are somewhere within it but now in the start and end positions, as they should.GTGGCAAATGCAGAACGTTTTCTGCGTGTTGCCGATATTCTGGAAAGCAATGCCAGGCAAGGGCAGGTAG +CGACCGTACTTTCCGCCCCCGCGAAAATTACCAACCATCTGGTGGCGATGATTGAAAAAACTATCGGCG +GTCAGGATGCTTTGCCGAATATCAGCGATGCCGAACGTATTTTTTCTGACCTGCTCGCAGGACTTGCCA +GCGCGCAGCCGGGATTCCCGCTTGCACGGTTGAAAATGGTTGTCGAACAAGAATTCGCTCAGATCAAAC +ATGTTCTGCATGGTATCAGCCTGCTGGGTCAGTGCCCGGATAGCATCAACGCCGCGCTGATTTGCCGTG +GCGAAAAAATGTCGATCGCGATTATGGCGGGACTCCTGGAGGCGCGTGGACATCGCGTCACGGTGATTG +ATCCGGTAGAAAAATTGCTGGCGGTGGGCCATTACCTTGAATCTACCGTCGATATCGCGGAATCGACTC +GCCGTATCGCCGCCAGCCAGATCCCAGCCGATCACATGATCCTGATGGCGGGCTTTACCGCCGGTAATG +AAAAGGGTGAACTGGTGGTGCTGGGCCGTAATGGTTCCGACTATTCCGCCGCCGTGCTGGCCGCCTGTT +TACGCGCTGACTGCTGTGAAATCTGGACTGACGTCGATGGCGTGTATACCTGTGACCCGCGTCAGGTGC +CGGACGCCAGGCTGCTGAAATCGATGTCCTACCAGGAAGCGATGGAACTCTCTTACTTCGGCGCCAAAG +TCCTTCACCCTCGCACCATTACGCCCATCGCCCAGTTCCAGATCCCCTGTCTGATTAAAAATACCGGTA +ATCCGCAGGCGCCAGGAACGCTGATCGGCGCGTCCAGCGACGATGATAACCTACCAGTTAAAGGGATCT +CTAACCTTAACAACATGGCGATGTTTAGCGTCTCCGGCCCGGGAATGAAAGGGATGATTGGGATGGCGG +CGCGTGTTTTCGCCGCCATGTCTCGCGCCGGGATCTCGGTGGTGCTCATTACCCAGTCCTCCTCTGAGT +ACAGCATCAGTTTCTGTGTGCCGCAGAGTGACTGCGCGCGTGCCCGCCGTGCGATGCAGGATGAGTTCT +ATCTGGAGCTGAAAGAGGGGCTGCTGGAGCCGCTGGCGGTTACGGAGCGGTTGGCGATTATCTCTGTTG +TCGGCGACGGTATGCGCACGCTACGCGGCATTTCAGCGAAATTCTTCGCCGCGCTGGCGCGGGCCAATA +TCAATATCGTGGCGATCGCTCAGGGATCTTCTGAGCGTTCCATTTCTGTGGTGGTGAATAACGACGATG +CCACCACCGGCGTGCGGGTAACGCACCAGATGCTGTTCAATACCGATCAGGTGATTGAAGTGTTTGTCA +TTGGCGTCGGCGGCGTCGGCGGCGCGCTACTGGAACAGCTTAAACGTCAGCAAACCTGGTTGAAGAACA +AGCACATCGATCTACGCGTGTGCGGCGTGGCGAACTCAAAGGCGTTGCTAACCAATGTGCATGGCCTGA +ATCTGGACAACTGGCAGGCGGAACTGGCGCAAGCGAACGCGCCGTTCAATCTGGGACGCTTAATTCGCC +TGGTGAAAGAATATCATCTACTCAATCCGGTGATTGTTGATTGCACCTCCAGTCAGGCGGTGGCCGACC +AGTATGCTGACTTCCTGCGTGAAGGATTCCATGTGGTGACGCCAAACAAGAAAGCGAACACCTCGTCGA +TGGACTACTACCATCAGCTACGTTTCGCCGCCGCGCAATCACGGCGCAAATTCTTGTATGACACCAACG +TCGGCGCCGGTTTGCCGGTAATCGAAAACCTGCAAAACCTGCTGAATGCGGGTGATGAACTGCAAAAAT +TTTCCGGCATTCTTTCCGGGTCGCTCTCTTTTATTTTCGGTAAACTGGAAGAGGGGATGAGTCTCTCAC +AGGCGACCGCCCTGGCGCGCGAGATGGGCTATACCGAACCCGATCCGCGCGACGATCTTTCCGGTATGG +ATGTGGCGCGGAAACTGTTGATCCTCGCCCGCGAGACGGGCCGCGAGCTGGAGCTTTCCGATATTGTGA +TTGAACCGGTGTTGCCGGACGAGTTTGACGCCTCCGGCGATGTGACCACCTTTATGGCGCATCTGCCGC +AGCTTGACGACGCGTTTGCCGCCCGTGTGGCGAAAGCTCGTGATGAAGGTAAGGTATTGCGCTATGTGG +GCAATATCGAAGAGGATGGCGTGTGCCGCGTGAAGATTGCCGAAGTTGATGGTAACGATCCGCTCTTCA +AAGTGAAAAACGGTTAAGAAAACGCGCTGGCGTTCTACAGCCACTATTATCAGCCCTTGCCGTTGGTGC +TGCGCGGCTACGGCGCAGGCAATGATGTGACGGCGGCGGGCGTGTTTGCCGATCTGTTACGGACCCTCT +CATGGAAGTTAGGAGTT
$seq='GTGGCAAATGCAGAACGTTTTCTGCGTGTTGCCGATATTCTGGAAAGCAATGCCAGGCAAGGGC +AGGTAGCGACCGTACTTTCCGCCCCCGCGAAAATTACCAACCATCTGGTGGCGATGATTGAAAAAACTA +TCGGCGGTCAGGATGCTTTGCCGAATATCAGCGATGCCGAACGTATTTTTTCTGACCTGCTCGCAGGAC +TTGCCAGCGCGCAGCCGGGATTCCCGCTTGCACGGTTGAAAATGGTTGTCGAACAAGAATTCGCTCAGA +TCAAACATGTTCTGCATGGTATCAGCCTGCTGGGTCAGTGCCCGGATAGCATCAACGCCGCGCTGATTT +GCCGTGGCGAAAAAATGTCGATCGCGATTATGGCGGGACTCCTGGAGGCGCGTGGACATCGCGTCACGG +TGATTGATCCGGTAGAAAAATTGCTGGCGGTGGGCCATTACCTTGAATCTACCGTCGATATCGCGGAAT +CGACTCGCCGTATCGCCGCCAGCCAGATCCCAGCCGATCACATGATCCTGATGGCGGGCTTTACCGCCG +GTAATGAAAAGGGTGAACTGGTGGTGCTGGGCCGTAATGGTTCCGACTATTCCGCCGCCGTGCTGGCCG +CCTGTTTACGCGCTGACTGCTGTGAAATCTGGACTGACGTCGATGGCGTGTATACCTGTGACCCGCGTC +AGGTGCCGGACGCCAGGCTGCTGAAATCGATGTCCTACCAGGAAGCGATGGAACTCTCTTACTTCGGCG +CCAAAGTCCTTCACCCTCGCACCATTACGCCCATCGCCCAGTTCCAGATCCCCTGTCTGATTAAAAATA +CCGGTAATCCGCAGGCGCCAGGAACGCTGATCGGCGCGTCCAGCGACGATGATAACCTACCAGTTAAAG +GGATCTCTAACCTTAACAACATGGCGATGTTTAGCGTCTCCGGCCCGGGAATGAAAGGGATGATTGGGA +TGGCGGCGCGTGTTTTCGCCGCCATGTCTCGCGCCGGGATCTCGGTGGTGCTCATTACCCAGTCCTCCT +CTGAGTACAGCATCAGTTTCTGTGTGCCGCAGAGTGACTGCGCGCGTGCCCGCCGTGCGATGCAGGATG +AGTTCTATCTGGAGCTGAAAGAGGGGCTGCTGGAGCCGCTGGCGGTTACGGAGCGGTTGGCGATTATCT +CTGTTGTCGGCGACGGTATGCGCACGCTACGCGGCATTTCAGCGAAATTCTTCGCCGCGCTGGCGCGGG +CCAATATCAATATCGTGGCGATCGCTCAGGGATCTTCTGAGCGTTCCATTTCTGTGGTGGTGAATAACG +ACGATGCCACCACCGGCGTGCGGGTAACGCACCAGATGCTGTTCAATACCGATCAGGTGATTGAAGTGT +TTGTCATTGGCGTCGGCGGCGTCGGCGGCGCGCTACTGGAACAGCTTAAACGTCAGCAAACCTGGTTGA +AGAACAAGCACATCGATCTACGCGTGTGCGGCGTGGCGAACTCAAAGGCGTTGCTAACCAATGTGCATG +GCCTGAATCTGGACAACTGGCAGGCGGAACTGGCGCAAGCGAACGCGCCGTTCAATCTGGGACGCTTAA +TTCGCCTGGTGAAAGAATATCATCTACTCAATCCGGTGATTGTTGATTGCACCTCCAGTCAGGCGGTGG +CCGACCAGTATGCTGACTTCCTGCGTGAAGGATTCCATGTGGTGACGCCAAACAAGAAAGCGAACACCT +CGTCGATGGACTACTACCATCAGCTACGTTTCGCCGCCGCGCAATCACGGCGCAAATTCTTGTATGACA +CCAACGTCGGCGCCGGTTTGCCGGTAATCGAAAACCTGCAAAACCTGCTGAATGCGGGTGATGAACTGC +AAAAATTTTCCGGCATTCTTTCCGGGTCGCTCTCTTTTATTTTCGGTAAACTGGAAGAGGGGATGAGTC +TCTCACAGGCGACCGCCCTGGCGCGCGAGATGGGCTATACCGAACCCGATCCGCGCGACGATCTTTCCG +GTATGGATGTGGCGCGGAAACTGTTGATCCTCGCCCGCGAGACGGGCCGCGAGCTGGAGCTTTCCGATA +TTGTGATTGAACCGGTGTTGCCGGACGAGTTTGACGCCTCCGGCGATGTGACCACCTTTATGGCGCATC +TGCCGCAGCTTGACGACGCGTTTGCCGCCCGTGTGGCGAAAGCTCGTGATGAAGGTAAGGTATTGCGCT +ATGTGGGCAATATCGAAGAGGATGGCGTGTGCCGCGTGAAGATTGCCGAAGTTGATGGTAACGATCCGC +TCTTCAAAGTGAAAAACGGTTAAGAAAACGCGCTGGCGTTCTACAGCCACTATTATCAGCCCTTGCCGT +TGGTGCTGCGCGGCTACGGCGCAGGCAATGATGTGACGGCGGCGGGCGTGTTTGCCGATCTGTTACGGA +CCCTCTCATGGAAGTTAGGAGTT'; if($seq=~/.*(ATG.*)/) {$substring_with_correct_start=$1;}
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re: how to find this substring?
by roboticus (Chancellor) on Jun 08, 2012 at 11:53 UTC | |
|
Re: how to find this substring?
by RichardK (Parson) on Jun 08, 2012 at 14:32 UTC | |
by bulk88 (Priest) on Jun 08, 2012 at 17:04 UTC | |
by GrandFather (Saint) on Jun 09, 2012 at 01:53 UTC | |
by bulk88 (Priest) on Jun 09, 2012 at 05:30 UTC | |
by GrandFather (Saint) on Jun 10, 2012 at 05:04 UTC | |
|
Re: how to find this substring?
by ckj (Chaplain) on Jun 08, 2012 at 12:24 UTC | |
by Not_a_Number (Prior) on Jun 08, 2012 at 14:08 UTC | |
by Anonymous Monk on Jun 08, 2012 at 14:15 UTC | |
|
Re: how to find this substring?
by temporal (Pilgrim) on Jun 08, 2012 at 13:24 UTC |