mind_frame has asked for the wisdom of the Perl Monks concerning the following question:
Hello Monks, I am a beginner. I have two files: the first (file1) contains several regexes, while the other (file2) contains FASTA sequences. My intention is to use the regexes in file1 to check whether they match any FASTA sequences in file2, and to print every regex that matches at least one sequence, together with the number of sequences it matches.
file1 is structured so that each line has an ID, followed by '>>', then the regex; file2 has an identifier on one line and the sequence on the next. For example, file1:

FGER_HWW_PRT >> ..DW[ALK]..[^P]..[VI]{2,4}
TKAR_GLW_NQW >> [^VKR]{0,2}..FP[D].T.N.Q.
etc...

and file2:

>lac9_B: details details
GFVTSDRWPALKMSRWSLEMVWASRGYPLVNDRMWSWSDDDP
>serP_A: otherdetails details2
GFVLSDPPPPALKMSRWSLEMVWASRGYPLVNDPWQRTKRKRKDRTCWASNYIHDRP

Many thanks.
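One way the task described above could be approached is sketched below. It is only a minimal illustration under stated assumptions, not a definitive solution: the sub names (read_patterns, read_fasta, count_matches) are made up for this sketch, the sample data is read from in-memory strings rather than from the real file1 and file2, and the third pattern (DEMO_PALK) is a hypothetical addition that is not in the original file1. It also assumes every regex in file1 is valid Perl syntax, since qr// dies on an invalid pattern.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Parse "ID >> regex" lines. Returns a list of [id, pattern_text, compiled_regex].
sub read_patterns {
    my ($fh) = @_;
    my @patterns;
    while (my $line = <$fh>) {
        chomp $line;
        # Skip lines that do not follow the "ID >> regex" layout.
        my ($id, $pattern) = $line =~ /^\s*(\S+)\s*>>\s*(\S.*?)\s*$/
            or next;
        push @patterns, [ $id, $pattern, qr/$pattern/ ];
    }
    return @patterns;
}

# Parse FASTA records. Returns a list of [header, sequence].
# Sequence lines are concatenated, so multi-line sequences also work.
sub read_fasta {
    my ($fh) = @_;
    my @records;
    while (my $line = <$fh>) {
        chomp $line;
        if ($line =~ /^>(.*)/) {
            push @records, [ $1, '' ];      # start a new record
        }
        elsif (@records) {
            $line =~ s/\s+//g;              # drop stray whitespace
            $records[-1][1] .= $line;       # append to current sequence
        }
    }
    return @records;
}

# For each pattern, count the sequences it matches.
# Returns [id, pattern_text, count] only for patterns with at least one match.
sub count_matches {
    my ($patterns, $records) = @_;
    my @hits;
    for my $p (@$patterns) {
        my ($id, $text, $re) = @$p;
        my $count = grep { $_->[1] =~ $re } @$records;   # grep in scalar context counts
        push @hits, [ $id, $text, $count ] if $count;
    }
    return @hits;
}

# Sample data from the question, plus one hypothetical pattern (DEMO_PALK).
my $file1 = <<'END';
FGER_HWW_PRT >> ..DW[ALK]..[^P]..[VI]{2,4}
TKAR_GLW_NQW >> [^VKR]{0,2}..FP[D].T.N.Q.
DEMO_PALK >> P+ALK
END

my $file2 = <<'END';
>lac9_B: details details
GFVTSDRWPALKMSRWSLEMVWASRGYPLVNDRMWSWSDDDP
>serP_A: otherdetails details2
GFVLSDPPPPALKMSRWSLEMVWASRGYPLVNDPWQRTKRKRKDRTCWASNYIHDRP
END

# In-memory filehandles; for real files use e.g. open my $fh, '<', 'file1'.
open my $re_fh, '<', \$file1 or die "Cannot open patterns: $!";
open my $fa_fh, '<', \$file2 or die "Cannot open sequences: $!";

my @patterns = read_patterns($re_fh);
my @records  = read_fasta($fa_fh);

for my $hit (count_matches(\@patterns, \@records)) {
    my ($id, $text, $count) = @$hit;
    print "$id >> $text\t$count\n";
}
```

With the sample data, neither of the two regexes from the question matches either sequence (no sequence contains "DW" or "FP"), so only the hypothetical DEMO_PALK line should be reported, with a count of 2. Compiling each regex once with qr// and keeping the sequences in memory keeps the loop simple; for a very large file2 it may be preferable to stream the sequences and keep a running count per regex instead.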
Replies are listed 'Best First'.
Re: How do i use regexes in one file to match FASTA sequences in another file
by lune (Pilgrim) on Nov 22, 2013 at 14:46 UTC
by Kenosis (Priest) on Nov 22, 2013 at 19:30 UTC
Re: How do i use regexes in one file to match FASTA sequences in another file
by Random_Walk (Prior) on Nov 22, 2013 at 13:36 UTC
Re: How do i use regexes in one file to match FASTA sequences in another file
by choroba (Cardinal) on Nov 22, 2013 at 13:46 UTC