Hi all,
I am a PhD student who is using perl to analyse some genome data.
I am trying to write some code to extract a piece of sequence (given start and end co-ordinates ) from a fasta file of one genome.
The fasta file format is:-
>genome1
ACTGTTACTTGTACCTCAGGGTTTTCTCTTTTTTTACGCGCTCAGTCAGTCCCATG
GTGCTGCCTGCATGCGTCAGTCA etc
and then i have a text file which is tab-deliminated and has 3 columns, gene number, gene start and gene finish
eg
1 0 825
2 837 1000
etc
I would like the perl script to output the sequence for each gene to a different file with the gene number as the file name.
Please help, even if its just to suggest where to start, i am feeling a bit lost at the moment. I presume i need to make an array of the gene positions, start and finish? but not sure where to go from there.
Thanks
Gemma