I want a perl script that retreives only the gi id's i.e 159040991, 159041230, and so on..in a straight line from fasta files (I pasted below the file as an example) I have more than 250 files to work with. can anyone help me with the perl script.
Thanks>gi|159040991|ref|YP_001540243.1| glycoside hydrolase family protein [ +Caldivirga maquilingensis IC-167] >gi|157919826|gb|ABW01253.1| glycos +ide hydrolase family 1 [Caldivirga maquilingensis IC-167] MDISFPKSFRFGWSQAGFQSEMGTPGSEDPNTDWYVWVHDPENIASGLVSGDLPEHGPGYWGLYRMFHDN +AVKMGLDIAR INVEWSRIFPKPMPDPPQGNVEVKGNDVLAVHVDENDLKRLDEAANQEAVRHYREIFSDLKARGIHFILN +FYHWPLPLWV HDPIRVRKGDLSGPTGWLDVKTVINFARFAAYTAWKFDDLADEYSTMNEPNVVHSNGYMWVKSGFPPSYL +NFELSRRVMV NLIQAHARAYDAVKAISKKPIGIIYANSSFTPLTDKDAKAVELAEYDSRWIFFDAIIKGELMGVTRDDLK +GRLDWIGVNY YSRTVVKLIGEKSYVSIPGYGYGCERNSISPDGRPCSDFGWEFYPEGLYDVIMKYWSRYHLPIYVTENGI +ADAADYQRPY YLVSHIYQVYRAIQEGANVKGYLHWSLTDNYEWASGFSMRFGLLQVDYSTKKQYWRPSAYVYREIAKSKA +IPEELMHLNT IPPTRSLRR >gi|159041230|ref|YP_001540482.1| glycoside hydrolase family protein [ +Caldivirga maquilingensis IC-167] >gi|157920065|gb|ABW01492.1| glycos +ide hydrolase family 1 [Caldivirga maquilingensis IC-167] MIKFPSDFRFGFSTVGTQHEMGTPGSEFVSDWYVWLHDPENIASGLVSGDLPEHGPGYWDLYKQDHSIAR +DLGLDAAWIT IEWARVFPKPTFDVKVKVDEDDGGNVVDVEVNESALEELRRLADLNAVNHYRGILSDWKERGGLLVINLY +HWAMPTWLHD PIAVRKNGPDRAPSGWLDKRSVIEFTKFAAFIAHELGDLADMWYTMNEPGVVITEGYLYVKSGFPPGYLD +LNSLATAGKH LIEAHARAYDAIKAYSRKPVGLVYSFADYQPLRQGDEEAVKEAKGLDYSFFDAPIKGELMGVTRDDLKGR +LDWIGVNYYT RAVLRRRQDAGRASVAVVDGFGYSCEPGGVSNDRRPCSDFGWEIYPEGVYNVLMDLWRRYRMPMYITENG +IADEHDKWRS WFIVSHLYQIHRAMEEGVDVRGYFHWNLIDNLEWAAGYRMRFGLVYVDYATKRRYFRPSALVMREVAKQK +AIPDYLEHYI KPPRIE
In reply to perl script by sainath
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |