I'm given this file with several partial DNA sequences. The first thing I need to do is parse this string into the different sequences at the ">". I want to store these records as variables because in each of the sequences I need to search for a certain pattern. I've been working on the code for the search on the individual sequences, but I can't figure out how to just split these into variables that I can then analyze separately.(And can I keep an identifier on the parsed strings?) Here's the file below. I think I might want to use this? /^>(\w+)\s(. +)$/; Thanks for your help! I know what I want to do but just don't know the language well at all.
http://www.med.nyu.edu/rcr/rcr/course/smutans.fas