in reply to Regexp problems

I do not have a solution for you ... just some possible work arounds.

If you are storing gene sequences in an XML-like markup, why not just use XML? Now you have the power of XML::Parser, XML::Twig, etc. at your disposal. You can also use DTD to enforce rules.

Those regexes look like they will be a nightmare to maintain ... i would look for another solution. Besides, do attributes really have to be in a certain order with exactly one space inbetween? Sounds very fragile.

UPDATE:
Here is some code for you to play with, just install XML::Simple first. Notice how easy it is for me to get the individual attributes from the XML tag:

use strict; use warnings; use XML::Simple; my $data = do {local $/;<DATA>}; my $xml = XMLin($data); print $xml->{label}, "\n"; print $xml->{id}, "\n"; __DATA__ <?xml version="1.0" ?> <gene id = "3" label = "gene_of_interest" />

jeffa

L-LL-L--L-LL-L--L-LL-L--
-R--R-RR-R--R-RR-R--R-RR
B--B--B--B--B--B--B--B--
H---H---H---H---H---H---
(the triplet paradiddle with high-hat)