An example of the two lines of interest in the intput text is shown below:
<gene id = "251" label = "gene_of_interest">
<gene_seq id = "321" status = "Sanger source DNA code" CDS_number
+= "1" number_of_CDSs = "" sequence_source "/data/databases/flatfiles/
+sequences/species/genome/embl/ch1.embl" startpos = "2435591" endpos =
+ "2436562" startopen = "1" endopen = "1" complement = "F"/>
If I could ask an XML question in relation to this. Does any XML convention suggest that the two ids here should have the same value? | [reply] [d/l] |
If your data is valid XML, run, don't walk, to the XML namespace. Something there will save you a bazillion headaches and make this job easy.
Yes, you haven't yet reached a closing </gene> tag so the gene_seq data is part of gene's data. If there is no closing tag, you don't have valid XML.
After Compline, Zaxo
| [reply] |