The code shown in Re: Removing duplicate subtrees from XML works very well with the data shown in this posting. But with my data (as below)
<species name = "hs"> <sequence id = "1" title = "chromosome 1"> <genome_feature type = "CDS"> <gene id = "01"> <gene_seq number = "1"></gene_seq> </gene> </genome_feature> </sequence> </species> <species name = "hs"> <sequence id = "1" title = "chromosome 1"> <genome_feature type = "CDS"> <gene id = "17"> <gene_seq number = "2"></gene_seq> </gene> </genome_feature> </sequence> </species>
bash-2.05$ perl perl_monks_F.pl sorted doc generation NOK: expected: <species name = "hs"> <sequence id = "1" title = "chromosome 1"> <genome_feature type = "CDS"> <gene id = "17"> <gene_seq number = "2"></gene_seq> </gene> </genome_feature> </sequence> </species> found: <species name="hs"> <sequence id="1" title="chromosome 1"> <genome_feature type="CDS"> <gene id="01"> <gene_seq number="1"></gene_seq> </gene> </genome_feature> </sequence> </species> Use of uninitialized value in substitution (s///) at perl_monks_F.pl l +ine 143, <DATA> chunk 2. Use of uninitialized value in substitution (s///) at perl_monks_F.pl l +ine 144, <DATA> chunk 2. Use of uninitialized value in substitution (s///) at perl_monks_F.pl l +ine 145, <DATA> chunk 2. Use of uninitialized value in string eq at perl_monks_F.pl line 50, <D +ATA> chunk 2. Use of uninitialized value in concatenation (.) or string at perl_monk +s_F.pl line 56, <DATA> chunk 2. merged doc generation NOK: expected: found: <species name="hs"> <sequence id="1" title="chromosome 1"> <genome_feature type="CDS"> <gene id="01"> <gene_seq number="1"></gene_seq> </gene> </genome_feature> </sequence> </species> bash-2.05$
Alterations that I made to the provided script were the addition of a few ErrorContext => 1 statements. And I have tested these slight alterations with the data used in the top posting link.
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |