Hi,
so, some sample lines could be:
STN PF07660 597 I5EU07 - 125
+9 1e-140 467.1 11.7 1 2 5.9e-140 3.4e-137 455.5 7.3
+ 9 595 238 822 232 824 0.97 -
STN PF07660 597 I5EU07 - 125
+9 1e-140 467.1 11.7 2 2 3.9e-05 0.023 10.2 0.0 1
+48 227 864 949 842 962 0.80 -
and, in that case, we would need to store the first line, because the $iEvalue for the first line is 3.4e-137, which is lower (thus better) than the second line's $iEvalue which is 0.023.
So, the only thing I need is somehow, if I have the same $protein_ac (in this example I5EU07) and more than one lines of the same $pfam_ac_hit (in this example PF07660), I keep only the one with the lowest $iEvalue. | [reply] [d/l] [select] |