I posted the test file and the script in the scratchpad. Thanks again for everything, Poj. I really appreciate your effort.
# Remove comment line(s)
$para =~ s/^\s*#.*//mg;
# Trim trailing white space
$para =~ s/\s+$//; # add this
..
poj
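To see what the two substitutions above do together, here is a minimal sketch that applies them to one made-up record. The helper name `clean_para` and the sample text are assumptions for illustration only; they are not taken from the script in the scratchpad.

```perl
use strict;
use warnings;

# Hypothetical helper: apply the two substitutions to a single record.
sub clean_para {
    my ($para) = @_;
    # Remove comment line(s); /m lets ^ match at the start of every line.
    # The newline that ended each comment line is left behind.
    $para =~ s/^\s*#.*//mg;
    # Trim trailing white space; without /m, $ anchors at the end of the string,
    # so this also removes the newlines left over from trailing comments.
    $para =~ s/\s+$//;
    return $para;
}

# Made-up sample record: a header, a sequence line, and a trailing comment.
my $cleaned = clean_para("HEADER one\nACGT\n# trailing comment\n\n");
print "[$cleaned]\n";
```

Note that a comment line in the middle of a record would still leave an empty line behind, since only trailing white space is trimmed.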
Hi,
That fixed the issue. Thank you so much.
I have a related question. Is there a way to remove all the duplicate entries and/or replace them with a newline character? Duplicates are skewing my data.
Also, some entries have incomplete sequences and have the word "(Fragments)" in the header. Can you suggest a way to delete all entries that have "(Fragment)" in their header?
Thanks again for all your help Poj. I will be presenting this data at a conference in Winnipeg next month. I would be happy to acknowledge your help in this work. Please let me know if you want me to do that.
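One possible way to approach both requests, as a rough sketch rather than a tested solution for the real data: keep a `%seen` hash to skip repeated records, and skip any record whose header matches `(Fragment)` or `(Fragments)`. The helper name `filter_records`, the sample records, the assumption that the header is the first line of each record, and the assumption that "duplicate" means identical record text are all hypothetical.

```perl
use strict;
use warnings;

# Hypothetical helper: keep the first copy of each record and drop records
# whose header contains "(Fragment)" or "(Fragments)".
sub filter_records {
    my @records = @_;
    my %seen;
    my @kept;
    for my $rec (@records) {
        # Assumption: the header is the first line of the record.
        my ($header) = split /\n/, $rec, 2;
        next if !defined $header;                # skip empty records
        next if $header =~ /\(Fragments?\)/;     # drop fragment entries
        next if $seen{$rec}++;                   # assumption: duplicate = identical text
        push @kept, $rec;
    }
    return @kept;
}

# Made-up sample data, not from the real file.
my @records = (
    ">P1 Protein A\nMKV",
    ">P2 Protein B (Fragment)\nMK",
    ">P1 Protein A\nMKV",
    ">P3 Protein C (Fragments)\nKV",
    ">P4 Protein D\nMLL",
);
print "$_\n\n" for filter_records(@records);
```

If duplicates should instead be judged by header alone or by sequence alone, the key used in `%seen` would need to change accordingly.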