a. File has multiple close relatives for a gene of interest. I only need one close relative in the output file. b. Some lines have a repeat of same sequence. For example: Metac1_3189(Metac1_3189) I want to exclude these from output file b. File has parentheses which I want to exclude from output file c. Close relative and gene of interest are not separated into two distinct columns. I want to include these in output file.