anasuya has asked for the wisdom of the Perl Monks concerning the following question:
Hi. I have a file which looks like this. It has two fields which are separated by a '+' sign.
d_145_1_2- + c_3_1_8-e_74_1_1- a_100_1_6-c_2_1_6- + b_50_1_2- c_69_1_17- + b_61_6_1- c_2_1_2- + a_123_1_1- d_83_1_1- + c_2_1_5-d_162_1_1- c_2_1_2- + a_123_1_1- a_123_1_1- + c_2_1_2-
What I need to do is to extract out lines which are unique in this file. For example here,from the snippet of the file above, the following lines are unique:
d_145_1_2- + c_3_1_8-e_74_1_1- a_100_1_6-c_2_1_6- + b_50_1_2- c_69_1_17- + b_61_6_1- c_2_1_2- + a_123_1_1- d_83_1_1- + c_2_1_5-d_162_1_1-
One shall notice that the fields a_123_1_1- and c_2_1_2- occur as a pair more than once, however in such a way that their relative order is reversed. Is there anyway I can extract out unique lines, keeping only one occurrence of such pairs i.e. a_123_1_1- and c_2_1_2-? I have as of now tried awk. There, I was unable to retrieve unique lines using the uniq function as that doesn't take care of the same combinations of fields repeating in reverse orders. Also I tried merging the two fields together and then carrying out awk operations but to no avail. Is there any way such that perl makes the job easier?
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re: xtracting unique lines
by Happy-the-monk (Canon) on Mar 27, 2012 at 18:22 UTC | |
|
Re: xtracting unique lines
by nemesdani (Friar) on Mar 27, 2012 at 18:19 UTC | |
|
Re: xtracting unique lines
by Cristoforo (Curate) on Mar 28, 2012 at 02:03 UTC | |
by anasuya (Novice) on Mar 28, 2012 at 11:07 UTC | |
|
Re: xtracting unique lines
by johngg (Canon) on Mar 28, 2012 at 07:29 UTC |