Remove Duplicates from an Array of Arrays

Dru has asked for the wisdom of the Perl Monks concerning the following question:

Replies are listed 'Best First'.
Re: Remove Duplicates from an Array of Arrays by davorg (Chancellor) on Jul 14, 2004 at 15:22 UTC
`my %seen; for (0 .. $#AoA) { print "$AoA[$_][0] -> $AoA[$_][1]\n" unless $seen{$AoA[$_][0]}{$AoA[$_][1]}++; }` [download] (although that doesn't remove the duplicates, it just stops you from printing them) -- <http://www.dave.org.uk> "The first rule of Perl club is you do not talk about Perl club." -- Chip Salzenberg	[reply] [d/l]
Re: Remove Duplicates from an Array of Arrays by ccn (Vicar) on Jul 14, 2004 at 15:27 UTC
`my %H = map {("$_->[0]!$_->[1]" => 1)} @AoA; @AoA = map{[split /!/]} keys %H;` [download]	[reply] [d/l]
Re: Remove Duplicates from an Array of Arrays by dragonchild (Archbishop) on Jul 14, 2004 at 15:31 UTC
When removing duplicates, the standard way is to use a hash. The key of the hash is what has to be unique. A little manipulation shoudl get you what you need. ------ We are the carpenters and bricklayers of the Information Age. Then there are Damian modules.... sigh* ... that's not about being less-lazy -- that's about being on some really good drugs -- you know, there is no spoon.* - flyingmoose I shouldn't have to say this, but any code, unless otherwise stated, is untested	[reply]
Re: Remove Duplicates from an Array of Arrays by rjbs (Pilgrim) on Jul 14, 2004 at 15:35 UTC
Consider mapping? `my %seen_values; for my $aref (@array_of_arrayrefs) { $aref = [ map { $seen_values{$_}++ ? () : $_ } @$aref ]; }` [download] For every arrayref in the list, transform the arrayref's array as follows: if the value has been seen before, ignore it. Otherwise, include it. Either way, make sure you note that you've seen it. NOTE! This removes any value that's seen more than once. I think, though, that you want to remove entire pairs that are seen more than once. Your use of arrows makes it seem like you care about pairs. After all, it's like =>, used in marking hash pairs. So, if you indeed meant duplicated arrayrefs, not duplicated deep values... `my %seen_values; my @new_AofA = map { $seen_values{$_->[0]}{$_->[1]}++ ? () : $_ } @Aof +A;` [download] It's the same thing, but we're using a two-level hash. After all, it's a lot like what you were thinking, isn't it? If this isn't clear, I can elaborate on how it works. I know using $_ makes some people glaze over... rjbs	[reply] [d/l] [select]
Re^2: Remove Duplicates from an Array of Arrays by Roy Johnson (Monsignor) on Jul 14, 2004 at 16:43 UTC
Doesn't it seem like grep is a more natural choice? `my %seen_values; my @new_AofA = grep { !($seen_values{$_->[0]}{$_->[1]}++) } @AofA;` [download] We're not really tightening our belts, it just feels that way because we're getting fatter.	[reply] [d/l]
Re^3: Remove Duplicates from an Array of Arrays by rjbs (Pilgrim) on Jul 14, 2004 at 17:01 UTC
Yes, you're quite right. I think I got map in my head for some reason and ran with it! Anyway, fortunately both will work, and the use of %seen is the thing Dru probably needs. Roy's grep example helps make that clearer. rjbs	[reply]
Re^3: Remove Duplicates from an Array of Arrays by ysth (Canon) on Jul 14, 2004 at 17:33 UTC
Given the data, using old-style multidimensional hashing would work also: `my @new_AofA = grep { !$seen{$_->[0],$_->[1]}++ } @AofA;` [download]	[reply] [d/l]
Re: Remove Duplicates from an Array of Arrays by NetWallah (Canon) on Jul 14, 2004 at 20:33 UTC
This looks like a "netstat" or NAT table. If so, a simple hash may be a better structure to hold the information. You can transform the AOA into a hash while eliminating duplicates using this simple code: `my @AoA=([11,1],[22,2],[11,1],[44,4]); my %h; $h{$_->[0]} = $_->[1] for @AoA; print qq($_ -> $h{$_}\n) for sort keys %h --Output--- 11 -> 1 22 -> 2 44 -> 4` [download] Earth first! (We'll rob the other planets later)	[reply] [d/l]
Re: Remove Duplicates from an Array of Arrays by Dru (Hermit) on Jul 14, 2004 at 18:24 UTC
Monks, Thanks, lots of good solutions here. I'll play with a few and pick my favorite.	[reply]