in reply to Re: How to match duplicate lines in a text file and extract only one of those lines to a new file
in thread How to match duplicate lines in a text file and extract only one of those lines to a new file

Thank you very much! Though you said it needs improving, I felt like I understood your code!
  • Comment on Re^2: How to match duplicate lines in a text file and extract only one of those lines to a new file

Replies are listed 'Best First'.
Re^3: How to match duplicate lines in a text file and extract only one of those lines to a new file
by perlnewbie012215 (Novice) on Aug 19, 2015 at 06:25 UTC

    Hi Guys, I am new to perl, I have a situation which is very similar to this, where my input rows are given below and I have to find the duplicates on the first column

    green apple green grapes blue blueberries orange pappaya orange orange
    Output: green apple/grapes blue blueberries orange pappaya/orange

    can one of you guys please explain this code... Thanks

      What have you tried so far? Please post it here, inside <code></code> tags.

      The way forward always starts with a minimal test.

        googling it and trying to understand this code from sub print can you explain it

        #!/usr/bin/perl use strict; use warnings; my $input_file = 'HGDP.txt'; my $output_file = 'output.txt'; open my $fh, '<', $input_file or die "Unable to open for read $input_file: $!"; open my $out_fh, '>', $output_file or die "Unable to open for write $output_file: $!"; local $, = q{ }; local $\ = "\n"; my @rows; my $static_i = 3; # number of first unjoinable columns sub print_rows { print {$out_fh} @{$rows[0]}[0 .. $static_i - 1], map { my @columns; foreach my $x (0 .. $#rows) { push @columns, $rows[$x][$_]; } join q{/}, @columns; } $static_i .. $#{$rows[0]}; } while (defined(my $line_1 = <$fh>)) { my ($x) = $line_1 =~ /^(\d+)/ or next; push @rows, [split q{ }, $line_1]; while (defined(my $line_2 = <$fh>)) { next unless $line_2 =~ /^\d/; if ($line_2 =~ /^$x\b/) { push @rows, [split q{ }, $line_2]; } else { print_rows(); @rows = [split q{ }, $line_2]; last; } } } print_rows(); close $fh; close $out_fh;