comment on

Hmm, does this represent everything the program needs to do? Because if, so, I note that you only ever use the first field of each line - you can load the first field of all the lines in both files into memory, and avoid the painfully slow re-reading; something like (untested):

  my $hold = readfile('holds');
  my $copy = readfile('copies');
  for my $key (keys %$hold) {
    if ($copy{$key}) {
      print "$key: hold and copy (or copy and hold)\n";
    }
  }

  sub readfile {
    my $file = shift;
    my $hash = {};
    open(my $fh, "<$file") or die "$file: $!";
    local $_;
    while (<$fh>) {
      # fields are '|' delimited - pick up the first field
      my $key = substr $_, 0, index($_, "|");
      ++$hash->{$key};
    }
    close $fh;
    return $hash;
  }
[download]

Even if this is only the starting point, and the real code needs to access all the fields, you could for example cache in memory the first field and the offset into the file for each row, and then use seek() to locate the complete record whenever you need it.

Hugo

In reply to Re: Re: Re: Re: many to many join on text files by hv
in thread many to many join on text files by aquarium

Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!

Titles consisting of a single word are discouraged, and in most cases are disallowed outright.

Read Where should I post X? if you're not absolutely sure you're posting in the right place.

Please read these before you post! —

Posts may use any of the Perl Monks Approved HTML tags:

a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr

You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)

	For:		Use:
	&		`&`
	<		`<`
	>		`>`
	[		`[`
	]		`]`

Link using PerlMonks shortcuts! What shortcuts can I use for linking?

See Writeup Formatting Tips and other pages linked from there for more info.