comment on

On top of what hippo says, you are reading and parsing the mapping files anew for every line you process, that's an awful lot of wasted work. Similarly, you are splitting the line into @source repeatedly for each mapping record.

As a first step: separate out the reading and parsing of the mapping files into data structures, do that once, then walk through the data structures in the loop over @lines. That might look something like this:

# Mapping des charges directes
my @mapping1 = map {
  my @mapping = split /\t/, $_;
  # account, mapped account, mapped source
  [ $mapping[0], $mapping[2], $mapping[3] ];
} <MAPPINGFILE1>;

# Mapping des charges indirectes
my @mapping2 = map {
  my @mapping = split /\t/, $_;
  # account, mapped account, mapped source
  [ $mapping[0], $mapping[4], $mapping[2] ];
} <MAPPINGFILE2>;


LINE: for my $line (@lines) {
  my $source = (split /\t/, $line)[2];

  # Mapping des charges directes
  for my $mapping1 (@mapping1) {
    my($account, $mapped_account, $mapped_source) = @$mapping1;

    # Account is matching with source
    if ($line =~ /$account/) {
      # Account substitution
      if ($mapping eq "") {
        $line =~ s/$account/"Compte cible non défini !"/;
      } else {
        $line =~ s/$account/$mapped_account/;
      }
      # Mapping = target Unit, Alloc_ + Unit source
      if ($mapped_source eq 'Unit source') {
        # Mapping : source Unit, Alloc_ + source Unit
        $line =~ s/$source/$source\tALLOC_$source/;
      } elsif ($mapped_source eq "") {
        $line =~ s/$source/"Unit cible non définie !"\tALLOC_$source/;
      } else {
        # Unit substitution
        $line =~ s/$source/$mapped_source\tALLOC_$source/;
      }
      push @lines2, $line;
      next LINE;
    }
  }

  # Mapping des charges indirectes
  for my $mapping2 (@mapping2) {
    my($account, $mapped_account, $mapped_source) = @$mapping2;
    if ($line =~ /$account/) {
      $line =~ s/$account/$mapped_account/;
      $line =~ s/$source/$mapped_source\tALLOC_$source/;
      push @lines2, $line;
      next LINE;
    }
  }
  push @rejects, "Lignes non mappées (Account): \t$line";
}
[download]

However I suspect that even bigger savings are possible: for example, if the account name appears in a specific column in $line, you could probably turn the whole thing into a single hash lookup.

In reply to Re: Optimization tips by hv
in thread Optimization tips by sroux

Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!

Titles consisting of a single word are discouraged, and in most cases are disallowed outright.

Read Where should I post X? if you're not absolutely sure you're posting in the right place.

Please read these before you post! —

Posts may use any of the Perl Monks Approved HTML tags:

a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr

You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)

	For:		Use:
	&		`&`
	<		`<`
	>		`>`
	[		`[`
	]		`]`

Link using PerlMonks shortcuts! What shortcuts can I use for linking?

See Writeup Formatting Tips and other pages linked from there for more info.