in reply to Issue with regex matching

There are some things I don't understand about your code:

  1. Why does your regex start with a caret?
  2. Why are there brackets around $constant?
  3. Is there any reason not to use index instead of the regex? That should simplify things further I would have thought.

Perhaps there is more to the spec than originally stated, but the suspicion is that you might be over-complicating it a little.

Replies are listed 'Best First'.
Re^2: Issue with regex matching
by sowais (Sexton) on Aug 29, 2013 at 15:06 UTC

    1. the understanding is $constant will be in the beginning of the row 2. because thats how a variable is compared in regex 3. I have found regex to be much simpler and have more experience with it, so that why.

      2. because thats how a variable is compared in regex

      No, the variable $constant will be interpolated into the regex without the parentheses (round brackets). Consider:

      1:31 >perl -wE "my $constant = 'fred'; my $string = 'fredflintstone'; + print qq[found $1\n] if $string =~ /^$constant/;" Use of uninitialized value $1 in concatenation (.) or string at -e lin +e 1. found 1:32 >perl -wE "my $constant = 'fred'; my $string = 'fredflintstone'; + print qq[found $1\n] if $string =~ /^($constant)/;" found fred 1:32 >

      If the regex matches, any parentheses capture their contents into the special variables $1, $2, etc. But this incurs a performance penalty:

      WARNING: Once Perl sees that you need one of $&, $`, or $' anywhere in the program, it has to provide them for every pattern match. This may substantially slow your program. Perl uses the same mechanism to produce $1, $2, etc, so you also pay a price for each pattern that contains capturing parentheses. (“Capture groups” in perlre#Regular-Expressions)

      Since you’re not using $1, the capturing parentheses aren’t needed.

      Hope that helps,

      Athanasius <°(((><contra mundum Iustus alius egestas vitae, eros Piratica,

        I stand corrected. Thanks!

      Thanks for those clarifications. Athanasius has supplied a clear explanation the issue of the brackets and you've provided the extra information that $constant need only be counted at the start of the line. With that in mind we can use the simple (but potentially very inefficient) method with index thus:

      $constant = '$$'; $line_count =0; while (<FILE>){ $line_count++ unless index ($_, $constant); }