in reply to regex match in list context

Short note first off:
(?:[A-Za-z0-9_\-,.+]+)?
is the same as
[A-Za-z0-9_\-,.+]*

except the former makes the matching engine work much harder.

Another note: the whole point of using alternate delimiters is to not have to escape everything.. so get rid of some of that backslashed eye sore. :)

Anyway, let's get to the point I wanted to make: you might have a look at \G which forces a match to start where the last one left off. For the very first match, it is equivalent to \A.

my $mm = qq("000.E+3","","","","QCA-086_2","-1","P","FALSE"); my @p = ( $mm =~ m[\G ("[A-Za-z0-9_\-,.+]*") (?: , | \z ) ]g ); print map {qq($_\n)} @p; 1;
This ensures that if any part of the line does not match your specification exactly, the matching engine will fail and abort, instead of trying to find something that looks like your specification somewhere further down the string. Coupled with some other check (f.ex, if you have a fixed number of fields, check if you matched that number of fields), it can be used to make sure that your script will die with a loud moan on invalid input, rather than go ahead silently to produce garbage output from garbage input.

Makeshifts last the longest.

Replies are listed 'Best First'.
Re: Re: regex match in list context
by Anonymous Monk on Oct 02, 2003 at 20:52 UTC
    shorter note:
    [A-Za-z0-9_\-,.+]*
    is the same as
    [\w,.+-]*
    except the former makes my brain work much harder.
      Be aware that the latter is affected by user's locale settings and/or Unicode encoding of strings, the former is not and always means the same thing.

      Makeshifts last the longest.