Regex: plucking numbers from a large string

Baz has asked for the wisdom of the Perl Monks concerning the following question:

Replies are listed 'Best First'.

Re: Regex: plucking numbers from a large string
by RMGir (Prior) on May 01, 2002 at 18:15 UTC

my @counts; # since there are only 10 possible values, all
            # digits, why use a hash?
while($largeStr=~/Tel: 06(\d)/g) 
{
    $counts[$1]++
}
[download]

After the loop is done, the @counts array should have the answers you're looking for.
--
Mike

Edit: Wow, foreach doesn't work, but while does. Wierd...

[reply]
[d/l]

Re: Regex: plucking numbers from a large string
by broquaint (Abbot) on May 01, 2002 at 18:29 UTC

~~You could always use the funky regex eval features~~

~~my @exts; $largeStr =~ /Tel: 06(\d+)(?{push @exts, $1})/g;~~

Or the ever handy \G zero-width assertion

my @exts;
push @exts, $1 while $largeStr =~ /\GTel: 06(\d+)/g;

# or better yet
my %exts;
$exts{$1}++ while $largeStr =~ /\GTel: 06(\d+)/g;
[download]

%exts

print qq[found "$_" $exts{$_} times],$/ for sort keys %exts;
[download]

_________ broquaint

update: removed first suggestion as it doesn't seem to work as I expected :-/

[reply]
[d/l]
[select]

Re: Re: Regex: plucking numbers from a large string

by Juerd (Abbot) on May 01, 2002 at 19:13 UTC

Or the ever handy \G zero-width assertion

Which is great if your data is "Tel: 061Tel: 062Tel: 063", 'cause you'd have to use something to match stuff in between, and there's probably a better solution to this than using .*.

You can use m//g in list context, and get a list of matches (or a list of captures if you use them):

my @extensions = $large_string =~ /Tel: 06(\d+)/g;

my %extension;
$extension{$_}++ for @extensions;
[download]

- Yes, I reinvent wheels.
- Spam: Visit eurotraQ.

[reply]
[d/l]

Watch $1 vs. $_ in for loops...

by RMGir (Prior) on May 01, 2002 at 20:12 UTC

If you don't need the list, you can of course use the match itself as for's expression.
I thought so, too :(
$ perl -e'$x="1 2 3 4 5 5 5 5 5 5"; $counts[$1]++ for $x=~/(\d)/g; pri +nt "$_ $c ounts[$_]\n" foreach (0..$#counts)' 0 1 2 3 4 5 10
[download]
How's that for a wierd problem?
Even stranger, if you s/for/while/:
$ perl -e'$x="1 2 3 4 5 5 5 5 5 5"; $counts[$1]++ while $x=~/(\d)/g; p +rint "$_ $counts[$_]\n" foreach (0..$#counts)' 0 1 1 2 1 3 1 4 1 5 6
[download]
This is with 5.6.1.

Ignore me; it makes sense that $1 would be the last value with a for loop. $_ works fine.

$ perl -e'$x="1 2 3 4 5 5 5 5 5 5"; $counts[$_]++ for $x=~/(\d)/g; pri
+nt "$_ $counts[$_]\n" foreach (0..$#counts)'
0
1 1
2 1
3 1
4 1
5 6
[download]

$ perl -e'$x="1 2 3 4 5 5 5 5 5 5"; $counts[$1]++ while $x=~/(\d)/g; p
+rint "$_ $counts[$_]\n" foreach (0..$#counts)'
0
1 1
2 1
3 1
4 1
5 6
[download]

[reply]
[d/l]
[select]

Re: Regex: plucking numbers from a large string
by abaxaba (Hermit) on May 01, 2002 at 18:27 UTC

(@extensions) = $largeStr =~ /Tel:\s(06\d)/g;
[download]

ÅßÅ×ÅßÅ
"It is a very mixed blessing to be brought back from the dead." -- Kurt Vonnegut

[reply]
[d/l]

Re: Regex: plucking numbers from a large string
by mephit (Scribe) on May 01, 2002 at 21:37 UTC

my %hash;
hash{$_}++ foreach ($largeStr =~ /Tel: (06\d+)/g);
[download]

[reply]
[d/l]