Count occurrences and rename words in order

Anonymous Monk has asked for the wisdom of the Perl Monks concerning the following question:

Replies are listed 'Best First'.
Re: Count occurrences and rename words in order by davorg (Chancellor) on Sep 18, 2002 at 15:38 UTC
`#!/usr/bin/perl -w use strict; $_ = 'doc123/print doc456/read doc789/print doc145/print doc123/read'; my %counts; s\|(/\w+)\|$1 . ++$counts{$1}\|ge; print;` [download] Seems to give the right output for the sample you gave. -- <http://www.dave.org.uk> "The first rule of Perl club is you do not talk about Perl club." -- Chip Salzenberg	[reply] [d/l]
Re^2: Count occurrences and rename words in order by flounder99 (Friar) on Sep 18, 2002 at 16:28 UTC
If the data is in a file (say `logfile`) a one-liner could be used. `$ perl -pe 's\|(/\w+)\|$1 . ++$counts{$1}\|ge' logfile>countedlogfile $ cat logfile doc123/print doc456/read doc789/print doc145/print doc123/read $ cat countedlogfile doc123/print1 doc456/read1 doc789/print2 doc145/print3 doc123/read2` [download] -- flounder	[reply] [d/l] [select]
Re: Re^2: Count occurrences and rename words in order by davorg (Chancellor) on Sep 18, 2002 at 16:33 UTC
Yeah, that works. It's unclear what the original poster wanted to do for multiple lines. I assumed that the counts should be cleared. In which case you'd do something like this: `$ perl -pe '%counts=();s\|(/\w+)\|$1 . ++$counts{$1}\|ge' logfile>counted +logfile` [download] -- <http://www.dave.org.uk> "The first rule of Perl club is you do not talk about Perl club." -- Chip Salzenberg	[reply] [d/l]
Re: Re: Count occurrences and rename words in order by Anonymous Monk on Sep 18, 2002 at 16:46 UTC
Thanks Monk It works for the first line of the text file, but the events on the rest of the file continue the numeration from the first line, and the event from each line should be independent. But I will it use to count the events in a whole document!!:)	[reply]
Re: Re: Re: Count occurrences and rename words in order by davorg (Chancellor) on Sep 18, 2002 at 16:50 UTC
You just need to reset `%count` to be empty before processing each line. `while (<INPUT>) { my %counts; s\|(/\w+)\|$1 . ++$counts{$1}\|ge; print OUTPUT; }` [download] -- <http://www.dave.org.uk> "The first rule of Perl club is you do not talk about Perl club." -- Chip Salzenberg	[reply] [d/l]
Re: Count occurrences and rename words in order by BrowserUk (Patriarch) on Sep 18, 2002 at 15:39 UTC
Something like this? `#! perl -sw my $data ='doc123/print doc456/read doc789/print doc145/print doc123/r +ead '; my @events = qw(print read); for (@events) { my $count = 1; $data =~ s/($_)/$1.$count++/ge; } print $data; __END__ # Output C:\test>198864 doc123/print1 doc456/read1 doc789/print2 doc145/print3 doc123/read2 C:\test>` [download] Cor! Like yer ring! ... HALO dammit! ... 'Ave it yer way! Hal-lo, Mister la-de-da. ... Like yer ring!	[reply] [d/l]
Re: Re: Count occurrences and rename words in order by Anonymous Monk on Sep 18, 2002 at 16:49 UTC
It was something like that but I have more than those two events, but it works!! Thank you very much	[reply]
Re: Count occurrences and rename words in order by Molt (Chaplain) on Sep 18, 2002 at 15:40 UTC
Not quite sure what this is needed for, but the following seems to meet your requirements well enough. `#!/usr/bin/perl use warnings; use strict; my $test = 'doc123/print doc456/read doc789/print doc145/print doc123/ +read'; my %count; print join '/', grep {s/^([a-z]+)\s/"$1".(++$count{$1}).' '/ie \|\| $_} split '/', $test;` [download]	[reply] [d/l]
Re: Re: Count occurrences and rename words in order by Anonymous Monk on Sep 18, 2002 at 16:41 UTC
Thanks for your help, It works!! Just one question why do you need to check the beginning of the string? I need this for reschedule the events according with document (DOC) parameters.	[reply]
Re: Re: Re: Count occurrences and rename words in order by Molt (Chaplain) on Sep 19, 2002 at 10:12 UTC
After the split on '/' each part is tested individually, since it looks like the command bit has to start at the beginning of the segment- and because I like to make my regexps as specific and safe as possible- I added the ^ anchor there.	[reply]