comment on

First, let us do something a bit easier. We'll use the character offset instead of the line number as key to the hash:

my %hash;
my $re= qr{
     ( /\* .*? \*/) | ( \/\/[^\n]*)           
     | " (?: [^"\\]* | \\. )* "               
     | ' (?: [^'\\]* | \\. )* '               
     | . [^/"']*                              
   }xs;

while(  /$re/g  ) {
    $hash{pos($_)}= $1;
}
[download]

Then there are several ways to convert character offsets into line numbers. If none of your patterns spanned lines, then I'd probably update the regex to match newlines separately so I could increment a line number count in the same loop. But /* */ can span lines so I think I'd instead do a merge-sort-ish thing similar to:

my @nl;
while(  /\n/g  ) {
    push @nl, pos($_);
}
my $ln= 1;
while(  /$re/g  ) {
    $ln++   while  $nl[$ln-1] < pos($_);
    $hash{$ln}= $1;
}
[download]

Except I think there is probably at least one off-by-one error in that code. For example, pos($_) might need to be replaced with something from @- or @+ in one or both of those places.

I hope it gives you an idea where to start to get what you are looking for.

- tye

In reply to Re: Regex "(un)Knowledge" (loop) by tye
in thread Regex "(un)Knowledge" by nofernandes

Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!

Titles consisting of a single word are discouraged, and in most cases are disallowed outright.

Read Where should I post X? if you're not absolutely sure you're posting in the right place.

Please read these before you post! —

Posts may use any of the Perl Monks Approved HTML tags:

a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr

You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)

	For:		Use:
	&		`&`
	<		`<`
	>		`>`
	[		`[`
	]		`]`

Link using PerlMonks shortcuts! What shortcuts can I use for linking?

See Writeup Formatting Tips and other pages linked from there for more info.