Let's start with a simple case, n = 1 and the charcter a. We want to match from the beginning of the string as many non-a characters as possible, and we know that we must only stop if we encounter an a :

$foo =~ /^([^a]*)a/;

Now let's look at an example of how we could do this for everything the second a. We can't use .* because we would then lose count. We can use .*?, but it won't help much. We will try to match as many non-a characters as possible before the first a, the first a and then again as many non-a characters, and then there must be the second a :

$foo =~ /^([^a]*a[^a]*)a/;

For three as, the RE will look like this :

$foo =~ /^([^a]*a[^a]*a[^a]*)a/;

and if we now look closely, we see a pattern [^a]*a which we can reuse with the Perl RE engine, as we must repeat that pattern n-1 times :

$m = $n -1; $foo =~ /^(([^a]*a){$m}[^a]*)a/;

Of course, as this pattern has to be recompiled every time we use it, we could as well use the above, unlooped pattern to match.

Update: 20020409 : Fixed small but important typo in the last line of code.$foo =~ /^(([^a]*a){m}[^a]*)a/; obviously won't match $m times...

perl -MHTTP::Daemon -MHTTP::Response -MLWP::Simple -e ' ; # The $d = new HTTP::Daemon and fork and getprint $d->url and exit;#spider ($c = $d->accept())->get_request(); $c->send_response( new #in the HTTP::Response(200,$_,$_,qq(Just another Perl hacker\n))); ' # web

In reply to Re: match n EMth/EM occurence by Corion
in thread match n EMth/EM occurence by arindamm

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.