I recently received some puzzled looks over my answer to this question. My regex was:
($wanted) = $line =~ /(\S*)/;
Two people questioned it explicitly; one of them said I was missing the ^ anchor.

This is perhaps the trickiest lesson to learn about regexes, partly because it's so simple and in its simplicity it is incorrectly interpreted. The * modifier matches 0 or more. It will always succeed, and it will always try to take as much as it can and still have the rest of the regex succeed. Furthermore, it will match as early in the string as possible.

In "this is cool", the regex matches "this". In "  is this cool?" it matches nothing, but successfully. Why doesn't it match "is"? Because the \S* matches at the very beginning of the string, where there are (quite plainly) zero non-whitespace characters. The "classic" example of this trickery is:

$_ = "fred xxxxxxx barney"; s/x*//; print; # fred xxxxxxx barney

_____________________________________________________
Jeff japhy Pinyan: Perl, regex, and perl hacker.
s++=END;++y(;-P)}y js++=;shajsj<++y(p-q)}?print:??;


In reply to Shouldn't \S* be ^\S* by japhy

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.