I've just implemented an experimental regex extension, which allows you
to do things like match balanced parentheses, without having to embed any
code in your regex. The implementation
isn't in perl (yet?), but
this is still a firm Perl topic, and it raises questions about what
features Perl's regexes ought to have.
I wrote a web page
about it, but here's the rough idea. You can treat each parenthesised group
in your regex as if it were a little subroutine, and make calls to it from
within your regex. The power comes from the fact that you can make recursive
calls, so you can write
/^([^()]|\((?1)*\))*$/
to match strings which have balanced parentheses, for example.
My favourite so far is a regex to detect palindromic sentences, ignoring
spacing and punctuation.
/^\W*(?:((.)\W*(?1)\W*\2|)|((.)\W*(?3)\W*\4|\W*.\W*))\W*$/i
If you want to try it out, follow the instructions
here.
Let me know what you think.
Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
Read Where should I post X? if you're not absolutely sure you're posting in the right place.
Please read these before you post! —
Posts may use any of the Perl Monks Approved HTML tags:
- a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
| |
For: |
|
Use: |
| & | | & |
| < | | < |
| > | | > |
| [ | | [ |
| ] | | ] |
Link using PerlMonks shortcuts! What shortcuts can I use for linking?
See Writeup Formatting Tips and other pages linked from there for more info.