Using Text::CSV would probably be better. However, this regex approach, while more verbose, is perhaps more maintainable. Needs 5.10+ Special Backtracking Control Verbs. (I've made a guess at the proper regex for a A423.23 thingy.)
>perl -wMstrict -le
"my $s = 'The fox did it[ at 12.23 ] well, Cf. 23 A423.23. The '
. 'swallow was even better';
print qq{''$s''};
;;
my $parens = qr{ \[ [^]]* \] }xms;
my $cf = qr{ (?i) cf \. }xms;
my $ref = qr{ [[:alpha:]]+ \d+ (?: \. \d+)+ }xms;
my $splitter =
qr{ (?: $parens | $cf | $ref) (*SKIP)(*FAIL)
|
\.
}xms;
;;
my @ra = split $splitter, $s;
print qq{'$_'} for @ra;
"
''The fox did it[ at 12.23 ] well, Cf. 23 A423.23. The swallow was eve
+n better''
'The fox did it[ at 12.23 ] well, Cf. 23 A423.23'
' The swallow was even better'
-
Are you posting in the right place? Check out Where do I post X? to know for sure.
-
Posts may use any of the Perl Monks Approved HTML tags. Currently these include the following:
<code> <a> <b> <big>
<blockquote> <br /> <dd>
<dl> <dt> <em> <font>
<h1> <h2> <h3> <h4>
<h5> <h6> <hr /> <i>
<li> <nbsp> <ol> <p>
<small> <strike> <strong>
<sub> <sup> <table>
<td> <th> <tr> <tt>
<u> <ul>
-
Snippets of code should be wrapped in
<code> tags not
<pre> tags. In fact, <pre>
tags should generally be avoided. If they must
be used, extreme care should be
taken to ensure that their contents do not
have long lines (<70 chars), in order to prevent
horizontal scrolling (and possible janitor
intervention).
-
Want more info? How to link
or How to display code and escape characters
are good places to start.
|