> but is there a "better" way to do it?
You could start with two input fields instead of trying to parse one.
Do you know the famous author "Orson Scott Card"?
Whenever I stumble over him, my neural parser puzzles again over which name category the "Scott" belongs to.
(I've looked it up already, every time...)
And names can get much harder than this ... see also Names_of_Sun_Yat-sen
> The obvious problem is that it fails with extended characters such as Zoë.
Well, the obvious issue is encoding.
Could be your HTML/HTTP settings or your script, or both.
Unicode rulez...
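To illustrate the encoding point, here is a minimal sketch in Python (not your script, which I haven't seen). It assumes the form data arrives as UTF-8 bytes; the names `ASCII_NAME`, `UNICODE_NAME` and `is_name` are made up for the example. The idea is the same in any language: decode the bytes to characters first, then match letters instead of the ASCII range only.

```python
import re

# Hypothetical patterns for illustration; the original script is not shown in the thread.
ASCII_NAME = re.compile(r"[A-Za-z]+")    # ASCII letters only
UNICODE_NAME = re.compile(r"[^\W\d_]+")  # word characters minus digits and underscore, i.e. letters


def is_name(raw: bytes, pattern=UNICODE_NAME, encoding="utf-8") -> bool:
    """Decode the submitted bytes first, then match on characters, not bytes."""
    text = raw.decode(encoding)  # assumption: the page and the request are both UTF-8
    return pattern.fullmatch(text) is not None


raw = "Zo\u00eb".encode("utf-8")   # "Zoë" as a UTF-8 form submission would deliver it
print(len(raw))                    # number of bytes, not characters
print(is_name(raw, ASCII_NAME))    # the ASCII-only pattern rejects the name
print(is_name(raw))                # the letter-based pattern accepts it
```

If the decode step raises an error or the letter-based pattern still fails, the bytes are probably not in the encoding you assumed, which points back at the HTML/HTTP settings.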