From your descripton and examples, you might be able to simply assume that the last space delimited "word" is the title? In which case something like

my ($lastname, $firstnames, $title) = $name =~ /^\s*([^,]+),\s*(.+)\s+ +(\S+)\s*$/;

might work for you. (That's untested but should give the general idea).

If that fails, post a few examples of those that it fails on and someone will probably be able to improve it for you.

Update: I did a little testing and that regex seems to work provided that the title consists of a single "word" where word is defined as a string on non-space chars, and that is the last non-space string on the line. If you need to seperate out any middle initials, or if there are some without the title present, you'll need to match that word against a list (hash) of possible titles and decide whether it is part of the first names or initial or the title on that basis. If David Robert Smith uses his initials and omits his title, you have a problem:)


Examine what is said, not who speaks.

The 7th Rule of perl club is -- pearl clubs are easily damaged. Use a diamond club instead.


In reply to Re: Matching Doctors (MD, DR, M.D., DO etc) help by BrowserUk
in thread Matching Doctors (MD, DR, M.D., DO etc) help by efreed

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.