Parsing out first names

Anonymous Monk has asked for the wisdom of the Perl Monks concerning the following question:

Replies are listed 'Best First'.

Re: Parsing out first names
by prasadbabu (Prior) on Oct 13, 2006 at 05:23 UTC

I think Lingua::EN::NameParse also might help you.

Prasad

Re: Parsing out first names
by ysth (Canon) on Oct 13, 2006 at 04:44 UTC

/(.*) [a-zA-Z]\.?\z/
[download]

my ($f_name) = $name =~ m/...

And I think you mean to have (), not [], in the assignment to @names.

[reply]
[d/l]
[select]

Re: Parsing out first names
by grep (Monsignor) on Oct 13, 2006 at 04:44 UTC

split

popping

Also watch your brackets. You want () for an array.

my @names = ( "Mark K.", "Bob H", "Kurt", "Mary Kay K",
"Mary Jo Z.",  "Mary Jo" );

foreach ( @names ) {

  my ($f_name) = (split)[0];
  my ($l_name) = (split)[-1];
  print "$f_name \n";

}
[download]

grep

One dead unjugged rabbit fish later

[reply]
[d/l]
[select]

Re^2: Parsing out first names

by driver8 (Scribe) on Oct 13, 2006 at 07:17 UTC

Mark 
Bob 
Kurt 
Mary 
Mary 
Mary
[download]

[reply]
[d/l]

Re^3: Parsing out first names

by Hofmator (Curate) on Oct 13, 2006 at 07:42 UTC

grep

foreach ( @names ) {
  my @name_parts = split;
  pop @name_parts if $name_parts[-1] =~ /^[a-zA-Z]\.?$/; # throw last 
+name away
  my $f_name = join ' ', @name_parts;
  print "$f_name\n";
}
[download]

Updated code to reflect Not_a_Number's correction, thanks!

-- Hofmator

Code written by Hofmator and posted on PerlMonks is public domain. It is provided as is with no warranties, express or implied, of any kind. Posted code may not have been tested. Use of posted code is at your own risk.

[reply]
[d/l]

Re^4: Parsing out first names

by Not_a_Number (Prior) on Oct 13, 2006 at 09:19 UTC

Re^4: Parsing out first names

by driver8 (Scribe) on Oct 13, 2006 at 12:03 UTC

Re: Parsing out first names
by McDarren (Abbot) on Oct 13, 2006 at 04:42 UTC

"..The initial is always 1 letter with our without a period..."

Will it always be uppercase? (lets assume yes)

So how about this (untested)...

# Non-greedy capture of everything from the start of the string, until
+ we see
# A whitespace character
# followed by a single upper-case letter
# followed by an (optional) period
# followed by the end of the string
/^(.*?)\s[A-Z]\.?$/;
[download]

Cheers,
Darren :)

[reply]
[d/l]

Re: Parsing out first names
by awohld (Hermit) on Oct 13, 2006 at 04:54 UTC

#!/usr/bin/perl -w

use strict;
use Data::Dumper;

my @names = ( "Mark K.", "Bob H", "Kurt", "Mary Kay K",
"Mary Jo Z.",  "Mary Jo" );

foreach my $name ( @names ) {

  $name =~ s/ [a-zA-Z].?$//;
  print "$name \n";

}
[download]

Mark
Bob
Kurt
Mary Kay
Mary Jo
Mary
[download]

[reply]
[d/l]
[select]

Re^2: Parsing out first names

by driver8 (Scribe) on Oct 13, 2006 at 07:10 UTC

$name =~ s/ [a-zA-Z]\.?$//

[reply]
[d/l]

Re: Parsing out first names
by johngg (Canon) on Oct 13, 2006 at 14:15 UTC

(?:\s[A-Z]\.?)?\z

use strict;
use warnings;

my @names = (
   q{Mark K.},
   q{Bob H},
   q{Kurt},
   q{Mary Kay K},
   q{Mary Jo Z.},
   q{Mary Jo});

print
   qq{Original    Substituted Matched\n},
   qq{--------    ----------- -------\n};
foreach my $name (@names)
{
    (my $firstNameBySubs = $name) =~ s{(?:\s[A-Z]\.?)?\z}{};
    my ($firstNameByMatch) = $name =~ m{^(.*?)(?:\s[A-Z]\.?)?\z};
    printf qq{%-12s%-12s%-s\n}
       , $name
       , $firstNameBySubs
       , $firstNameByMatch;
}
[download]

This produces

Original    Substituted Matched
--------    ----------- -------
Mark K.     Mark        Mark
Bob H       Bob         Bob
Kurt        Kurt        Kurt
Mary Kay K  Mary Kay    Mary Kay
Mary Jo Z.  Mary Jo     Mary Jo
Mary Jo     Mary Jo     Mary Jo
[download]

Cheers,

JohnGG

[reply]
[d/l]
[select]

Re: Parsing out first names
by swampyankee (Parson) on Oct 13, 2006 at 17:49 UTC

Like grep. I'd use split, splitting on whitespace.

This will give you a list (which may have one element). Since your list doesn't contain surnames, I'd filter the list grep to eliminate initials, except when their elimination leaves nothing (I know people who use initials in lieu of their given names).

I've got a sample here:

#!perl

use strict;

use warnings;

my @names = (    'J Q', 'G Gordon', 'Mary Jane', 'Tommy K', 'Madonna',
        'George W.', 'Jacques-Yves');

foreach my $name (@names){

    my @nl = split(/\s+/, $name);

    my @list = grep { ! /^[A-Z][^A-Z]*$/i} @nl;

    pop(@nl) if (@list and $#nl > 0 and $nl[-1] =~ /^[A-Za-z][^A-Za-z]
+*$/);

    $name = join(' ', @nl);

    }

print join("\n", @names) . "\n";
[download]

J Q
G Gordon
Mary Jane
Tommy
Madonna
George
Jacques-Yves
[download]

I know it could be both better written and much shorter, but it's intended to be a demonstration, not production code.

emc

At that time [1909] the chief engineer was almost always the chief test pilot as well. That had the fortunate result of eliminating poor engineering early in aviation.

—Igor Sikorsky, reported in AOPA Pilot magazine February 2003.

[reply]
[d/l]
[select]