Anonymous Monk has asked for the wisdom of the Perl Monks concerning the following question:

Hi Peeps,

I was wondering if you would be able to help me with someting. I have been ripping my hair out over it and i can't find a way to do it. I have a text file that contains a load of crap as well as some e-mail addresses that i want to keep. Below is a section of the file:
","lliu5@yahoo.com","lliu5@yahoo.com","SMTP","support@domain.com","sup +port@do= main.com","SMTP" ","ishmale423@excite.com","ishmale423@excite.com","SMTP","support@doma +in.com"= ,"support@domain.com","SMTP"
Now, the idea is that i want to extract all e-mail addresses apart from ones ending in @domain.com. Each e-mail address is entered twice, so idealy i would like it to do something like: pick first e-mail address => write it to file seperated with return => skip second, third and fourth e-mail address. So in the final file i would end up with:
lliu5@yahoo.com ishmale423@excite.com
Help is VERY much appreciated :)

Replies are listed 'Best First'.
Re: Search And Export - Perl Needed?
by Coruscate (Sexton) on May 11, 2003 at 19:26 UTC

    This is what came to mind for me. You could change the line that states my %addrs = map { $_ => 1 } @addrs to my %addrs; @addrs{@addrs} = (1) x @addrs; to do a hash slice instead. I just felt like a map ;)

    #!/usr/bin/perl -w use strict; my $addrs = do { local $/; <DATA> }; my @addrs = grep { !/domain\.com\z/ } $addrs =~ /"(\w+\@[\w\.]+)"/g; my %addrs = map { $_ => 1 } @addrs; print join("\n", keys %addrs), "\n"; __DATA__ ","lliu5@yahoo.com","lliu5@yahoo.com","SMTP","support@domain.com","sup +port@do= main.com","SMTP" ","ishmale423@excite.com","ishmale423@excite.com","SMTP","support@doma +in.com"= ,"support@domain.com","SMTP"


    If the above content is missing any vital points or you feel that any of the information is misleading, incorrect or irrelevant, please feel free to downvote the post. At the same time, please reply to this node or /msg me to inform me as to what is wrong with the post, so that I may update the node to the best of my ability.

      Cheers for that. Even better, could the addresses be piped straight to sendmail?
Re: Search And Export - Perl Needed?
by Ovid (Cardinal) on May 11, 2003 at 19:35 UTC

    It would help if you showed us what you've already tried, if anything. The basic structure of the program is fairly straightforward, but the actual implementation can vary, depending upon how the input data looks and how you need output. Does your input data really begin with ","lliu5@yahoo.com","? If so, that's a strange beginning to the line. In any event, here's one structure for doing this (pseudo-code):

    foreach line of input:
        split line into tokens
        foreach token in tokens
            if token is valid email and not from "domain.com"
                add token to hash
            end if
        end foreach token
    end foreach line

    A straight-forward implementation would go something like this (untested):

    use Text::CSV_XS; use Email::Valid; my %email; my $csv = Text::CSV_XS->new; while (my $line = <DATA_FILE>) { chomp $line; if (my $status = $csv->parse($line)) { my @columns = $csv->fields; foreach my $column (@columns) { $email{$column} = 1 if good_email($column); } } else { warn "Could not parse ",$csv->error_input,"\n"; } } sub good_email { my $email = shift; if (Email::Valid->address($email) && $email !~ /domain\.com$/i) { return $email; } return; }

    When that's done, your email hash keys have all unique emails. However, the list is unsorted and I my domain check is a bit dodgy, but this does give you an idea of how to proceed.

    Unlike Coruscate's solution, this is fairly procedural, but it's also very easy to read.

    Cheers,
    Ovid

    New address of my CGI Course.
    Silence is Evil (feel free to copy and distribute widely - note copyright text)

      cheers Coruscate that worked. I now have the e-mail addresses in a text file. what would be the best way to get a script to take each one of these addresses one by one and pass it to sendmail? i want to be able to send an e-mail to each one of the addresses in that file.