in reply to How to improve the accuracy of Lingua::NamedEntity ?

I have a vague idea of what you are asking for, but instead of second-guessing, I'll offer some friendly advice:

  1. Please read How (Not) To Ask A Question
  2. Please wrap any code in <code> tags when posting.
  3. Please read How (Not) To Ask A Question
  4. Please post a sample of your current code that isn't giving the desired result.
  5. Please read How (Not) To Ask A Question
  6. Please ask a specific question, if you have one (I know what I mean. Why don't you?)
  7. Finally, please read How (Not) To Ask A Question

    Update: Oh, and:

  8. Please post questions in Seekers of Perl Wisdom, not Perl Monks Discussion
  9. Please try to use a meaningful title for your question.
  10. Please have a read of The Perl Monks Guide to the Monastery

Update 2:

Okay, having said all the above - you could simply do something like this:

#!/usr/bin/perl -w use strict; use Lingua::EN::NamedEntity; $/ = undef; my $text = <DATA>; my @entities = extract_entities($text); my @unwanted_entities = qw( Monday Tuesday Wednesday Thursday Friday S +aturday Sunday); for (@entities) { my $entity = ${$_}{entity}; if ( grep { $_ eq $entity } @unwanted_entities ) { print "Skipping unwanted entity: $entity\n"; } else { print "Valid entity: $entity\n" } }

And the data for the above code was taken from a "recent BBC News story". The output is as follows:

Valid entity: Mr Murakami Valid entity: Takafumi Horie Valid entity: Singapore Valid entity: Mr Horie Valid entity: Societe General Asset Management Valid entity: Asset Management Skipping unwanted entity: Friday Valid entity: Tokyo Stock Exchange Valid entity: Livedoor Valid entity: Yoshiaki Murakami Valid entity: Murakami Valid entity: Tokyo Valid entity: International Trade and Industry Ministry Valid entity: Akio Yoshino

Cheers,
Darren :)