I have a vague idea of what you are asking for, but instead of second-guessing, I'll offer some friendly advice:

  1. Please read How (Not) To Ask A Question
  2. Please wrap any code in <code> tags when posting.
  3. Please read How (Not) To Ask A Question
  4. Please post a sample of your current code that isn't giving the desired result.
  5. Please read How (Not) To Ask A Question
  6. Please ask a specific question, if you have one (I know what I mean. Why don't you?)
  7. Finally, please read How (Not) To Ask A Question

    Update: Oh, and:

  8. Please post questions in Seekers of Perl Wisdom, not Perl Monks Discussion
  9. Please try to use a meaningful title for your question.
  10. Please have a read of The Perl Monks Guide to the Monastery

Update 2:

Okay, having said all the above - you could simply do something like this:

#!/usr/bin/perl -w use strict; use Lingua::EN::NamedEntity; $/ = undef; my $text = <DATA>; my @entities = extract_entities($text); my @unwanted_entities = qw( Monday Tuesday Wednesday Thursday Friday S +aturday Sunday); for (@entities) { my $entity = ${$_}{entity}; if ( grep { $_ eq $entity } @unwanted_entities ) { print "Skipping unwanted entity: $entity\n"; } else { print "Valid entity: $entity\n" } }

And the data for the above code was taken from a "recent BBC News story". The output is as follows:

Valid entity: Mr Murakami Valid entity: Takafumi Horie Valid entity: Singapore Valid entity: Mr Horie Valid entity: Societe General Asset Management Valid entity: Asset Management Skipping unwanted entity: Friday Valid entity: Tokyo Stock Exchange Valid entity: Livedoor Valid entity: Yoshiaki Murakami Valid entity: Murakami Valid entity: Tokyo Valid entity: International Trade and Industry Ministry Valid entity: Akio Yoshino

Cheers,
Darren :)


In reply to Re: How to improve the accuracy of Lingua::NamedEntity ? (was: How can i do it ??) by McDarren
in thread How to improve the accuracy of Lingua::NamedEntity ? by Anonymous Monk

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.