You mentioned that your module will only return true/false and the calling program will decide what to do. However, it may help to look at the bigger picture, i.e. what the calling program is actually going to do with that answer.

This is one of the problems in dealing with natural language: so much depends on context. While certain words are only used in a vulgar context, there are also a lot of gray words besides "cock". You can even use normal words in a vulgar context. For example, "up yours" can offend, so now you have to match phrases, not just words.
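To illustrate the phrase-matching point, here is a minimal sketch. The phrase list and the name has_vulgar_phrase are made up for this example, and a real list would be much longer, so treat it as a starting point rather than a finished filter.

```perl
use strict;
use warnings;

# Hypothetical phrase list, for illustration only.
my @phrases = ('up yours');

# Build one alternation. Each phrase is split into words so that
# any run of whitespace between the words still matches.
my @alternatives;
for my $phrase (@phrases) {
    my @words = map { quotemeta($_) } split ' ', $phrase;
    push @alternatives, join '\s+', @words;
}
my $alt       = join '|', @alternatives;
my $phrase_re = qr/\b(?:$alt)\b/i;

# Returns 1 if any listed phrase occurs in the text, 0 otherwise.
sub has_vulgar_phrase {
    my ($text) = @_;
    return $text =~ $phrase_re ? 1 : 0;
}
```

The \b anchors keep "sign up yourself" from matching, and punctuation between the words ("Pick it up. Yours is over there.") also prevents a match. Whether that is strict enough depends on what the caller does with the result.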

If the positives are sent to a moderator, you could be strict without too much problem. If they're deleted, you may want to be more lenient. Also, knowing what the calling program does, you may be able to provide better solutions, such as obfuscating gray-area words (e.g. cocks becomes c****), or providing a true/false/maybe response.
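As a rough sketch of those two ideas, something like the following might work. The word lists, the placeholder "badword", and the names classify and mask_gray are all assumptions for illustration, not part of your module.

```perl
use strict;
use warnings;

# Hypothetical word lists, for illustration only.
my %always_vulgar = map { $_ => 1 } qw(badword);
my %gray_area     = map { $_ => 1 } qw(cock cocks);

# Returns 'true', 'false', or 'maybe' instead of a plain boolean.
sub classify {
    my ($text) = @_;
    my $verdict = 'false';
    for my $word ($text =~ /(\w+)/g) {
        my $w = lc $word;
        return 'true' if $always_vulgar{$w};      # no doubt, stop early
        $verdict = 'maybe' if $gray_area{$w};     # context-dependent word
    }
    return $verdict;
}

# Keep the first letter of each gray-area word and star out the rest.
sub mask_gray {
    my ($text) = @_;
    $text =~ s{(\w+)}{
        $gray_area{lc $1} ? substr($1, 0, 1) . '*' x (length($1) - 1) : $1
    }ge;
    return $text;
}

print classify("The cocks crow at dawn"), "\n";
print mask_gray("The cocks crow at dawn"), "\n";
```

The caller could then send 'maybe' results to a moderator and act on 'true' results automatically, or simply display the masked text.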

I don't mean to throw you off track, but sometimes it's too easy to get caught up in the details and miss other possible solutions to the overall problem.


In reply to Re: Regex solution needed by hangon
in thread Regex solution needed by spivey3587
