I'd suggest that you pick from two broad choices. WordNet::SenseRelate or the less complicated approach using an exception list(freely available, just Google). The learning curve on the first is steepish and the results of the second are an '80%' solution but easy to put in place. A particularly good source of information is the journal 'Computers and the Humanities' (which may have stopped or changed names, your friendly reference librarian will know) found in most college libraries. It is almost entirely devoted to textual analysis of which a concordance is almost a beginners tool.
Second from chess software this is my favorite subject---I've always wanted to generate one for the Lord of the Rings (and related) just for grins. It was also a good way to learn IBM 360 assembler ;)!
--hsm
"Never try to teach a pig to sing...it wastes your time and it annoys the pig."
-
Are you posting in the right place? Check out Where do I post X? to know for sure.
-
Posts may use any of the Perl Monks Approved HTML tags. Currently these include the following:
<code> <a> <b> <big>
<blockquote> <br /> <dd>
<dl> <dt> <em> <font>
<h1> <h2> <h3> <h4>
<h5> <h6> <hr /> <i>
<li> <nbsp> <ol> <p>
<small> <strike> <strong>
<sub> <sup> <table>
<td> <th> <tr> <tt>
<u> <ul>
-
Snippets of code should be wrapped in
<code> tags not
<pre> tags. In fact, <pre>
tags should generally be avoided. If they must
be used, extreme care should be
taken to ensure that their contents do not
have long lines (<70 chars), in order to prevent
horizontal scrolling (and possible janitor
intervention).
-
Want more info? How to link
or How to display code and escape characters
are good places to start.
|