Automatic categorisation is the panacea of Knowledge Management and it is something that a great many people are working on with a view to making some serious financial gain. The auto-cat software is therfore expensive but the blurb on the vendor's websites may be of interest.

I use search engine software from Verity who implement machine assisted categorisation in a workbench tool such that the output keyword net can be applied to content as it is indexed. This works well in a corporate environment where content doesn't change that much and you just want to locate it in a defined categorisation structure. Verity also have a 'social network' product that allows people to see locate subject matter experts. I haven't worked with this bit yet but the demo looked cool.

I have also looked at Autonomy who popularised Baysian techniques for clustering results. Their search engine works really well for newsfeeds where the clustering is generally unknown and fluid. The search results can appear really random until the internals have caught up with a new cluster of information. I am told that the BBC News website uses this technique to create the 'related stories' links on it's website.


In reply to Re: Re: Re: Machine learning by inman
in thread Machine learning by sri

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.