I was thinking about something along exactly these lines, so we may just be two talking @$$'$
What'd be interesting is trying to look for "contextual words", so does May refer to the month or the daughter, London is a place, or Jack London. It would be impossible to predict all of these ambiguities, so the "training" makes a lot of sense to me.
Of course, you will never achieve 100% accuracy but I don't think you want to.
Absolutely correct - we don't depend on this data, it just adds value when we can extract it.
thanks for the input
Clint
In reply to Re^2: Extracting structured data from unstructured text - just how difficult would this be?
by clinton
in thread Extracting structured data from unstructured text - just how difficult would this be?
by clinton
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |