If I were going to approach the problem, I'd define "Meaningful Content" in terms of it's characteristics (it appears in a <title> or <subtitle>, it uses a particular CSS strophe, it has an XPATH that looks like ___, etc).
Then, just do it.
----
I Go Back to Sleep, Now.
OGB
In reply to Re: Create a dictionary from wikipedia
by Old_Gray_Bear
in thread Create a dictionary from wikipedia
by vit
| For: | Use: | ||
| & | & | ||
| < | < | ||
| > | > | ||
| [ | [ | ||
| ] | ] |