What format are the filings in? Plain text, Word documents (which version?), PDF, something else? How large are the filings? What languages are they written in? How standardised are they?
See, for example, the Lingua:: and Treex:: namespaces on CPAN for modules that could help you process natural language.
($q=q:Sq=~/;[c](.)(.)/;chr(-||-|5+lengthSq)`"S|oS2"`map{chr |+ord }map{substrSq`S_+|`|}3E|-|`7**2-3:)=~y+S|`+$1,++print+eval$q,q,a,
In reply to Re: Will it work? by choroba, in thread Perl Possibilities by Gideau