in reply to Perl Possibilities
What format are the filings in: plain text, Word documents (which version?), PDF, something else? How large are they? What natural languages are they written in? How standardised are they?
See, for example, the Lingua and Treex namespaces on CPAN for modules that could help you process natural language.
($q=q:Sq=~/;[c](.)(.)/;chr(-||-|5+lengthSq)`"S|oS2"`map{chr |+ord }map{substrSq`S_+|`|}3E|-|`7**2-3:)=~y+S|`+$1,++print+eval$q,q,a,