in reply to Re^10: n-dimensional statistical analysis of DNA sequences (or text, or ...)
in thread n-dimensional statistical analysis of DNA sequences (or text, or ...)

Now in github you will find the data and states dirs as well.

It dramatically increases the size of the distro, but you're still less than a meg:

$ ls A* -l -rw-rw-r-- 1 bob bob 938559 Jul 6 17:44 'Algorithm-Markov-Multiorder- +Learner-master(1).zip' -rw-rw-r-- 1 bob bob 28855 Jun 28 14:11 Algorithm-Markov-Multiorder- +Learner-master.zip $
If you install it (make install) then it should be in a dir which is in your path. Then just call it analyse_text.pl

This, I did not realize...

$ pwd /home/bob/Documents/meditations/Algorithm-Markov-Multiorder-Learner-ma +ster/data $ analyse_text.pl --input-corpus 2.short.shelley.txt --ngram-length 8 + --output-state 2.short.state >8.txt $

and bingo:

$ cat 8.txt { "counts" => { "air|we|breathe|They|have|acquired|new|and" + => 1, "already|marked|I|will|pioneer|a|new|way" + => 1,