in reply to Algorithm inspiration required.
the best method I know of is using a suffix array (and longest common prefix array) with external memory. Here is a link to a research paper and the website. According to the paper it can handle a text size of 80 GiB using only 4 GiB of RAM. Since endless streams may contain infinite information, this algorithm cannot handle that. It only can work on a snapshot of the stream limited by memory (internal and external).
I would be surprised if that could be beaten by a different algorithm.
hexcoder
|
---|
Replies are listed 'Best First'. | |
---|---|
Re^2: Algorithm inspiration required.
by BrowserUk (Patriarch) on Jun 19, 2018 at 14:50 UTC | |
by hexcoder (Curate) on Jun 19, 2018 at 18:32 UTC | |
by BrowserUk (Patriarch) on Jun 19, 2018 at 19:32 UTC |