in reply to Re: Contextual/categorical Histogram
in thread Contextual/categorical Histogram

Many thanks for the help. I'm looking to do a rolling contextual histogram as data arrives. This is like a poor man's data classifier.

For comparing 2 histograms, are there fast (there are Gb of lines) methods for doing intersect? xor? join?

Replies are listed 'Best First'.
Re^3: Contextual/categorical Histogram
by Athanasius (Archbishop) on Aug 04, 2014 at 04:22 UTC

    Performing an intersection, xor (symmetric difference), or join (union) operation on two histograms is fairly straightforward:

    However, it is doubtful that this approach will scale to accommodate hashes containing gigabytes of data. For that scenario, you should probably be looking to use a database.

    Hope that helps,

    Athanasius <°(((><contra mundum Iustus alius egestas vitae, eros Piratica,