in reply to statistics of a large text
Does your current code work for your 1 GB input file? How long does it take?