perl hashes take at least 1.5x the amount of space as the original data in memory -- so are you saying it makes more sense to you to build a huge hash to operate on in ram than it does to use a real database (on disk) because the database on disk will be too large? dont get it. With proper normilization the database should be smaller in size than the original data.