The memory footprint may not grow very fast, but it will most probably grow, because the %seen hash is very likely to get larger as the file gets bigger (unless most of the additional input consists of duplicates).
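
To make that concrete, here is a minimal sketch. It assumes %seen is used in the usual "print each line only the first time it appears" idiom, which may not match the OP's code exactly. The dedup_lines helper and the sample data are made up for illustration.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical helper (not from the OP): print each distinct line of $in
# to $out once, and return the number of keys that ended up in %seen.
sub dedup_lines {
    my ($in, $out) = @_;
    my %seen;
    while (my $line = <$in>) {
        # Only the current line is buffered, but every distinct line
        # stays in %seen as a key until the loop is done.
        print {$out} $line unless $seen{$line}++;
    }
    return scalar keys %seen;
}

# In-memory sample data standing in for the real input file.
my $sample = "a\nb\na\nc\nb\n";
open my $in, '<', \$sample or die "Cannot open sample: $!";

my $unique = '';
open my $out, '>', \$unique or die "Cannot open output buffer: $!";

my $distinct = dedup_lines($in, $out);
close $out;
close $in;

print $unique;
print "distinct keys: $distinct\n";
```

Reading line by line keeps the read buffer small, but the number of keys in %seen follows the number of distinct lines, so memory still scales with the amount of unique data rather than staying flat.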
Correction accepted; I was thinking only in terms of reading in the file, since in the OP, post-reading data storage seemed to be moot. But you are quite right.
The way forward always starts with a minimal test.