I second the earlier comments by
BrowserUK about
the inherent volatility of the data, and
would add just this: if you have a compiled "find" utility
available on your system (e.g. GNU find), try using that
in a "system()" call, and benchmark that
against whichever File::Find-based module you were planning
to use. If your results come out looking like
mine did,
you could save yourself a lot of runtime by avoiding the
module.
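
For anyone who wants to try the comparison, here is a rough sketch of how it might be set up with the core Benchmark module. It is only an illustration under some assumptions: the sub names, the default directory and the iteration count are all made up for the example, it reads the output of "find" through a pipe open rather than a plain "system()" call so that both approaches return a file count, and it assumes a "find" on your PATH that supports -print0 (GNU and BSD find both do).

```perl
use strict;
use warnings;
use Benchmark qw(cmpthese);
use File::Find ();

# Hypothetical defaults: directory to scan and number of timed runs.
my $dir        = shift(@ARGV) // '.';
my $iterations = shift(@ARGV) // 20;

# Walk the tree in-process with File::Find and count regular files.
# lstat is used so that symlinks are not followed, matching "find -type f".
sub count_with_module {
    my ($top) = @_;
    my $n = 0;
    File::Find::find(
        { wanted => sub { $n++ if lstat($_) && -f _ }, no_chdir => 1 },
        $top,
    );
    return $n;
}

# Run the external find utility and count the NUL-terminated names it prints.
sub count_with_find {
    my ($top) = @_;
    open(my $fh, '-|', 'find', $top, '-type', 'f', '-print0')
        or die "cannot run find: $!";
    local $/ = "\0";    # one record per file name
    my $n = 0;
    while (defined(my $name = <$fh>)) { $n++ }
    close($fh);
    return $n;
}

# Print a comparison table of the two approaches.
cmpthese($iterations, {
    'File::Find' => sub { count_with_module($dir) },
    'find(1)'    => sub { count_with_find($dir) },
});
```

Point it at a tree that resembles your real data, and run it more than once, since the first pass mostly measures how cold the filesystem cache is. On a tiny directory Benchmark will warn that there were too few iterations for a reliable count, so raise the second argument in that case.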