If you only need a rough estimation of the split files, you can use the input file size and tell to figure out where in the input file you are and when to split. This should save you from needing to read through the file twice.

Another option is to open up all output files at the start and just cycle through writing one input line to each output file in turn. This should also save you from needing to read through the file twice.

Finally, you mention @ARGV which has me a little confused. If the list of words is coming in @ARGV, I believe most OS have a limit to how many ARGS you can pass to a process, which is generally pretty low. I'm assuming your changing from passing in words via @ARGV to using a file.

Fendaria

In reply to Re: splitting up data... by Fendaria
in thread splitting up data... by chinamox

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.