saranperl has asked for the wisdom of the Perl Monks concerning the following question:

This node falls below the community's threshold of quality. You may see it by logging in.

Replies are listed 'Best First'.
Re: reading file
by moritz (Cardinal) on Aug 04, 2009 at 07:31 UTC
    i tried by "open file ">file name" while(file){}" but it take too much time

    That's understandable, while (file) { ... } is an infinite loop that doesn't read anything. I suggest using  while (<file>) { ... } instead.
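    A minimal sketch of that read loop. The sample file and its contents are made up here purely so the example is self-contained and runnable; in a real program you would open your own data file instead:

```perl
use strict;
use warnings;

# Create a small sample file first so this sketch runs as-is;
# in practice you'd open the file you already have.
open my $out, '>', 'sample.txt' or die "Can't write sample.txt: $!";
print $out "first line\nsecond line\n";
close $out;

# <$fh> reads one line per iteration and returns undef at
# end-of-file, so this loop terminates (unlike while (file)).
my $count = 0;
open my $fh, '<', 'sample.txt' or die "Can't open sample.txt: $!";
while ( my $line = <$fh> ) {
    chomp $line;
    $count++;        # process $line here
}
close $fh;
print "read $count lines\n";
```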

    A reply falls below the community's threshold of quality. You may see it by logging in.
Re: reading file
by dsheroh (Monsignor) on Aug 04, 2009 at 09:07 UTC
    If you profile your program (after removing the infinite loop that moritz pointed out), you will probably find that it spends most of its time waiting for data to be read from or written to the hard drive. File I/O takes a certain amount of time (and is relatively slow), and there's not really anything you can do about that in Perl. In some cases, operating-system-level tuning may help a little, but even that rarely does much, since this is primarily a hardware limitation.

    To illustrate this slowness:

    $ time cat [a random 700M file] > /dev/null
    real    0m46.649s
    user    0m0.020s
    sys     0m2.278s
    Scaling that linearly, for a 1G file we could reasonably expect the time required to be on the order of 67 seconds. And that's just to read the contents of the file and throw them away without doing any processing.

    For comparison, try copying your existing file to a new file using your operating system's normal file-copy tools. Your program does a copy plus some other work in the process, so you can't reasonably expect it to run faster than a plain copy.

    If it's not purely an I/O issue, then anything that can be done in your code to speed things up will be inside the while loop, which you haven't shown us.

Re: reading file
by roboticus (Chancellor) on Aug 04, 2009 at 12:49 UTC
    saranperl:

    You don't even need to use perl for that. If you're on a *NIX box, you could use the following command:

    sort -u original_file >new_file

    But since you're here, you might want to know how to do it in perl. So you could do it in roughly the same way the sort command does it:

    1. Read the file into an array
    2. Sort the array, so all duplicates will be next to each other
    3. Scan through the array and remove adjacent duplicates
    4. Write the array to the new output file
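    Those four steps can be sketched in Perl roughly like this. The dedupe logic is pulled into a sub so it's easy to test, and a tiny sample input is written first so the sketch is self-contained; the filenames are the same placeholders as in the sort command above:

```perl
use strict;
use warnings;

# Set up a tiny sample input so this sketch runs as-is;
# in practice 'original_file' would already exist.
open my $setup, '>', 'original_file' or die $!;
print $setup "pear\napple\npear\nbanana\napple\n";
close $setup;

# Steps 2 and 3: sort so duplicates become adjacent, then keep
# only the first line of each run of equal lines.
sub sorted_unique {
    my @sorted = sort @_;
    my @unique;
    for my $line (@sorted) {
        push @unique, $line if !@unique or $line ne $unique[-1];
    }
    return @unique;
}

# Step 1: read the whole file into an array.
open my $in, '<', 'original_file' or die "Can't open original_file: $!";
my @lines = <$in>;
close $in;

# Step 4: write the sorted, de-duplicated lines to the new file.
open my $out, '>', 'new_file' or die "Can't open new_file: $!";
print $out sorted_unique(@lines);
close $out;
```

    Note that this slurps the entire file into memory, so a 1G file on a 2G machine is borderline; that's one reason the external sort -u, which spills to temporary files when memory runs low, may be the safer bet.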

    Give it a try and let us know if and where you get stuck!

    ...roboticus
Re: reading file
by vinoth.ree (Monsignor) on Aug 04, 2009 at 07:34 UTC
Re: reading file
by vinoth.ree (Monsignor) on Aug 04, 2009 at 07:03 UTC

    What is your RAM size?

      2 GB

        Whenever you update your post, please let us know. You did update your post, right?
