I have a file in which a large chunk of identical text occurs multiple times, and I'd like to remove every occurrence. I've written the following, which I think would work if not for the fact that it sucks up 1.2 GB of RAM in about 6 minutes and then crashes (Windows XP):
#!/usr/bin/perl
use strict;
my $a;
my $content;
open E, "<C:/auburn_courses_replace.txt" or die "Unable to open file $!";
undef $/;
$a = <E>;
$/ = "\n";
close E;
open F, "<C:/auburn_courses.txt" or die "Unable to open file $!";
undef $/;
$content = <F>;
$/ = "\n";
close F;
open G, ">>C:/auburn_courses2.txt" or die "Unable to open file $!";
#print $content;
#print $a;
$content =~ s/$a//g;
print G $content;
close G;
Anyone know how to do what I'm trying to do and succeed? :)
Update! All fixed, ignore me
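The update does not say what the fix was. One plausible culprit (an assumption, not something confirmed in this thread) is that $a is interpolated into s/$a//g as a regular expression, so any parentheses, dots, stars or question marks in the replacement file are treated as regex syntax rather than literal text. A minimal sketch of a literal removal using \Q...\E follows; the helper names remove_block and slurp are made up for illustration, and the file names are taken from the command line rather than hard-coded.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Hypothetical helper (not from the original post): remove every literal
# occurrence of $block from $content and return the result.
sub remove_block {
    my ($content, $block) = @_;
    return $content if $block eq '';   # an empty pattern has special meaning in Perl
    $content =~ s/\Q$block\E//g;       # \Q...\E escapes regex metacharacters in $block
    return $content;
}

# Hypothetical helper: read a whole file into a single string.
sub slurp {
    my ($path) = @_;
    open my $fh, '<', $path or die "Unable to open $path: $!";
    local $/;                          # slurp mode; $/ is restored when the sub returns
    my $text = <$fh>;
    close $fh;
    return $text;
}

# Only does file work when given exactly three arguments:
#   block-file  input-file  output-file
if (@ARGV == 3) {
    my ($block_file, $in_file, $out_file) = @ARGV;
    my $result = remove_block(slurp($in_file), slurp($block_file));
    open my $out, '>', $out_file or die "Unable to open $out_file: $!";
    print {$out} $result;
    close $out;
}
```

Saved under a name of your choosing, it would be run as: perl yourscript.pl C:/auburn_courses_replace.txt C:/auburn_courses.txt C:/auburn_courses2.txt. Note that it opens the output with '>' (overwrite) instead of the '>>' (append) used in the original, so repeated runs do not pile up copies. The lexical variable is also named $block rather than $a, since $a and $b are used by sort.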