in reply to Re^2: Text Extraction
in thread Text Extraction

If you can define a "record separator" between the images,
then this should be easy. Let Perl find the images; you only
have to say where the images are delimited:

    my $fn = 'file.dat';
    $/ = "AZI\n";
    open my $fh, '<', $fn or die "cant stand smell of $!";
    my @images = <$fh>;
    close $fh;

    my $num = 1110;
    for my $img (@images) {
       open $fh, '>', ++$num .'.tiff' or die "can't dump image! $!";
       print $fh $img
    }

$/ sets the "image separator". Please check *exactly* which
characters are in the file: are there any line separators? Is this Unix/Linux?
On Windows, for example, you have to make sure to call binmode on the file handles ...
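
To see what $/ actually does to the data, here is a small sketch that reads from an in-memory string instead of a real file. The sample bytes and the "AZI\n" marker placement are made up for illustration; only the marker itself comes from the code above. It also shows that chomp strips the trailing separator, which you may or may not want depending on whether the marker belongs to the image.

```perl
use strict;
use warnings;

# Hypothetical sample data: three "images" separated by the marker "AZI\n".
my $data = "first\x00bytesAZI\nsecond\r\nbytesAZI\nthird";

# Open the string as an in-memory file and switch to raw byte mode.
open my $fh, '<', \$data or die "can't open in-memory file: $!";
binmode $fh;

my @records;
{
    local $/ = "AZI\n";      # record separator, restored at end of block
    while ( my $rec = <$fh> ) {
        chomp $rec;          # chomp removes a trailing $/, i.e. "AZI\n"
        push @records, $rec;
    }
}
close $fh;

printf "%d records\n", scalar @records;
```

Note that the "\r\n" inside the second record is left alone: with $/ set, ordinary line ends are just data.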

Another variant would be not to save the records in an array (which is unnecessary).
Like:

    ...
    while( my $img = <$fh> ) { # read one record
       # [update $fh => $ih]
       open my $ih, '>', ++$num .'.tiff' or die "can't dump image! $!";
       print $ih $img
    }
    ...

I'll leave that one as an exercise for you ...
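
For comparison, one possible way to fill in the dots, as a sketch only: the helper name split_records and its parameters are made up here, and it assumes the separator should stay at the end of each written record, as in the code above.

```perl
use strict;
use warnings;

# Hypothetical helper: split $file at each $sep and write every record to
# "<n>.tiff" in $outdir, numbering from $first. Returns the names written.
sub split_records {
    my ($file, $sep, $outdir, $first) = @_;
    my $num = $first;
    my @written;

    open my $fh, '<', $file or die "can't open $file: $!";
    binmode $fh;                 # raw bytes, matters on Windows
    local $/ = $sep;             # one record per read from $fh

    while ( my $img = <$fh> ) {
        my $name = "$outdir/$num.tiff";
        $num++;
        open my $ih, '>', $name or die "can't dump image to $name: $!";
        binmode $ih;
        print {$ih} $img;        # record still ends with $sep
        close $ih or die "can't close $name: $!";
        push @written, $name;
    }
    close $fh;
    return @written;
}
```

Called as split_records('file.dat', "AZI\n", '.', 1111), it should produce the same file names as the loop above, which pre-increments from 1110.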

Regards
mwa

(updated to correct stupid copy/paste error in second code block).

Re^4: Text Extraction
by karavay (Beadle) on Sep 25, 2007 at 20:22 UTC
    Thanks a lot for the tips - I'll take over from here :)