
The file is a container of TIFF images. Each image's location is marked by AZII (the image border), so what I need to do is extract multiple images from one file:
AZII
..
..(image 1)
..
..
AZII
..
..(image 2)
..
..
AZII - > etc
..

Speed is not important if the extraction process is not tooooo slow :)
Thanks

Re^3: Text Extraction
by mwah (Hermit) on Sep 25, 2007 at 18:36 UTC
    If you can define a "record separator" between the images,
    then this should be easy. Let Perl find the images; you only
    have to say where the images are delimited:
    my $fn = 'file.dat';
    $/ = "AZII\n";    # image separator: AZII as in the question, the trailing newline is a guess
    open my $fh, '<', $fn or die "can't stand smell of $!";
    my @images = <$fh>;    # one array element per record
    close $fh;
    my $num = 1110;
    for my $img (@images) {
        open $fh, '>', ++$num . '.tiff' or die "can't dump image! $!";
        print $fh $img
    }
    The $/ sets the "image separator". Please check which characters
    are *exactly* in the file. Are there any line separators? Is this Unix/Linux?
    On Windows, for example, you have to make sure to put the filehandles into binmode ...
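
    A minimal sketch pulling those points together, under assumptions the
    thread does not confirm: the delimiter is the bare four bytes AZII with
    no newline after it, the delimiter itself should not end up in the
    output files, and the bytes AZII never occur inside the image data.
    The sub name split_images and the output naming are made up for
    illustration.

```perl
use strict;
use warnings;

# Split a container file into one output file per delimited record.
# $sep is the assumed delimiter (e.g. 'AZII'); check the real bytes in your file.
sub split_images {
    my ($in_file, $sep, $out_dir) = @_;

    open my $in, '<', $in_file or die "can't open $in_file: $!";
    binmode $in;                  # raw bytes; this matters on Windows
    local $/ = $sep;              # readline now returns one record per call

    my @written;
    my $num = 0;
    while ( my $img = <$in> ) {
        chomp $img;               # chomp strips a trailing $/, i.e. the delimiter
        next unless length $img;  # the file starts with a delimiter, so the first record is empty

        my $out_file = sprintf '%s/%04d.tiff', $out_dir, ++$num;
        open my $out, '>', $out_file or die "can't write $out_file: $!";
        binmode $out;
        print {$out} $img;
        close $out or die "can't close $out_file: $!";
        push @written, $out_file;
    }
    close $in;
    return @written;
}
```

    Called as split_images('file.dat', 'AZII', '.'), it returns the list of
    files it wrote. Setting $/ with local keeps the changed separator from
    leaking into the rest of the program.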

    Another variant would be not to save the records in an array (which is unnecessary).
    Like:
    ...
    while ( my $img = <$fh> ) {    # read one record
        # [update $fh => $ih]
        open my $ih, '>', ++$num . '.tiff' or die "can't dump image! $!";
        print $ih $img
    }
    ...
    I'll leave that one to your own exercise ...

    Regards
    mwa

    (updated to correct stupid copy/paste error in second code block).
      Thanks a lot for the tips - I'll take over from here :)