comment on

I took a look at your thread at that link but don't necessarily follow it all yet. I don't think I want to glob for any of this. Does this look like a fairly robust way to download images and rename them uniformly but with correct filetypes?

 $ perl lh2.pl
img: WWW::Mechanize::Image=HASH(0xa1fdc5c)
https://sites.google.com/site/lutherhavennm/_/rsrc/1255408739014/missi
+on/Attachment3.jpg?height=278&width=420 
ext: jpg?height=278&width=420
ext: jpg
img: WWW::Mechanize::Image=HASH(0xa207784)
https://sites.google.com/site/lutherhavennm/_/rsrc/1255386973661/missi
+on/Picture1.jpg?height=279&width=420 
ext: jpg?height=279&width=420
ext: jpg
img: WWW::Mechanize::Image=HASH(0xa2078ec)
https://sites.google.com/site/lutherhavennm/_/rsrc/1255408642180/missi
+on/Attachment10.jpg?height=280&width=420 
ext: jpg?height=280&width=420
ext: jpg
img: WWW::Mechanize::Image=HASH(0xa2074dc)
https://sites.google.com/site/lutherhavennm/_/rsrc/1255387202014/missi
+on/Looking%20up%20at%20the%20Bldg.JPG?height=315&width=420 
ext: JPG?height=315&width=420
ext: JPG
downloaded 4 images from https://sites.google.com/site/lutherhavennm/m
+ission
to folder site_17
$ cat lh2.pl
#!/usr/bin/perl -w
use strict;
use feature ':5.10';
use WWW::Mechanize;
use LWP::Simple;
use Errno qw[ EEXIST ];

# get information about images
my $domain = 'https://sites.google.com/site/lutherhavennm/mission';
my $m      = WWW::Mechanize->new();
$m->get($domain);
my @list = $m->images();

# create new folder and download images to it.
my $counter = 0;
my $dir     = &mk_new_dir;
for my $img (@list) {
     print "img: $img\n";
    my $url = $img->url_abs();
    print "$url \n";

    my $ext = ($url =~ m/([^.]+)$/)[0];
    print "ext: $ext\n";
    $ext =~ s/\?.+//;
    print "ext: $ext\n";
    $counter++;
    my $filename = $dir . "/image_" . $counter. '.' . $ext;
    getstore( $url, $filename ) or die "Can't download '$url': $@\n";
}

# output
print "downloaded ", $counter, " images from ", $domain, "\n";
print "to folder ", $dir, "\n";

sub mk_new_dir {
    my $counter2 = 1;
    while (1) {
        my $word = "site";
        my $name = $word . '_' . $counter2++;
        if ( mkdir $name, 0755 ) {
            return $name;    # success, return new dir name
        }
        else {
            next if $!{EEXIST};    # mkdir failed because file exists
            die sprintf "(%d) %s", $!, $!;    # other failure; bail ou
+t!
        }
    }
}
$
[download]

Is this how all images on the net really look to servers, with the size information on them as well? Where do I find information about the url_abs() method here? I don't see it in either of the modules used.

In reply to Re^2: renaming all files in a directory by Aldebaran
in thread renaming all files in a directory by Aldebaran

Are you posting in the right place? Check out Where do I post X? to know for sure.
Posts may use any of the Perl Monks Approved HTML tags. Currently these include the following:
<code> <a> <b> <big> <blockquote> <br /> <dd> <dl> <dt> <em> <font> <h1> <h2> <h3> <h4> <h5> <h6> <hr /> <i> <li> <nbsp> <ol> <p> <small> <strike> <strong> <sub> <sup> <table> <td> <th> <tr> <tt> <u> <ul>
Snippets of code should be wrapped in <code> tags not <pre> tags. In fact, <pre> tags should generally be avoided. If they must be used, extreme care should be taken to ensure that their contents do not have long lines (<70 chars), in order to prevent horizontal scrolling (and possible janitor intervention).
Want more info? How to link or How to display code and escape characters are good places to start.


"be consistent"
	PerlMonks