Beefy Boxes and Bandwidth Generously Provided by pair Networks
Your skill will accomplish
what the force of many cannot
 
PerlMonks  

Re^4: Need help with WWW::Mechanize and Chrome cookies

by marto (Cardinal)
on Jul 09, 2021 at 11:42 UTC ( #11134848=note: print w/replies, xml ) Need Help??


in reply to Re^3: Need help with WWW::Mechanize and Chrome cookies
in thread Need help with WWW::Mechanize and Chrome cookies

The latter is a hyperlink to another page/resource, you would never want the 'Save Complete page' method to follow links like that, it's not what it's for. Saving the same page in a browser will not save hyperlink targets.

  • Comment on Re^4: Need help with WWW::Mechanize and Chrome cookies

Replies are listed 'Best First'.
Re^5: Need help with WWW::Mechanize and Chrome cookies
by bakiperl (Beadle) on Jul 09, 2021 at 13:48 UTC
    In this case, is there a different approach to download the hyperlink targets from within WMC?

      Either find the links, get them, save them, or inject something like this and call it from the page for each target you've identified, or submit a patch to add the required functionality to this module, or choose something else to achieve your goal. Unless you need JavaScript there should be alternatives, but your post lacks enough detail to expand on that.

        The links for these files
        <a class="txt" href="file.txt"> Text File </a>
        can be obtained using the WMC instance by doing something like this
        my @links = $mech->find_all_links( text_contains => 'some description +etc... ' ); my @urls = map { $_->[0] } @links;
        In the case of WWW::Mechanize (WM) you can simply download the files using this code
        for my $foo (@urls) { my $filename = '/path/'.$foo; $mech->get($foo, ':content_file'=>$filename); }
        Unfortunately, this function does not work with WWW::Mechanize::Chrome (WMC). I hope the Author of WMC can shed some light on this or provide a patch. Thank you.

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://11134848]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this? | Other CB clients
Other Users?
Others drinking their drinks and smoking their pipes about the Monastery: (3)
As of 2022-05-27 01:09 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?
    Do you prefer to work remotely?



    Results (94 votes). Check out past polls.

    Notices?