The hard way to approach this problem is to put a logging proxy (written with, for example, HTTP::Proxy) or a network sniffer (for example, ethereal or something built on Net::Pcap) between your browser and the network, see what gets sent between the browser and the server, and replay that from your script.

This is hard because you have to set up the extra tooling first, and then pick the relevant requests out of a lot of unrelated traffic.
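For what it's worth, a logging proxy along those lines could look roughly like the following. This is only a minimal sketch: the sub name make_logging_proxy and port 8080 are my own choices, and it logs only the request line and headers to STDERR, not request bodies.

```perl
use strict;
use warnings;
use HTTP::Proxy;
use HTTP::Proxy::HeaderFilter::simple;

# Hypothetical helper: build a proxy that logs every outgoing request.
sub make_logging_proxy {
    my ($port) = @_;
    my $proxy = HTTP::Proxy->new( port => $port );

    # A request header filter sees each request before it is forwarded.
    $proxy->push_filter(
        request => HTTP::Proxy::HeaderFilter::simple->new(
            sub {
                my ( $self, $headers, $message ) = @_;
                print STDERR $message->method, ' ', $message->uri, "\n",
                    $headers->as_string, "\n";
            }
        ),
    );
    return $proxy;
}
```

To use it, call make_logging_proxy(8080)->start; (which blocks until interrupted), point the browser's HTTP proxy setting at localhost:8080, and then click through the pages by hand while watching the log.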

The easy way is to use WWW::Mechanize, which tries relatively hard to emulate a browser. It already handles cookies for you, and it has easy ways of masquerading as a browser as well.
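As a small illustration of the masquerading part, here is a sketch that sets a browser-like User-Agent through agent_alias. The sub name make_browser_like_agent and the choice of the 'Windows Mozilla' alias are just assumptions for the example; whether the target site cares about the User-Agent at all is something you would have to find out by trying.

```perl
use strict;
use warnings;
use WWW::Mechanize;

# Hypothetical helper: a Mechanize object that identifies itself as a browser.
sub make_browser_like_agent {
    # autocheck => 1 makes failed requests die instead of failing silently
    my $agent = WWW::Mechanize->new( autocheck => 1 );

    # Replace the default libwww-perl User-Agent with a canned browser string
    $agent->agent_alias('Windows Mozilla');
    return $agent;
}

my $agent = make_browser_like_agent();
print "User-Agent: ", $agent->agent, "\n";

# A cookie jar is created by default, so cookies persist across requests
print "Cookie jar: ", ref( $agent->cookie_jar ), "\n";
```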

If a script using WWW::Mechanize does not work, then you will have to fall back on the above solutions.

I can't test it from here, but I think that the following WWW::Mechanize script recreates what your script does:

    use strict;
    use warnings;
    use WWW::Mechanize;

    my $agent = WWW::Mechanize->new();
    my $url   = 'http://www.accountancy.smu.edu.sg/facultystaff/faculty.htm';

    $agent->get($url);
    print "Got return code ", $agent->code, "\n";

    # Save the fetched page to staff.txt
    open my $fh, '>', 'staff.txt' or die "Cannot write staff.txt: $!";
    print {$fh} $agent->content;
    close $fh;

In reply to Re: Help with LWP::UserAgent by Corion
in thread Help with LWP::UserAgent by shu
