Could this itself be a homework assignment?

The easiest approach is to spot a pattern in the URLs. A good "RESTful" site makes each resource available at its own URL. For example, ".../public_html/1112hcal/06222012/assignment.html" would lead directly to the assignment for 6/22/2012. Just mouse over a few of the assignment links to see whether their URLs follow a predictable pattern. If they do, you can simply request the URL for a given date and see whether it returns an assignment. Some form of this strategy is likely to work well for you.

If it turns out there is a predictable URL for each homework assignment, just use LWP::Simple (which you've already discovered) to grab the assignment. You could even poll once a day, requesting either the URL that would correspond to that day's date or the URL that would correspond to the next assignment you haven't fetched yet.
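Here is a minimal sketch of that idea, not a finished solution. It assumes the pattern really is one MMDDYYYY directory per assignment, as in the example path above. The base URL (http://example.com/1112hcal) and the sub names assignment_url and fetch_assignment are made up for illustration; substitute whatever the real site uses.

```perl
#!/usr/bin/perl
use strict;
use warnings;

# Build the URL for one date, ASSUMING an MMDDYYYY directory per assignment.
sub assignment_url {
    my ($base, $year, $month, $day) = @_;
    return sprintf '%s/%02d%02d%04d/assignment.html', $base, $month, $day, $year;
}

# Fetch one assignment page; returns the HTML, or undef if the request fails.
sub fetch_assignment {
    my ($base, $year, $month, $day) = @_;
    require LWP::Simple;    # loaded lazily so the URL builder works without it
    return LWP::Simple::get(assignment_url($base, $year, $month, $day));
}

# Run as: perl fetch_assignment.pl http://example.com/1112hcal
# (hypothetical base URL; nothing is fetched when no argument is given)
if (my $base = shift @ARGV) {
    my ($day, $month, $year) = (localtime)[3, 4, 5];
    my $html = fetch_assignment($base, $year + 1900, $month + 1, $day);
    print defined $html ? $html : "No assignment posted for today.\n";
}
```

Since LWP::Simple's get returns undef on any failure, a missing page and a network error look the same here. If you need to tell them apart, LWP::UserAgent gives you the response status.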

The harder way is to use something like HTML::LinkExtor to extract the links on an index page and determine which of the links pertain to homework assignments. Even that isn't too difficult; it's just not as automation-friendly as a nice RESTful approach.
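A rough sketch of the link-extraction route, again under assumptions: the sub name assignment_links, the sample HTML, the base URL, and the regex that decides what counts as a homework link are all placeholders I invented. Only the HTML::LinkExtor calls (new, parse, eof, links) come from the module itself.

```perl
use strict;
use warnings;
use HTML::LinkExtor;

# Return the absolute URLs of all <a href> links in $html that match $pattern.
sub assignment_links {
    my ($html, $base, $pattern) = @_;
    # No callback, so the parser collects links; $base makes them absolute.
    my $parser = HTML::LinkExtor->new(undef, $base);
    $parser->parse($html);
    $parser->eof;
    my @found;
    for my $link ($parser->links) {
        my ($tag, %attr) = @$link;    # each entry is [tag, attr => url, ...]
        next unless $tag eq 'a' && defined $attr{href};
        push @found, "$attr{href}" if $attr{href} =~ $pattern;
    }
    return @found;
}

# Hypothetical index page and pattern, for illustration only.
my $index = '<a href="06222012/assignment.html">June 22</a> '
          . '<a href="about.html">About</a>';
print "$_\n" for assignment_links(
    $index, 'http://example.com/1112hcal/', qr{/\d{8}/assignment\.html$});
```

In real use you would get the index page with LWP::Simple first and pass that HTML in. The hard part is the pattern: it has to be worked out by looking at the actual links on the page.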

To give any more specific advice, I think we would need to see what you have written so far and exactly where you are stuck.


Dave


In reply to Re: Getting Text from Website by davido
in thread Getting Text from Website by LaneM1234
