Welcome to the Monastery, EagerforPerl!

You've provided code, that's awesome (so is formatting it well!).

You're also using (at a quick glance) the majority of proper techniques (strict, warnings, 3-arg open etc ++).

What I'd ask you to do so the Monks may be better able to help is tell us what the code currently does, and how it deviates from what you're expecting. It would also be beneficial if you could provide the data that you're sending in as standard input so the Monks can test for themselves. If the URLs/input are off-limits somehow, that's understandable too... you'll just have to provide more detail on the expected/problematic situations.

ps. You do not need  sub main {... in Perl. If your file does not contain only a package (class), the code will run just fine without a main() function. You can just put your code left-justified (unlike eg: C).

pps. I would recommend, despite what I said above, one change to the 3-arg open you use. Bareword file handles (ie., things like OUTPUT are global in scope. It is best-common-practice to use lexical (ie. scoped) handles instead. To do this, simply assign a scalar variable to hold the handle as opposed to the bareword: open my $fh, '...', '...' or die ...


In reply to Re: Help with Web Scraping Script by stevieb
in thread Help with Web Scraping Script - Updated by EagerforPerl

Title:
Use:  <p> text here (a paragraph) </p>
and:  <code> code here </code>
to format your post, it's "PerlMonks-approved HTML":



  • Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
  • Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
  • Read Where should I post X? if you're not absolutely sure you're posting in the right place.
  • Please read these before you post! —
  • Posts may use any of the Perl Monks Approved HTML tags:
    a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
  • You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
            For:     Use:
    & &amp;
    < &lt;
    > &gt;
    [ &#91;
    ] &#93;
  • Link using PerlMonks shortcuts! What shortcuts can I use for linking?
  • See Writeup Formatting Tips and other pages linked from there for more info.