I need to write a module that will go to a specified web page or web site and slurp up the contents of a page while posing as a random browser client, so that if someone tries to be sneaky and use cloaking, the server will still cough up the cloaked page. It will then parse through the HTML, find any JavaScript code, and parse through that to see whether the page will produce any PopUps.

To write the module I thought it would be best to use WWW::Mechanize to grab the pages. Since I see that HTML::Parse has been marked as deprecated, I'm not sure whether there is something better for parsing the HTML page. Is there a module to parse JavaScript, or would I have to create one with Parse::RecDescent? I've looked all through CPAN and have found nothing that will parse JavaScript. It would be great to see some suggestions on how to go about this.
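To make the idea concrete, here is a rough, language-neutral sketch of the pipeline I have in mind, written in Python using only its standard library rather than WWW::Mechanize. Everything named in it is my own assumption for illustration: the `USER_AGENTS` pool, the helpers `fetch` and `find_popups`, the `ScriptCollector` class, and above all the regex heuristic, which only looks for `window.open(` and `showModalDialog(` and is not a real JavaScript parser.

```python
import random
import re
import urllib.request
from html.parser import HTMLParser

# Hypothetical pool of User-Agent strings; any realistic browser strings would do.
USER_AGENTS = [
    "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)",
    "Mozilla/5.0 (X11; Linux i686; rv:1.7) Gecko/20040101",
]

# Heuristic only: calls that commonly open a new window.
POPUP_PATTERN = re.compile(r"\bwindow\s*\.\s*open\s*\(|\bshowModalDialog\s*\(")


def fetch(url):
    """Fetch a page while posing as a randomly chosen browser."""
    request = urllib.request.Request(
        url, headers={"User-Agent": random.choice(USER_AGENTS)}
    )
    with urllib.request.urlopen(request, timeout=10) as response:
        return response.read().decode("utf-8", errors="replace")


class ScriptCollector(HTMLParser):
    """Collect inline <script> bodies and on* event-handler attributes."""

    def __init__(self):
        super().__init__()
        self.in_script = False
        self.scripts = []   # text of each inline <script> element
        self.handlers = []  # values of onload=, onclick=, and so on

    def handle_starttag(self, tag, attrs):
        if tag == "script":
            self.in_script = True
            self.scripts.append("")
        for name, value in attrs:
            if name.startswith("on") and value:
                self.handlers.append(value)

    def handle_endtag(self, tag):
        if tag == "script":
            self.in_script = False

    def handle_data(self, data):
        # Script text may arrive in several chunks, so append to the current body.
        if self.in_script:
            self.scripts[-1] += data


def find_popups(html):
    """Return the script snippets that match the popup heuristic."""
    collector = ScriptCollector()
    collector.feed(html)
    collector.close()
    candidates = collector.scripts + collector.handlers
    return [snippet for snippet in candidates if POPUP_PATTERN.search(snippet)]
```

Usage would be something like `find_popups(fetch(url))`. This sketch ignores external scripts loaded through `src=` and any popup code that is built up dynamically, which is exactly where I suspect a proper JavaScript parser would be needed.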

BMaximus

In reply to PopUp Detection by BMaximus
