I need to write a module that will go to a specified web page or web site, slurp up the contents of a page posing as a random browser client so that if someone tries to be sneaky and use cloaking that the server will still cough up the cloaked page. It will parse through the HTML and find any JavaScript code and then parse through that to see if the page will produce any PopUps. To write the module I thought it would be best to use
has been marked as deprecated I'm not sure if there is something better to parse out the HTML page. Is there a module to parse out JavaScript or would I have to create one with
. I've looked all through
and have found nothing that will parse through JavaScript. It would be great to see some suggestion as to go about this.