in reply to Getting the innerHTML from the DOM , not the “source”

You need something that processes client-side javascript. WWW::Mechanize::Firefox comes to mind; I don't know it well enough to tell you for sure if it can do it, but it looks like a good option to investigate.

  • Comment on Re: Getting the innerHTML from the DOM , not the “source”

Replies are listed 'Best First'.
Re^2: Getting the innerHTML from the DOM , not the “source”
by Karels (Initiate) on Apr 25, 2013 at 16:45 UTC
    OK, so I am using WWW::Mechanize::Firefox and I am getting at the data I am after. From a processing POV I need to loop through several pages. I would like to update the contents of $mech-> with the contents of a new url--it there a way to do this? When I try re-issuing $mech->get with a new argument the program appears to hang... I don't see anything obvious in the Cookbook, examples, etc.

    Actually, now I get a warning:

    Subroutine MozRepl::__load_plugins redefined at C:/Perl/site/lib/Modul +e/Pluggable/Fast.pm line 104.
    Any thoughts would be welcome.