in reply to more screen scraping with embedded Javascript
update: The SpiderMonkey JavaScript engine only does JavaScript. It has no concept of a browser: that means no document, no DOM, no HTML forms. A simple document.write() will not work because there is no document object. You might be able to extract the script from the HTML page, hand it a fake document object and have the script write to that (provided it doesn't try to handle any events, or read from or write to the DOM, or anything like that), and then have that document object return its content to you.
Then you will have to figure out where the written pieces go in your HTML form, pass the resulting page into WWW::Mechanize, convince WWW::Mechanize that the page you've just created is actually located on a remote server (not that hard, probably) and submit the form.
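To make the fake document idea concrete, here is a minimal sketch in plain JavaScript. Everything in it is an assumption for illustration: `makeFakeDocument`, `getContent` and the sample `scriptSource` are made-up names, not part of SpiderMonkey or any browser API, and it only covers scripts that do nothing but call document.write().

```javascript
// Hypothetical stand-in for the browser's document object.
// It only records what the script writes; there is no DOM behind it.
function makeFakeDocument() {
  var chunks = [];
  return {
    write: function () {
      // document.write() accepts any number of arguments and concatenates them.
      for (var i = 0; i < arguments.length; i++) {
        chunks.push(String(arguments[i]));
      }
    },
    writeln: function () {
      this.write.apply(this, arguments);
      chunks.push("\n");
    },
    // Not a browser method: hands the collected text back to the caller.
    getContent: function () {
      return chunks.join("");
    }
  };
}

// Script text as it might be extracted from the page (made up for this example).
var scriptSource = "document.write('<input name=token value=' + (6 * 7) + '>');";

var fakeDocument = makeFakeDocument();
// Run the script with the name "document" bound to the fake object.
new Function("document", scriptSource)(fakeDocument);
var written = fakeDocument.getContent();
```

A script that touches anything else on the document (forms, cookies, events) would fail here with an error about an undefined property, which is at least a quick way to find out whether this approach is viable for a given page.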
Repeat until you've reached the last page.
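The "figure out where the written pieces go" step could look roughly like the sketch below, assuming the only scripts that matter are inline ones and that each just calls document.write(). `spliceScriptOutput` and the sample `page` are hypothetical; the regex is a crude way to find script blocks, scripts loaded via a src attribute are not fetched, and each script runs in its own scope, so scripts that share variables will not work.

```javascript
// Hypothetical helper: replace each inline <script> block with whatever
// that script passed to document.write().
function spliceScriptOutput(html) {
  var scriptBlock = /<script\b[^>]*>([\s\S]*?)<\/script>/gi;
  return html.replace(scriptBlock, function (whole, source) {
    var chunks = [];
    // Minimal fake document: it only collects written text.
    var fakeDocument = {
      write: function () {
        for (var i = 0; i < arguments.length; i++) {
          chunks.push(String(arguments[i]));
        }
      }
    };
    // Run the script body with "document" bound to the fake object.
    new Function("document", source)(fakeDocument);
    return chunks.join("");
  });
}

// Made-up page fragment with one script-generated form field.
var page =
  '<form action="/next" method="post">' +
  "<script>document.write('<input name=step value=' + (1 + 1) + '>');</script>" +
  '<input type="submit">' +
  '</form>';

var staticPage = spliceScriptOutput(page);
```

The resulting static HTML is what you would then hand to WWW::Mechanize as if it had come from the server.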
Actually, what you want is a complete automated browser. I hear IE can be controlled via OLE or something like that, but I don't know how well that works. I'm not familiar with any automation options for Mozilla.
updated: fixed some typos
Re^2: more screen scraping with embedded Javascript
by geektron (Curate) on Oct 25, 2004 at 22:21 UTC