Re: Mechanize, Javascript, Cookies, and you!
by Marshall (Canon) on Jun 21, 2011 at 02:33 UTC
You may need WWW::Mechanize::Firefox. Implementing a JavaScript engine is hard, so the idea behind Mechanize-Firefox is to control Firefox and have it do that part of the job. A cool idea.
| [reply] |
Checking into this -- this looks like it would be right up my alley. I will read through it, but something I'm wondering: does it require X? This will eventually run on a remote machine w/o an X-server. :)
Anyway, cheers -- got something to explore for a while, that's for sure.
| [reply] |
When I started playing with the MozRepl interface, I logged on with PuTTY. Firefox has to be running, but I don't think you will need to "watch" the screen. Read the docs, install the Firefox add-on, then play a bit to see how it works. This interface is what Mechanize-Firefox talks to. You might want to google MozRepl also. Have fun!
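For what it's worth, here is a minimal sketch of what talking to Firefox through that interface might look like from Perl. It assumes WWW::Mechanize::Firefox is installed and that Firefox is already running with the MozRepl add-on started. The helper name `rendered_page` and the URL passed on the command line are made up for illustration.

```perl
use strict;
use warnings;
use WWW::Mechanize::Firefox;

# Hypothetical helper: load a URL in the running Firefox and return what
# the browser ended up with after its JavaScript has run.
sub rendered_page {
    my ($url) = @_;

    # Connects to the MozRepl add-on of an already running Firefox.
    my $mech = WWW::Mechanize::Firefox->new();
    $mech->get($url);

    # eval_in_page runs JavaScript inside the page and returns (value, type).
    my ($cookies) = $mech->eval_in_page('document.cookie');

    return {
        title   => $mech->title,
        html    => $mech->content,
        cookies => $cookies,
    };
}

# Only touch the browser when a URL is given on the command line.
if (@ARGV) {
    my $page = rendered_page($ARGV[0]);
    printf "%s (%d characters of HTML)\n", $page->{title}, length $page->{html};
}
```

Run it with a URL as the only argument. Since Firefox does the rendering, the HTML you get back is the page after its scripts have executed, which is the part plain Mechanize cannot do.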
| [reply] |
Re: Mechanize, Javascript, Cookies, and you!
by Anonymous Monk on Jun 21, 2011 at 02:53 UTC
| [reply] |
Re: Mechanize, Javascript, Cookies, and you!
by aquarium (Curate) on Jun 21, 2011 at 04:25 UTC
Maybe JMeter could do it, or something like server-side JavaScript such as Rhino... but it seems you're heading down a slippery slope anyway. Make sure you set up your HTTP request headers, and handle response codes, so that you look like a browser. As per my slippery-slope observation, these kinds of systems are also usually rigged with a few random checks, with a CAPTCHA showing up. If someone is watching and doesn't appreciate your automation attempts, they may start popping up more CAPTCHAs based on your IP address etc., or pop them up incessantly if they're really against this... and thus defeat all your coding powers at the press of a button.
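A rough sketch of the header part, assuming plain WWW::Mechanize is in use. The helper name `browser_like_mech` and the particular Accept values are only illustrative, not anything the target site is known to check.

```perl
use strict;
use warnings;
use WWW::Mechanize;

# Hypothetical helper: a Mechanize object whose requests carry
# browser-like headers.
sub browser_like_mech {
    my $mech = WWW::Mechanize->new(
        autocheck => 0,    # look at status codes ourselves instead of dying
    );

    # 'Windows Mozilla' is one of Mechanize's built-in user-agent aliases.
    $mech->agent_alias('Windows Mozilla');

    # Extra headers sent with every request; the values are illustrative.
    $mech->add_header(
        'Accept'          => 'text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8',
        'Accept-Language' => 'en-US,en;q=0.5',
    );

    return $mech;
}

my $mech = browser_like_mech();
print $mech->agent, "\n";
```

After a `$mech->get(...)`, `$mech->success` and `$mech->status` tell you how the server answered, so an unexpected code can be handled rather than ignored. None of this helps once a CAPTCHA shows up, of course.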
the hardest line to type correctly is: stty erase ^H
| [reply] |
Indeed, advice to be considered. All in all, yes, I think they are against it, but they're not going to risk pissing off everyone. Without giving away too much: as my scraper translates for those who would otherwise not be customers, it's in their best interest to allow it. However, they've also gotta think about malicious scrapers. I'll bet anyone a shiny quarter the beefed-up security is in response to Sony getting sodomized recently.
| [reply] |