Earlier I discussed the pros and cons of OLE + IE to do web things, vs.
Mechanize or
LWP.
Screen scraping is horrible, and clearly not the Right Way to access web resources. But sometimes screen scraping
is necessary to get things done.
For serious screen scraping (yuck), I'm more convinced OLE + IE is the way to go. Often the pages I need to access involve Javascript, pop-up confirmation menus, file selection windows, and file download windows.
To my understanding, LWP and its derivitatives can't handle this sort of complexity. And to my understanding, samie doesn't yet handle popup dialog screens, upload screens,
or file downloads.
In my opinion, there's a need for a solid Win32 app to drive IE in a serious way, to allow access to complex web environments involving non-vanilla pages. I asked a few months ago, but I'll toss out the question again:
Does anyone know of a robust full-featured module for driving IE?
Thanks for suggestions --
Posts are HTML formatted. Put <p> </p> tags around your paragraphs. Put <code> </code> tags around your code and data!
Titles consisting of a single word are discouraged, and in most cases are disallowed outright.
Read Where should I post X? if you're not absolutely sure you're posting in the right place.
Please read these before you post! —
Posts may use any of the Perl Monks Approved HTML tags:
- a, abbr, b, big, blockquote, br, caption, center, col, colgroup, dd, del, details, div, dl, dt, em, font, h1, h2, h3, h4, h5, h6, hr, i, ins, li, ol, p, pre, readmore, small, span, spoiler, strike, strong, sub, summary, sup, table, tbody, td, tfoot, th, thead, tr, tt, u, ul, wbr
You may need to use entities for some characters, as follows. (Exception: Within code tags, you can put the characters literally.)
| |
For: |
|
Use: |
| & | | & |
| < | | < |
| > | | > |
| [ | | [ |
| ] | | ] |
Link using PerlMonks shortcuts! What shortcuts can I use for linking?
See Writeup Formatting Tips and other pages linked from there for more info.