No such thing as a small change | |
PerlMonks |
Re: Re:x2 Scraping HTML: orthodoxy and realityby mojotoad (Monsignor) |
on Jul 08, 2003 at 19:54 UTC ( [id://272426]=note: print w/replies, xml ) | Need Help?? |
Here's a quick example, just to give you an idea. I apologize for the crufty code.
This solution is still vulnerable to layout changes from the printer manufacturer. I really don't like having to use depth and count with HTML::TableExtract because of this reason -- if the HTML tables had some nice, labeled columns it would be another story entirely. With that in mind you may well be better off with your solution in the long run, though I daresay the regexp solution might be more difficult to maintain. HTML::TableExtract is a subclass of HTML::Parser, in case you were unaware. I'm pretty sure HTML::Parser slows things down compared to your solution, but I'm curious to what degree.
Enjoy,
In Section
Meditations
|
|