in reply to Re: A grammar for HTML matching
in thread A grammar for HTML matching
Obviously, if you want to grab lots of different data out of the HTML file, my suggested approach would be inappropriate, and HTML::Parser would be better. But say you just want to write a quick script to grab the temperature in Seattle, WA and put it on your home page. Or say you want to mine weather.com for the temperature in various cities, each and each temperature appears on a different page. HTML::Parser on each page would be painfully slow.
But this debate over the speed of HTML::Parser has gotten away from my original intent. This HTML-matching idea could be implemented using HTML::Parser, and then provide an easier, faster to write, and more readable mechanism for matching HTML than writing a new HTML::Parser script. And I submit this idea to all of you...is this interesting and/or useful?
BTW, for FilterProxy I use index/rindex to traverse the document and find the interesting portion, then using pos() and m/\G/g which is much faster than a single regex for the entire document. But again implementation is not what I'm interested in at this point.
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
RE: RE: Re: A grammar for HTML matching
by little (Curate) on Nov 01, 2000 at 12:52 UTC | |
by Anonymous Monk on Nov 02, 2000 at 01:44 UTC |