in reply to RE: A grammar for HTML matching
in thread A grammar for HTML matching
Well, both. The idea is that I only care about a small part of the total document. I don't want to have to examine all the irrelevant parts of the document just to get to the part I'm interested in. The benefits of this are speed and invariance to document layout. If you know the summary for the book follows <p>Summary you can ignore the rest of the document. I want to respect document structure within the segment I'm interested in, but disregard the rest.
HTML::TreeBuilder is a subclass of HTML::Parser, and while this idea could be implemented using it, the idea is that it doesn't have to be.
The applications I have in mind for this are:
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
RE: RE: RE: A grammar for HTML matching
by dchetlin (Friar) on Nov 01, 2000 at 10:48 UTC | |
by mcelrath (Novice) on Nov 01, 2000 at 11:48 UTC |