The first thing I want is:
The text between the 3rd <p> and </p> tags (not the 1st or 2nd).
This is to extract the article title.
The 2nd thing I want is:
Everything in the page past this:
<b>Notes:</b>
This is to extract the notes.
The 3rd thing I want is:
The text between the 5th <p> and </p> tags, but only if that text begins with "by" (as in "by Larry Wall").
This is to extract the author line.
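For illustration, here is a minimal sketch of the three extractions described above, written in Python with only the standard library's html.parser module. It is a rough outline under some assumptions, not a definitive solution: it assumes every <p> has a matching </p>, that paragraphs are not nested, and that the marker <b>Notes:</b> appears in the page exactly as written. The names ParagraphCollector, NOTES_MARKER and extract_fields are made up for this example.

```python
from html.parser import HTMLParser


class ParagraphCollector(HTMLParser):
    """Collect the text of each <p>...</p> pair, in document order."""

    def __init__(self):
        super().__init__()
        self.paragraphs = []   # finished paragraph texts
        self._buffer = None    # text chunks while inside a <p>, otherwise None

    def handle_starttag(self, tag, attrs):
        if tag == "p":
            self._buffer = []

    def handle_endtag(self, tag):
        if tag == "p" and self._buffer is not None:
            self.paragraphs.append("".join(self._buffer).strip())
            self._buffer = None

    def handle_data(self, data):
        # Text inside inline tags such as <b> within a paragraph is kept too.
        if self._buffer is not None:
            self._buffer.append(data)


# Assumption: the marker appears verbatim, with this exact case and spacing.
NOTES_MARKER = "<b>Notes:</b>"


def extract_fields(page):
    """Return (title, author, notes); each is None when it cannot be found."""
    collector = ParagraphCollector()
    collector.feed(page)
    collector.close()
    paras = collector.paragraphs

    # 1. Title: text of the 3rd paragraph (index 2).
    title = paras[2] if len(paras) >= 3 else None

    # 3. Author: text of the 5th paragraph, only if it begins with "by".
    author = None
    if len(paras) >= 5 and paras[4].startswith("by"):
        author = paras[4]

    # 2. Notes: the raw remainder of the page after the marker.
    notes = None
    pos = page.find(NOTES_MARKER)
    if pos != -1:
        notes = page[pos + len(NOTES_MARKER):]

    return title, author, notes
```

Note that the notes come back as raw HTML (whatever follows the marker), while the title and author come back as plain text with any inline tags stripped. If the real pages leave <p> tags unclosed, the paragraph counting here would need to be adjusted.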
In reply to Very specific HTML parsing question by russmann