in reply to Subdir globs
I'd start with one of the available search engine/site crawlers and modify it as I go. Basically, you're looking for any link, so search the returned code of the front page for "<a href=" and work from there to find new pages to search. Have it check each link for the extension you're looking for. In this case, when your code pulls out the URL of the link (however you have it do that), check it for the extension:
# escape the dot and anchor at the end, so "mytxtfile.html" doesn't match
if (/\.txt$/i) { ...
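To make that a bit more concrete, here's a rough sketch of the link-sorting step on its own. It assumes you already have the page source in a string. The `sort_links` name and the sample HTML are made up for illustration, and the href regex is deliberately crude (a real HTML parser would be more robust, and URLs with query strings would need extra handling):

```perl
use strict;
use warnings;

# Hypothetical helper: pull href values out of an HTML string and split
# them into links with the wanted extension and links still to be crawled.
sub sort_links {
    my ($html, $ext) = @_;
    my (@wanted, @to_crawl);

    # Crude pattern: matches href="..." or href='...' inside an <a ...> tag.
    while ($html =~ /<a\s[^>]*?href\s*=\s*(["'])(.*?)\1/gis) {
        my $url = $2;
        if ($url =~ /\.\Q$ext\E$/i) {
            push @wanted, $url;      # has the extension we're after
        } else {
            push @to_crawl, $url;    # candidate page to search next
        }
    }
    return (\@wanted, \@to_crawl);
}

# Made-up sample page, just to exercise the helper.
my $page = '<a href="notes.TXT">notes</a> <a href="sub/">sub</a> <a href="a.txt.bak">bak</a>';
my ($wanted, $to_crawl) = sort_links($page, 'txt');
print "wanted: @$wanted\n";
print "crawl:  @$to_crawl\n";
```

Whatever ends up in the second list is what you'd feed back into the crawler to find more pages.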