in reply to Parsing Framed Web Pages

What you have is the underlying frameset page. You should be able to parse its HTML for the URLs of the actual content pages that are loaded into the frames, then fetch those pages directly. The frameset will look something like the following. You want the src= attribute inside each <frame> tag; if a src value is relative, resolve it against the URL of the frameset page before fetching.

<frameset frameborder="0" framespacing="0" border="0" rows="120,*">
  <frame src="menu.html" name="topmenu" target="body" scrolling="no" noresize>
  <frame src="main.html" name="body" scrolling="auto">
</frameset>
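As a rough illustration of the idea, here is a minimal Python sketch that pulls the src= values out of the <frame> tags and turns them into absolute URLs. It uses only the standard library (html.parser and urllib.parse). The names FrameSrcParser and frame_urls, and the example.com base URL, are made up for this example; substitute the address you actually fetched the frameset page from.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class FrameSrcParser(HTMLParser):
    """Collect the src= attribute of every <frame> tag (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.sources = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names before calling this.
        if tag == "frame":
            src = dict(attrs).get("src")
            if src:
                self.sources.append(src)


def frame_urls(frameset_html, base_url):
    """Return absolute URLs for the pages loaded into the frames."""
    parser = FrameSrcParser()
    parser.feed(frameset_html)
    # Relative values such as "menu.html" are resolved against the frameset URL.
    return [urljoin(base_url, src) for src in parser.sources]


# Sample frameset modelled on the one above; the base URL is an assumption.
SAMPLE = """<frameset rows="120,*">
<frame src="menu.html" name="topmenu" scrolling="no" noresize>
<frame src="main.html" name="body" scrolling="auto">
</frameset>"""

if __name__ == "__main__":
    for url in frame_urls(SAMPLE, "http://example.com/site/index.html"):
        print(url)
```

Each URL returned can then be fetched and parsed like any ordinary page. If one of those pages turns out to be another frameset, the same step can be repeated on it.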

Re^2: Parsing Framed Web Pages
by CaMelRyder (Pilgrim) on Aug 12, 2008 at 14:38 UTC
    Nice. When I read your post, I was hit with that "why didn't I think of that" feeling. I'm at work right now and can't try it, but I'm sure that is exactly the insight I needed.
    ¥peace from CaMelRyder¥