in reply to can't get www::mechanize to work on a web site

When I click on the title of the song in the search results page, I got the following error message:

Error InterScan HTTP Version 3.8-Build_1080 $Date: 01/31/2003 16:12:0037$ Connecting to display.lyrics.astraweb.com: Connection refused


May be that explains why your robot can't follow the link?

Replies are listed 'Best First'.
Re: Re: suck on www::mechanize question
by smackdab (Pilgrim) on Mar 01, 2004 at 05:44 UTC
    strange, this part works manually for me: (overall it still doesn't grab the lyrics though)...

    http://display.lyrics.astraweb.com:2000/display.cgi?beatles..beatles_1..hey_jude

    So you got the search results to be correct then?
      My company has a firewall running, may be that has something to do with forbidden access. I will try again when I go home later on my own ISP and see if I get the same error. I am guessing that you might be looking for some regex to extract the song lyrics? ... If so, could you post some HTML on your notepad and state which part you want to get extracted?

        I don't even have that problem yet ;-)

        There are 2 parts to get the lyrics for a song:
        1) Search for the one/multi matches on the song title
        which returns a list of possible matches
        2) Find the link that matches the song title
        and follow that link to the lyrics page
        3) Parse and save the lyrics

        Step 1 works, but 2 fails ;-(