Scott_J has asked for the wisdom of the Perl Monks concerning the following question:
I am currently having this problem. I have a section of HTML that looks like the following....
Go to the BBC's website <a href=http://www.bbc.co.uk">BBC</a> or you can visit the inland revenues pages at <a href="http://www.inlandrevenue.com">Inland Revenue</a> which will give you the information you need
From this code I need to extract out the data from the anchor tags so that the 'http://www.bbc.co.uk' part is in one variable, 'www.bbc.co.uk' (I understand that I can just strip the first part to get this) is in another and the 'BBC' part is also in a variable. I need to try this for each web link/anchor tag that I come across.
I have looked at the HTML toke parser module but I'm very new to Perl and it doesn't make much sense to me.
Thankyou
|
|---|
| Replies are listed 'Best First'. | |
|---|---|
|
Re: Extracting href's
by valdez (Monsignor) on Jan 10, 2003 at 13:04 UTC | |
by Scott_J (Initiate) on Jan 10, 2003 at 13:47 UTC | |
|
Re: Extracting href's
by andye (Curate) on Jan 10, 2003 at 12:26 UTC | |
by newrisedesigns (Curate) on Jan 10, 2003 at 20:17 UTC | |
by Anonymous Monk on Jan 10, 2003 at 12:52 UTC | |
|
Re: Extracting href's
by vek (Prior) on Jan 10, 2003 at 15:20 UTC | |
|
Re: Extracting href's
by jdporter (Paladin) on Jan 10, 2003 at 22:08 UTC |