Beefy Boxes and Bandwidth Generously Provided by pair Networks
good chemistry is complicated,
and a little bit messy -LW
 
PerlMonks  

Re^2: regex help or pointer to module needed

by Xxaxx (Monk)
on Jun 08, 2004 at 08:01 UTC ( [id://362234]=note: print w/replies, xml ) Need Help??


in reply to Re: regex help or pointer to module needed
in thread regex help or pointer to module needed

Good suggestion.

Unfortunately I believe it will match on

<a href="page.html">Link Text</a>
I tried expanding this prior to seeking help here with something like:
m[ < ( [^>\s]+ ) > .*? </ (?! \1> ) ]x
I hoped the no-space condition would solve things. Unfortunately eBay and Amazon send emails that were caught.

Still all in all I think this expression along with a white list may be the direction I go for speed.

Good suggestion.

Log In?
Username:
Password:

What's my password?
Create A New User
Domain Nodelet?
Node Status?
node history
Node Type: note [id://362234]
help
Chatterbox?
and the web crawler heard nothing...

How do I use this?Last hourOther CB clients
Other Users?
Others having an uproarious good time at the Monastery: (2)
As of 2024-04-26 02:28 GMT
Sections?
Information?
Find Nodes?
Leftovers?
    Voting Booth?

    No recent polls found