I am processing urls but dont want to process a url called:
<A HREF="http://www.cnn.com/WEATHER/index.html">
You haven't explained why this URL is undesireable. What criteria are you using to choose whether (sorry, pun intended) or not to retain a URL? Describe that to us and we can give you a better answer to 'How'.