PerlMonks  

Re: Re: Re: Matching a question in text

by bastard (Hermit)
on Jun 26, 2001 at 01:55 UTC ( [id://91459]=note )


in reply to Re: Re: Matching a question in text
in thread Matching a question in text

Search engines do this.
It was either htdig or swish-e that had a file containing such "noise words". Just use that. (I think swish-e had them in its source code.)

You can find links to them here:
http://www.searchtools.com/
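
For what it's worth, here is a rough sketch of what filtering against a noise-word list could look like once you have one. Everything named here is an assumption for illustration: the tiny `NOISE_WORDS` set is made up, `strip_noise_words` and `load_noise_words` are hypothetical helpers, and the one-word-per-line file format is a guess, not the format either engine actually uses.

```python
import re

# Hypothetical, deliberately tiny noise-word list. A real list would be
# much longer and would come from a file (see load_noise_words below).
NOISE_WORDS = {
    "a", "an", "the", "is", "are", "do", "does",
    "how", "what", "i", "in", "of", "to",
}


def load_noise_words(path):
    """Read noise words from a text file.

    Assumed format (not taken from any particular search engine):
    one word per line, blank lines and lines starting with '#' ignored.
    """
    words = set()
    with open(path, encoding="utf-8") as handle:
        for line in handle:
            line = line.strip().lower()
            if line and not line.startswith("#"):
                words.add(line)
    return words


def strip_noise_words(text, noise_words=NOISE_WORDS):
    """Return the lowercased words of text that are not noise words."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    return [word for word in words if word not in noise_words]


if __name__ == "__main__":
    print(strip_noise_words("How do I match a question in text?"))
```

What is left after stripping are the words worth matching against your text. Swapping the built-in set for `load_noise_words(path)` lets you reuse whatever list you pull out of one of the engines, provided you convert it to that one-word-per-line layout first.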

On another note, the source is available for a lot of the search engines on that page. Code examples for things like fuzzy search and context searching might be available there.
