in reply to Searching through text files

swish-e is a great search tool. It even has a nice Swish::API in perl. apache.org use it for instance to search their site. It will index and search any text file and there are plugins for all sorts of formats to convert them to text for indexing. Index searches are the way to go rather than grepping every file every time you search. With swish-e the core code is C, it handles stemming and indexing for you and you get a nice stable solution with a solid XS API.

cheers

tachyon

Replies are listed 'Best First'.
Re: Re: Searching through text files
by amw1 (Friar) on Mar 23, 2004 at 18:20 UTC
    I didn't even know that swish was still around. I used it quite a bit 5 or 6 years ago to add site search functionality for some websites I was working on. We eventually moved to verity due to the ammount of text we eventually had to search but I really dug swish.

      Alive an kicking (ass). The Swish::API XS interface makes it totally accessible from perl with no forking code system calls to get at it. We use the current version to search and index some quite websites and it runs search+custom highlight times in the order of several milliseconds. No stability issues AFAIK.

      cheers

      tachyon