in reply to Perl Search Applicance
As a reference point, I once built a search engine combining Apache/mod_perl, MySQL and Glimpse. Working alone, it took me around 4 months to complete. It indexed all of the Open Directory Project and served most queries in under a second running on a PII/600. The search result format was actually more complicated than Google's: it included the category hierarchy and had advanced tree-limiting features.
The project was generally successful. However, it never came close to providing something comparable to Google. Why not? The search results sucked, to put it mildly. All it did was a simple partial-word match. Glimpse supported more, but the more advanced features were too slow to use. Also, the indexing was really, really slow. It would never scale to indexing the entire Internet, no matter how much hardware you put behind it. As it was, it took around 6 hours to index the Open Directory database (although much of that time went to character-set translation).
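To make "simple partial-word match" concrete, here is a minimal sketch in Python. It is not the original mod_perl/Glimpse code; the tiny corpus and the function name `partial_word_match` are made up for illustration. It shows why this kind of matching gives poor results: every hit counts equally and unrelated words match by accident.

```python
# Hypothetical mini-corpus; the titles are invented for this example.
DOCS = {
    1: "Perl search engine tutorial",
    2: "Researching pearl diving",
    3: "Searching the Open Directory",
}


def partial_word_match(query, docs):
    """Return ids of documents where the query is a substring of any word.

    No ranking, stemming, or phrase handling is done, so all hits are
    treated as equally relevant.
    """
    needle = query.lower()
    return [
        doc_id
        for doc_id, text in docs.items()
        if any(needle in word for word in text.lower().split())
    ]


# "search" also matches "Researching", which has nothing to do with search.
hits = partial_word_match("search", DOCS)
```

With nothing to order the hits by, the accidental match on "Researching" is returned alongside the genuinely relevant documents.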
So, in short, be very careful about what you attempt here. If you need Google, buy Google (or one of the competitors like Verity, etc.). If you can make do with much less, then you might build it yourself. But have no illusions about what you'll end up with.
-sam
Replies are listed 'Best First'.
Re: Re: Perl Search Applicance
by PyroX (Pilgrim) on Jun 20, 2002 at 05:08 UTC
by samtregar (Abbot) on Jun 20, 2002 at 05:16 UTC
by PyroX (Pilgrim) on Jun 20, 2002 at 05:22 UTC
Re: Re: Perl Search Applicance
by johnseq (Initiate) on Jun 21, 2002 at 01:45 UTC