Like it or not Apache Lucene has become the de-facto standard for open-source high-performance search. It has a large user-base, is well documented and has plenty of committers. Unfortunately Apache Lucene is entirely written in Java and therefore of relatively little use for perl programmers. Fortunately in the recent years a group of C++ programmers led by Ben van Klinken decided to port Java Lucene to C++.

The purpose of the Lucene CPAN module is to export the C++ Lucene API to perl and at the same time be as close as possible to the original Java API. This has the combined advantage of providing perl programmers with a well-documented API and giving them access to a C++ search engine library that is supposedly faster than the original.

http://search.cpan.org/~tbusch/Lucene-0.01/lib/Lucene.pm

  • Comment on Release of Lucene-0.01 -- API to the C++ port of the Lucene search engine

Replies are listed 'Best First'.
Re: Release of Lucene-0.01 -- API to the C++ port of the Lucene search engine
by perrin (Chancellor) on Aug 21, 2006 at 15:16 UTC
    Those looking for a lighter weight and easy to set up search engine should consider SWISH-E. It doesn't have the same features (no Unicode), but is very fast at searching and indexing, and has actively maintained Perl APIs on CPAN.