vit has asked for the wisdom of the Perl Monks concerning the following question:

Dear Monks,
Does anybody have experience with creating index with Plucene?
What I need is to create index files for words and documents which can refer each other. Something which can be used in TF-IDF like methods.
If you have such experience, how long approximately it will take to index 10000 docs with 500 words each.

Replies are listed 'Best First'.
Re: Creating Index with Plucene
by Anonymous Monk on Jun 16, 2009 at 19:01 UTC
    Plucene::Simple

    If you have such experience, how long approximately it will take to index 10000 docs with 500 words each.

    It could manage it in under an half hour.

      I installed it and ran the following piece of code
      use strict; use Plucene::Simple; ###### INPUT ####################### my $index_path = "./idexdir"; #################################### # create an index my $plucy = Plucene::Simple->open($index_path); $plucy->index_document(doc1 => 'Go to restaurant'); $plucy->index_document(doc2 => 'Return money'); exit(0);
      and got a set of files like _1.f1 .... which I do not understand.
      I expected to get an index file which I could use for my purposes.
      What I did wrong? or this index can only be used by Plucene search.