in reply to Creating Index with Plucene

Plucene::Simple

If you have such experience, approximately how long will it take to index 10000 docs with 500 words each?

It could manage it in under half an hour.

Re^2: Creating Index with Plucene
by vit (Friar) on Jun 16, 2009 at 22:54 UTC
    I installed it and ran the following piece of code
    use strict;
    use Plucene::Simple;

    ###### INPUT #######################
    my $index_path = "./idexdir";
    ####################################

    # create an index
    my $plucy = Plucene::Simple->open($index_path);
    $plucy->index_document(doc1 => 'Go to restaurant');
    $plucy->index_document(doc2 => 'Return money');
    exit(0);
    and got a set of files like _1.f1 .... which I do not understand.
    I expected to get an index file which I could use for my purposes.
    What did I do wrong? Or can this index only be used by Plucene search?
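For reference, a minimal sketch of querying the index through Plucene::Simple itself rather than reading the files directly. It assumes Plucene::Simple is installed, that the same "./idexdir" directory is used, and that search() returns the ids of the matching documents; the query word 'restaurant' is only an illustration.

```perl
use strict;
use warnings;
use Plucene::Simple;

# Same index directory as in the post above.
my $index_path = "./idexdir";

# Open the index and add the two sample documents.
my $plucy = Plucene::Simple->open($index_path);
$plucy->index_document(doc1 => 'Go to restaurant');
$plucy->index_document(doc2 => 'Return money');

# Ask the index which document ids match the query string.
my @ids = $plucy->search('restaurant');
print "matched: @ids\n";
```

The files such as _1.f1 are the on-disk form of the index, so they are normally consumed through the module's own search call rather than opened by hand.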
        What am I doing wrong here:
        use strict;
        use Plucene::Simple;
        use Plucene::Index::Reader;

        ###### INPUT #######################
        my $index_path = "./idexdir";
        ####################################

        # create an index
        my $plucy = Plucene::Simple->open($index_path);
        $plucy->index_document(doc1 => 'Go to restaurant');
        $plucy->index_document(doc2 => 'Return money');

        my $reader = Plucene::Index::Reader->new($index_path);
        my Plucene::Index::SegmentsReader $seg_read = $reader->open;
        print "$seg_read\n";
        exit(0);
        It returns
        Use of uninitialized value in concatenation (.) or string at c:/ActivePerl/site/lib/Plucene/Index/SegmentInfos.pm line 57.
        read_file '/segments' - sysopen: No such file or directory at c:/ActivePerl/site/lib/Plucene/Index/SegmentInfos.pm line 57
        In SegmentInfos.pm there is a statement
        my ($count, @unpack) = unpack "NN/(w/aN)", read_file("$directory/segments");
        which I do not understand. What is "segments" here?
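The statement reads a file named "segments" from the index directory and decodes it with unpack. The path '/segments' in the error message suggests that $directory was empty when the statement ran. It may be that open is meant to be called as a class method with the directory, as in Plucene::Index::Reader->open($index_path), but that is an assumption to check against the module's POD. The template itself can be tried out in plain Perl. The sketch below uses made-up sample data (the segment names, document counts, and counter value are hypothetical) and assumes the file holds a 32-bit number, a 32-bit segment count, and then a name plus a 32-bit number for each segment, which is what the template describes.

```perl
use strict;
use warnings;

# Hypothetical sample data: two segments, each a [name, document count] pair.
my @segments = (['_1', 2], ['_2', 5]);
my $counter  = 3;    # made-up value for the first 32-bit field

# Build bytes in the layout the template describes:
# N = 32-bit big-endian integer, w/a* = BER-encoded length followed by the string.
my $template = 'NN' . ('w/a*N' x @segments);
my $bytes    = pack $template, $counter, scalar(@segments), map { @$_ } @segments;

# Same template as in SegmentInfos.pm: after the first N, "N/(...)" reads a
# count and then repeats the group (name, number) that many times.
my ($count, @unpack) = unpack 'NN/(w/aN)', $bytes;

print "first field: $count\n";

# The remaining values come back as a flat list of name/number pairs.
my %doc_count;
while (my ($name, $docs) = splice @unpack, 0, 2) {
    $doc_count{$name} = $docs;
    print "segment $name: $docs\n";
}
```

The segment count is consumed by the "N/" part of the template and is not returned, so the list that comes back holds only the first number followed by the name/number pairs.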