2007-11-05

[zz] Lucene goodness

Keywords: lucene

Lots of good things are happening in Lucene land lately, all of which should benefit users through faster indexing and searching. Most notably, Lucene 2.3 (hopefully released this quarter) has some major changes in indexing memory management and performance. I have personally clocked indexing going from about 400 records/s on release 2.2 (single-threaded, Mac Pro dual CPU/dual core, using the contrib/benchmark indexing.alg) to over 2,100 records/s on 2.3-dev (the latest trunk). It also makes the indexing process easier to control: you specify how much memory to give it, instead of tuning the confusing maxBufferedDocs factor.
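To illustrate why a memory budget is easier to reason about than a document count, here is a minimal toy sketch. It is not Lucene code: the class `FlushPolicyDemo`, its method names, and the document sizes are all hypothetical, and the model simply assumes a buffer that is emptied ("flushed") whenever its limit is reached.

```java
import java.util.Arrays;
import java.util.List;

// Toy model only, NOT Lucene's implementation. It contrasts two ways of
// deciding when an in-memory indexing buffer should be flushed to disk.
class FlushPolicyDemo {

    // Flush after a fixed number of buffered documents (a maxBufferedDocs-style
    // limit). Returns the largest number of bytes ever held in the buffer.
    static long peakBytesWithDocCountLimit(List<Integer> docSizes, int maxBufferedDocs) {
        long buffered = 0;
        long peak = 0;
        int docs = 0;
        for (int size : docSizes) {
            buffered += size;
            docs++;
            peak = Math.max(peak, buffered);
            if (docs >= maxBufferedDocs) { // limit is on count, not on size
                buffered = 0;
                docs = 0;
            }
        }
        return peak;
    }

    // Flush as soon as the buffer reaches a byte budget (a RAM-based limit).
    // Returns the largest number of bytes ever held in the buffer.
    static long peakBytesWithRamLimit(List<Integer> docSizes, long ramBudgetBytes) {
        long buffered = 0;
        long peak = 0;
        for (int size : docSizes) {
            buffered += size;
            peak = Math.max(peak, buffered);
            if (buffered >= ramBudgetBytes) { // limit is on size, not on count
                buffered = 0;
            }
        }
        return peak;
    }

    public static void main(String[] args) {
        // Hypothetical corpora: four small documents and four large ones (sizes in bytes).
        List<Integer> small = Arrays.asList(1_000, 1_000, 1_000, 1_000);
        List<Integer> large = Arrays.asList(500_000, 500_000, 500_000, 500_000);

        System.out.println("doc-count limit 2, small docs: " + peakBytesWithDocCountLimit(small, 2));
        System.out.println("doc-count limit 2, large docs: " + peakBytesWithDocCountLimit(large, 2));
        System.out.println("RAM budget 500000, small docs: " + peakBytesWithRamLimit(small, 500_000));
        System.out.println("RAM budget 500000, large docs: " + peakBytesWithRamLimit(large, 500_000));
    }
}
```

In this toy model the same document-count limit holds 2,000 bytes for the small corpus but 1,000,000 bytes for the large one, while the byte budget keeps the peak at or below the budget plus one document regardless of document size. That unpredictability is the kind of thing that makes a count-based knob hard to tune.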

Other work being undertaken should speed up reopening IndexReaders. There are also a number of smaller changes, including a faster StandardTokenizer (the tokenizer most people use) and faster term vector access.

Of course, with that comes more testing and a greater need to make sure the next release is rock solid and backwards compatible. So, if you are a Lucene user, I would encourage you to give trunk a try on some of your non-production indexes and help us test it out.

 

link from http://lucene.grantingersoll.com/2007/11/02/lucene-goodness/

Comments
grantbb 2007-11-26
Nice, something to look forward to.
licco1 2007-11-26
Great!