Found this interesting utility today: http://code.google.com/p/luke/
Zend Lucene is a great library to add search engine capability to your site, but it consumes so much memory that you may need sometimes to turn off the index optimiser. In which case you'll see the number of index files increasing dramatically.
And you may end up (as I did) being unable to index any more, or optimise the index, as number of index files the application needs to open is too high.
Quick fix is to run Luke on your index, and optimise it (Luke uses Java version of Lucene + Hadoop library) - et voila!
Zend Lucene is a great library to add search engine capability to your site, but it consumes so much memory that you may need sometimes to turn off the index optimiser. In which case you'll see the number of index files increasing dramatically.
And you may end up (as I did) being unable to index any more, or optimise the index, as number of index files the application needs to open is too high.
Quick fix is to run Luke on your index, and optimise it (Luke uses Java version of Lucene + Hadoop library) - et voila!