The current Apache Lucene Java version is 3.6, released in April of 2012. We’ve updated the Lucene 3 tutorial and the accompanying source code to bring it in line with the current API so that it doesn’t use any deprecated methods and my, there are a lot of them. Bob blogged about this tutorial back in February 2011, shortly after Lucene Java rolled over to version 3.0.
Like other 3.x minor releases, Lucene 3.6 introduces performance enhancements, bug fixes, new analyzers, and changes that bring the Lucene API in line with Solr. In addition, Lucene 3.6 anticipates Lucene 4, billed as “the next major backwards-incompatible release.”
Significant changes since version 3.0
IndexReaderdelete methods are deprecated and will be removed entirely in Lucene 4. All deletes and updates are done via anIndexWriter.- There is a single
IndexWriterconstructor that takes two arguments: the index directory and anIndexWriterConfigobject. The latter was introduced in Lucene 3.1. It holds configuration information that was previously specified directly as additional arguments to the constructor. IndexWriteroptimize methods are deprecated. The merge method(s) supply this functionality.
Building the Source
The ant build file is in the file src/applucene/build.xml and should be run from that directory. The book’s distribution is organized this way so that each chapter’s demo code is roughly standalone, but they are able to share libs. There are some minor dependencies on LingPipe in the example (jar included), but those are just for I/O and could be easily removed or replicated. As an added bonus, the source code now includes the data used in the examples throughout the tutorial, the venerable Federalist Papers from Project Gutenberg.
July 6, 2012 at 1:18 pm |
Lucene4.0 aphla has been released. looking forwards to a new tutorial.
July 24, 2012 at 1:07 pm |
[...] Lucene 3.6 Tutorial [...]
March 8, 2013 at 11:42 am |
[...] sich weitergehend mit dem Thema beschäftigen möchte, dem sei dieses umfassende Tutorial ans Herz [...]