Update (24 July 2012) The tutorial has been updated for Lucene 3.6. See:
With this release of the LingPipe Book, I created a standalone version of the tutorial for version 3 of the Apache Lucene search library.
It contains about 20 pages covering the basics of analysis, indexing and search. It’s distributed with sample code and an Ant build file with targets to run the demos.
- Lucene Version 3.0 Tutorial (pdf)
- Supporting Code (tar, gzipped)
Building the Source
The ant build file is in the file src/applucene/build.xml and should be run from that directory. The book’s distribution is organized this way so that each chapter’s demo code is roughly standalone, but they are able to share libs. There are some minor dependencies on LingPipe in the example (jar included), but those are just for I/O and could be easily removed or replicated.
More In-Depth Info on Lucene
The standard reference for Lucene is not its own site or javadoc, which are fairly limited tutorial-wise, but rather the recently released (as of February 2011) book by three Lucene committers:
- Michael McCandless, Erik Hatcher, and Otis Gospodnetić. 2010. Lucene in Action, Second Edition. Manning Press.
Looking at the Manning Press page for the book (linked above), I just realized they blurbed one of my previous blog posts, a review of Lucene in Action!
But wait, there’s more
If you’re interested in natural language, or just need a tutorial on character encodings and Java strings and I/O, you can find the rest of the LingPipe book at its home page:
Enjoy. And as always, let me know if you have any comments, here, or directly to carp@lingpipe.com.
March 30, 2011 at 3:57 am |
[...] Tutorial: http://lingpipe-blog.com/2011/02/11/apachelucene-3-0-tutorial/ [...]
May 3, 2011 at 10:10 am |
If you are writing a Tutorial for Lucene 3.0… than you should’nt use deprecated Functions that are pre 3.0 …
If you add code snippets… a project file would be great/nice…
summorized: This Tutorial SUCKS!
THIS WEBSITE WAS A WASTE OF TIME!
May 3, 2011 at 11:54 am |
1. It’s compiling with Lucene 3.0.1. Are there specific deprecated features you’re worried about?
2. The code’s linked from the post above. There’s an Ant build file.
July 6, 2012 at 1:02 pm |
[...] API so that it doesn’t use any deprecated methods and my, there are a lot of them. Bob blogged about this tutorial back in February 2011, shortly after Lucene Java rolled over to version [...]