Intro to IR book online


The following book promises to become the book on information retrieval. Although there’s less on index compression than Witten et al.’s Managing Gigabytes, it is much broader and more up to date. Chapters 13-18 aren’t really IR-specific at all, covering topics such as classification, clustering and latent semantic indexing. That means this is a great place for an introduction to the way LingPipe does all these things, as we’ve followed standard practice in all of our models.

Here’s the reference, with a hotlink:

Christopher D. Manning, Prabhakar Raghavan and Hinrich Schütze. (forthcoming) Introduction to Information Retrieval. Cambridge University Press.

From what I’ve read, the treatments of various topics are better thought out and contain much more practical advice than the corresponding sections in Manning and Schütze’s previous book. I don’t know how they’ve divided up the writing, but Prabhakar Raghavan, the second author, not only works at Yahoo! but is also the editor-in-chief of the Journal of the ACM.

There’s still plenty of time to send the authors feedback and earn a coveted spot in the acknowledgements of a book destined to be widely read.