LingPipe 2.2.1 released

by

We released LingPipe 2.2.1. This is a maintenance release that patches a major bug in the new fast cache and a minor bug in spelling token sensitivity.

The first fix enables the new util.FastCache to handle negative hash codes. I was reminded that not only may hash codes be negative, the remainder operator (%) does not always return a positive value. The first unit tests must’ve coincidentally used strings with only positive hash codes. This has been fixed and tried in a large scale setting now. Why do I say reminded? Well, it’s literally Puzzle Number 1 in Joshua Bloch and Neal Gafter’s mind-bending Java Puzzlers. Many of the other puzzles deal with overflow of integer values.

The second fix takes care of a subtle bug in spelling correction token sensitivity for when the best hypothesis splits a token into two tokens. I was forgetting that the second token had been edited implicitly. There are also two new spelling features: one lets you set a minimum on the size of tokens edited and the second lets you specify a set of do-not-edit tokens.

If you’re not using spelling or the new cache, the new release won’t affect you at all.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s