Check out Chris Manning’s post on scoring NE recognition for applications and my and Hal’s responses at:
http://nlpers.blogspot.com/2006/08/doing-named-entity-recognition-dont.html
The gist of Chris’s response is that F1 “double”-penalizes, so it’s the wrong metric to optmize for applications. I respond that the problem is in trying to do first-best only scoring.
Leave a Reply