<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: Piecewise-Linear or Isotonic Regression for Calibrating Naive Bayes (or Other Predictors)</title>
	<atom:link href="http://lingpipe-blog.com/2010/06/03/piecewise-linear-isotonic-regression-for-calibrating-naive-bayes/feed/" rel="self" type="application/rss+xml" />
	<link>http://lingpipe-blog.com/2010/06/03/piecewise-linear-isotonic-regression-for-calibrating-naive-bayes/</link>
	<description>Natural Language Processing and Text Analytics</description>
	<lastBuildDate>Wed, 08 Feb 2012 17:47:08 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: lingpipe</title>
		<link>http://lingpipe-blog.com/2010/06/03/piecewise-linear-isotonic-regression-for-calibrating-naive-bayes/#comment-7064</link>
		<dc:creator><![CDATA[lingpipe]]></dc:creator>
		<pubDate>Wed, 09 Jun 2010 20:21:06 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe-blog.com/?p=4046#comment-7064</guid>
		<description><![CDATA[Thanks for being persistent -- I hadn&#039;t thought through the multi-class case enough.  The problem is that you want the adjusted category probabilities to sum to 1.  This is actually a problem for the two-class case, too.  You might get raw probabilities of 0.95 and 0.05, where the binned probabilities would be 0.70 and 0.01.  

One approach would be to renormalize the adjusted scores.  This is what everyone does after taking roots to account for some of the correlation.  

I agree that Zadrozny and Elkan&#039;s proposal is very different from what Rennie et al. proposed.]]></description>
		<content:encoded><![CDATA[<p>Thanks for being persistent &#8212; I hadn&#8217;t thought through the multi-class case enough.  The problem is that you want the adjusted category probabilities to sum to 1.  This is actually a problem for the two-class case, too.  You might get raw probabilities of 0.95 and 0.05, where the binned probabilities would be 0.70 and 0.01.  </p>
<p>One approach would be to renormalize the adjusted scores.  This is what everyone does after taking roots to account for some of the correlation.  </p>
<p>I agree that Zadrozny and Elkan&#8217;s proposal is very different from what Rennie et al. proposed.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mathieu</title>
		<link>http://lingpipe-blog.com/2010/06/03/piecewise-linear-isotonic-regression-for-calibrating-naive-bayes/#comment-7062</link>
		<dc:creator><![CDATA[Mathieu]]></dc:creator>
		<pubDate>Wed, 09 Jun 2010 00:08:33 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe-blog.com/?p=4046#comment-7062</guid>
		<description><![CDATA[So what is n(x)? It is the non-calibrated score output by Naive Bayes. In the two-class case, n(x) can be set to the score of class j=1. In the multi-class case, what is n(x)? The score of the most likely class for x? Then I don&#039;t see why, in that case, choosing what bin x falls into solely according to n(x) makes any sense. 

As I see it, the approach of Rennie&#039;s paper is quite different from Zadrozny&#039;s. In the latter, given the predicted P(c&#124;x), the goal was to find a more reliable estimate ˆP(c&#124;x), especially for decision making. In the former, a way to find a better estimate of P(w&#124;c) was proposed but the classification rule didn&#039;t change.]]></description>
		<content:encoded><![CDATA[<p>So what is n(x)? It is the non-calibrated score output by Naive Bayes. In the two-class case, n(x) can be set to the score of class j=1. In the multi-class case, what is n(x)? The score of the most likely class for x? Then I don&#8217;t see why, in that case, choosing what bin x falls into solely according to n(x) makes any sense. </p>
<p>As I see it, the approach of Rennie&#8217;s paper is quite different from Zadrozny&#8217;s. In the latter, given the predicted P(c|x), the goal was to find a more reliable estimate ˆP(c|x), especially for decision making. In the former, a way to find a better estimate of P(w|c) was proposed but the classification rule didn&#8217;t change.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: lingpipe</title>
		<link>http://lingpipe-blog.com/2010/06/03/piecewise-linear-isotonic-regression-for-calibrating-naive-bayes/#comment-7058</link>
		<dc:creator><![CDATA[lingpipe]]></dc:creator>
		<pubDate>Tue, 08 Jun 2010 17:28:17 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe-blog.com/?p=4046#comment-7058</guid>
		<description><![CDATA[I don&#039;t think what they&#039;re doing is that complicated.  They&#039;re taking the basic scores output from naive Bayes, sorting them, and binning them into equal sized bins.  Then they output a score for an example based on which bin it falls into, calculating the score by the percentage of examples in that bin that were correctly classified.  I&#039;m just surprised they didn&#039;t massively overfit by doing this on the training data rather than cross-validated training data (where the examples being evaluated are held out from training).]]></description>
		<content:encoded><![CDATA[<p>I don&#8217;t think what they&#8217;re doing is that complicated.  They&#8217;re taking the basic scores output from naive Bayes, sorting them, and binning them into equal sized bins.  Then they output a score for an example based on which bin it falls into, calculating the score by the percentage of examples in that bin that were correctly classified.  I&#8217;m just surprised they didn&#8217;t massively overfit by doing this on the training data rather than cross-validated training data (where the examples being evaluated are held out from training).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mathieu</title>
		<link>http://lingpipe-blog.com/2010/06/03/piecewise-linear-isotonic-regression-for-calibrating-naive-bayes/#comment-7052</link>
		<dc:creator><![CDATA[Mathieu]]></dc:creator>
		<pubDate>Tue, 08 Jun 2010 11:05:06 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe-blog.com/?p=4046#comment-7052</guid>
		<description><![CDATA[I read Zadrozny and Elkan&#039;s paper, in particular their Naive Bayes calibration method. It wasn&#039;t clear what they meant by the score n(.) but, given that the corrected probability estimate is the &quot;fraction of training examples in the bin that actually belong to class j&quot;,  putting a test example x in a bin according to its score n(x) seems to only make sense in the two-class case. In the experiment section, indeed they&#039;re working with only two classes (j=1 or j=0).]]></description>
		<content:encoded><![CDATA[<p>I read Zadrozny and Elkan&#8217;s paper, in particular their Naive Bayes calibration method. It wasn&#8217;t clear what they meant by the score n(.) but, given that the corrected probability estimate is the &#8220;fraction of training examples in the bin that actually belong to class j&#8221;,  putting a test example x in a bin according to its score n(x) seems to only make sense in the two-class case. In the experiment section, indeed they&#8217;re working with only two classes (j=1 or j=0).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Arun Iyer</title>
		<link>http://lingpipe-blog.com/2010/06/03/piecewise-linear-isotonic-regression-for-calibrating-naive-bayes/#comment-7018</link>
		<dc:creator><![CDATA[Arun Iyer]]></dc:creator>
		<pubDate>Fri, 04 Jun 2010 10:00:24 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe-blog.com/?p=4046#comment-7018</guid>
		<description><![CDATA[One term or two - http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.50.3857]]></description>
		<content:encoded><![CDATA[<p>One term or two &#8211; <a href="http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.50.3857" rel="nofollow">http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.50.3857</a></p>
]]></content:encoded>
	</item>
</channel>
</rss>

