<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: Russ Altman on &#8220;Poisoning Your Prior&#8221;</title>
	<atom:link href="http://lingpipe-blog.com/2009/07/21/russ-altman-on-poisoning-your-prior/feed/" rel="self" type="application/rss+xml" />
	<link>http://lingpipe-blog.com/2009/07/21/russ-altman-on-poisoning-your-prior/</link>
	<description>Natural Language Processing and Text Analytics</description>
	<lastBuildDate>Sat, 04 Feb 2012 20:56:48 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: Clair Branhan</title>
		<link>http://lingpipe-blog.com/2009/07/21/russ-altman-on-poisoning-your-prior/#comment-15975</link>
		<dc:creator><![CDATA[Clair Branhan]]></dc:creator>
		<pubDate>Mon, 03 Oct 2011 18:35:42 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe-blog.com/?p=1871#comment-15975</guid>
		<description><![CDATA[Hi Sandie. So fare i can see, Arbuckle has answered your question. I dont see any sense.]]></description>
		<content:encoded><![CDATA[<p>Hi Sandie. So fare i can see, Arbuckle has answered your question. I dont see any sense.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: lingpipe</title>
		<link>http://lingpipe-blog.com/2009/07/21/russ-altman-on-poisoning-your-prior/#comment-5191</link>
		<dc:creator><![CDATA[lingpipe]]></dc:creator>
		<pubDate>Wed, 22 Jul 2009 18:43:51 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe-blog.com/?p=1871#comment-5191</guid>
		<description><![CDATA[I see the semantic confusion -- what you call &quot;validation&quot; is what I&#039;d call &quot;testing&quot; or &quot;evaluation&quot;, not what I&#039;d call &quot;development&quot;.   During development, the prior&#039;s in play for tuning.  During evaluation, it really can&#039;t be, because you could get any result you wanted.

This brings up an issue with reporting cross-validation results rather than true held-out test results.  I think of cross-validation like development, not like testing.  The problem often found in papers is a confusion of optimal cross-validated parameters with true test results.   It&#039;s too easy to overfit cross-validation, so despite the name, tuned cross-validation results aren&#039;t to be trusted for what Russ is calling &quot;validation&quot;. 

Of course, the real evaluation we care about is prediction of things we don&#039;t yet know.  Too bad that&#039;s so much work in biology.]]></description>
		<content:encoded><![CDATA[<p>I see the semantic confusion &#8212; what you call &#8220;validation&#8221; is what I&#8217;d call &#8220;testing&#8221; or &#8220;evaluation&#8221;, not what I&#8217;d call &#8220;development&#8221;.   During development, the prior&#8217;s in play for tuning.  During evaluation, it really can&#8217;t be, because you could get any result you wanted.</p>
<p>This brings up an issue with reporting cross-validation results rather than true held-out test results.  I think of cross-validation like development, not like testing.  The problem often found in papers is a confusion of optimal cross-validated parameters with true test results.   It&#8217;s too easy to overfit cross-validation, so despite the name, tuned cross-validation results aren&#8217;t to be trusted for what Russ is calling &#8220;validation&#8221;. </p>
<p>Of course, the real evaluation we care about is prediction of things we don&#8217;t yet know.  Too bad that&#8217;s so much work in biology.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Russ Altman</title>
		<link>http://lingpipe-blog.com/2009/07/21/russ-altman-on-poisoning-your-prior/#comment-5190</link>
		<dc:creator><![CDATA[Russ Altman]]></dc:creator>
		<pubDate>Wed, 22 Jul 2009 17:59:36 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe-blog.com/?p=1871#comment-5190</guid>
		<description><![CDATA[As you know, for the purposes of academic publication, one needs to prove the performance of any new classifier and a common way to do this is to emulate how it will do in the world when it is let loose on real, unsolved problems.   In that case, we save some known stuff that is not used in training, so we can see how the algorithm would do on these &quot;gold standard&quot; known results, as a proxy for how it will do on true things that we don&#039;t know later.   That&#039;s what I mean by the validation phase.]]></description>
		<content:encoded><![CDATA[<p>As you know, for the purposes of academic publication, one needs to prove the performance of any new classifier and a common way to do this is to emulate how it will do in the world when it is let loose on real, unsolved problems.   In that case, we save some known stuff that is not used in training, so we can see how the algorithm would do on these &#8220;gold standard&#8221; known results, as a proxy for how it will do on true things that we don&#8217;t know later.   That&#8217;s what I mean by the validation phase.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: lingpipe</title>
		<link>http://lingpipe-blog.com/2009/07/21/russ-altman-on-poisoning-your-prior/#comment-5189</link>
		<dc:creator><![CDATA[lingpipe]]></dc:creator>
		<pubDate>Wed, 22 Jul 2009 17:54:28 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe-blog.com/?p=1871#comment-5189</guid>
		<description><![CDATA[Russ:  Thanks for the response.  I figured I might be misinterpreting something.  Is there a paper with the statistical details?  Were you talking about the &lt;a href=&quot;http://www.biomedcentral.com/1471-2105/9/214&quot; rel=&quot;nofollow&quot;&gt;M-BISON paper&lt;/a&gt; on micro-array association?  It looks like a very interesting use of informative priors.

I was trying to emphasize that from a Bayesian perspective, the prior is just another parameter in the model, and shouldn&#039;t get any special treatment.    

I see model-building as a continuous loop of tuning and testing fit, so I&#039;m confused by what you mean by &quot;validation phase&quot;.   Do you mean looking to see that an interaction you expect to see is actually found by the algorithm?  With fine-grained enough priors you could percolate instances to the top, which would definitely fall into the realm of overfitting the model.]]></description>
		<content:encoded><![CDATA[<p>Russ:  Thanks for the response.  I figured I might be misinterpreting something.  Is there a paper with the statistical details?  Were you talking about the <a href="http://www.biomedcentral.com/1471-2105/9/214" rel="nofollow">M-BISON paper</a> on micro-array association?  It looks like a very interesting use of informative priors.</p>
<p>I was trying to emphasize that from a Bayesian perspective, the prior is just another parameter in the model, and shouldn&#8217;t get any special treatment.    </p>
<p>I see model-building as a continuous loop of tuning and testing fit, so I&#8217;m confused by what you mean by &#8220;validation phase&#8221;.   Do you mean looking to see that an interaction you expect to see is actually found by the algorithm?  With fine-grained enough priors you could percolate instances to the top, which would definitely fall into the realm of overfitting the model.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Russ</title>
		<link>http://lingpipe-blog.com/2009/07/21/russ-altman-on-poisoning-your-prior/#comment-5184</link>
		<dc:creator><![CDATA[Russ]]></dc:creator>
		<pubDate>Tue, 21 Jul 2009 23:05:20 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe-blog.com/?p=1871#comment-5184</guid>
		<description><![CDATA[NO!  You misinterpreted what I was saying.  I said the quote to stress that you can&#039;t build the answer into  your prior *during the validation phase*.  In order to have a credible validation, you need to set your prior, seal it, and THEN apply the method to make predictions.  Once your method is validated and you are making real predictions. go crazy and use everything you know--but during validation, don&#039;t cheat!]]></description>
		<content:encoded><![CDATA[<p>NO!  You misinterpreted what I was saying.  I said the quote to stress that you can&#8217;t build the answer into  your prior *during the validation phase*.  In order to have a credible validation, you need to set your prior, seal it, and THEN apply the method to make predictions.  Once your method is validated and you are making real predictions. go crazy and use everything you know&#8211;but during validation, don&#8217;t cheat!</p>
]]></content:encoded>
	</item>
</channel>
</rss>

