<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: White Paper: Multilevel Bayesian Models of Categorical Data Annotation</title>
	<atom:link href="http://lingpipe-blog.com/2008/11/17/white-paper-multilevel-bayesian-models-of-categorical-data-annotation/feed/" rel="self" type="application/rss+xml" />
	<link>http://lingpipe-blog.com/2008/11/17/white-paper-multilevel-bayesian-models-of-categorical-data-annotation/</link>
	<description>Natural Language Processing and Text Analytics</description>
	<lastBuildDate>Wed, 08 Feb 2012 17:47:08 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: John Winn</title>
		<link>http://lingpipe-blog.com/2008/11/17/white-paper-multilevel-bayesian-models-of-categorical-data-annotation/#comment-6825</link>
		<dc:creator><![CDATA[John Winn]]></dc:creator>
		<pubDate>Thu, 06 May 2010 16:59:44 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe.wordpress.com/?p=281#comment-6825</guid>
		<description><![CDATA[@lingpipe Infer.NET may be able to help with your larger data set.
http://research.microsoft.com/infernet/
It&#039;s been used for datasets of up to 100M records by using the support for chunking up the data.

It offers variational EM and Expectation Propagation as its main inference algorithms (preliminary Gibbs support is in there as well).  For a comparison to BUGS read this thread:
http://community.research.microsoft.com/forums/t/4823.aspx

We&#039;re planning a new release shortly with a number of optimisations that should increase the size of dataset that can be used without chunking.

Hope that helps,
John Winn, Infer.NET team]]></description>
		<content:encoded><![CDATA[<p>@lingpipe Infer.NET may be able to help with your larger data set.<br />
<a href="http://research.microsoft.com/infernet/" rel="nofollow">http://research.microsoft.com/infernet/</a><br />
It&#8217;s been used for datasets of up to 100M records by using the support for chunking up the data.</p>
<p>It offers variational EM and Expectation Propagation as its main inference algorithms (preliminary Gibbs support is in there as well).  For a comparison to BUGS read this thread:<br />
<a href="http://community.research.microsoft.com/forums/t/4823.aspx" rel="nofollow">http://community.research.microsoft.com/forums/t/4823.aspx</a></p>
<p>We&#8217;re planning a new release shortly with a number of optimisations that should increase the size of dataset that can be used without chunking.</p>
<p>Hope that helps,<br />
John Winn, Infer.NET team</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: lingpipe</title>
		<link>http://lingpipe-blog.com/2008/11/17/white-paper-multilevel-bayesian-models-of-categorical-data-annotation/#comment-3105</link>
		<dc:creator><![CDATA[lingpipe]]></dc:creator>
		<pubDate>Thu, 20 Nov 2008 17:54:52 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe.wordpress.com/?p=281#comment-3105</guid>
		<description><![CDATA[The Gibbs samplers in BUGS somewhat counter-intuitively to me needed fewer samples when there was more data.  I found with bigger data sets that the sampler converged faster in real time and much faster in terms of number of epochs.  I&#039;m guessing that&#039;s because the underlying distribution samplers in BUGS were doing less rejection.

With the dentistry data, I found it didn&#039;t take a huge number of samples, but the posteriors were rather diffuse.  Especially on the item-level difficulty parameters (in the beta-binomial by item or logistic models) and in the hierarchical parameters for annotators (beta-binomial by annotator and logistic models).  I also found it hard to fit the multiplicative slope parameter from Qu et al. (also in the Uebersax and Grove models) -- the posteriors were all over the place with very little difference in log likelihood.  

Also, BUGS pitched a fit (throwing underflow/overflow exceptions) when I tried to swap in probit for logit.  I&#039;ve seen this mentioned before.

The other problem with BUGS and R is scaling -- I&#039;m about to create a 200K item data set using the mechanical Turk and I may need to swap over to something like Hal Daume&#039;s &lt;a href=&quot;http://www.cs.utah.edu/~hal/HBC/&quot; rel=&quot;nofollow&quot;&gt;hierarchical Bayesian compiler (HBC)&lt;/a&gt;.  Looking at the &lt;a href=&quot;http://www.cs.utah.edu/~hal/HBC/hbc.pdf&quot; rel=&quot;nofollow&quot;&gt;HBC doc&lt;/a&gt;, it doesn&#039;t look like it&#039;ll implement hierarchical generalized linear models like logistic or probit regression.]]></description>
		<content:encoded><![CDATA[<p>The Gibbs samplers in BUGS somewhat counter-intuitively to me needed fewer samples when there was more data.  I found with bigger data sets that the sampler converged faster in real time and much faster in terms of number of epochs.  I&#8217;m guessing that&#8217;s because the underlying distribution samplers in BUGS were doing less rejection.</p>
<p>With the dentistry data, I found it didn&#8217;t take a huge number of samples, but the posteriors were rather diffuse.  Especially on the item-level difficulty parameters (in the beta-binomial by item or logistic models) and in the hierarchical parameters for annotators (beta-binomial by annotator and logistic models).  I also found it hard to fit the multiplicative slope parameter from Qu et al. (also in the Uebersax and Grove models) &#8212; the posteriors were all over the place with very little difference in log likelihood.  </p>
<p>Also, BUGS pitched a fit (throwing underflow/overflow exceptions) when I tried to swap in probit for logit.  I&#8217;ve seen this mentioned before.</p>
<p>The other problem with BUGS and R is scaling &#8212; I&#8217;m about to create a 200K item data set using the mechanical Turk and I may need to swap over to something like Hal Daume&#8217;s <a href="http://www.cs.utah.edu/~hal/HBC/" rel="nofollow">hierarchical Bayesian compiler (HBC)</a>.  Looking at the <a href="http://www.cs.utah.edu/~hal/HBC/hbc.pdf" rel="nofollow">HBC doc</a>, it doesn&#8217;t look like it&#8217;ll implement hierarchical generalized linear models like logistic or probit regression.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ken</title>
		<link>http://lingpipe-blog.com/2008/11/17/white-paper-multilevel-bayesian-models-of-categorical-data-annotation/#comment-3103</link>
		<dc:creator><![CDATA[Ken]]></dc:creator>
		<pubDate>Thu, 20 Nov 2008 00:12:27 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe.wordpress.com/?p=281#comment-3103</guid>
		<description><![CDATA[Agreed about the speculative comments, it would be better just dealing with the present.  

I&#039;ll have a look at the rest when I get the chance. The dentistry data isn&#039;t too bad to fit, some of the other Qu et al data is harder to fit. The paper is also full of typo&#039;s. An interesting question will be whether MCMC will require a massive number of samples to fully explore the posterior probability for some of the models.]]></description>
		<content:encoded><![CDATA[<p>Agreed about the speculative comments, it would be better just dealing with the present.  </p>
<p>I&#8217;ll have a look at the rest when I get the chance. The dentistry data isn&#8217;t too bad to fit, some of the other Qu et al data is harder to fit. The paper is also full of typo&#8217;s. An interesting question will be whether MCMC will require a massive number of samples to fully explore the posterior probability for some of the models.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: lingpipe</title>
		<link>http://lingpipe-blog.com/2008/11/17/white-paper-multilevel-bayesian-models-of-categorical-data-annotation/#comment-3101</link>
		<dc:creator><![CDATA[lingpipe]]></dc:creator>
		<pubDate>Wed, 19 Nov 2008 23:53:32 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe.wordpress.com/?p=281#comment-3101</guid>
		<description><![CDATA[The project is named &quot;hierAnno&quot;.  It&#039;s not on the sandbox web page yet, because it wasn&#039;t done in time for the last release.  I edited the blog post to include the relevant command so it&#039;s clearer.]]></description>
		<content:encoded><![CDATA[<p>The project is named &#8220;hierAnno&#8221;.  It&#8217;s not on the sandbox web page yet, because it wasn&#8217;t done in time for the last release.  I edited the blog post to include the relevant command so it&#8217;s clearer.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Brendan O'Connor</title>
		<link>http://lingpipe-blog.com/2008/11/17/white-paper-multilevel-bayesian-models-of-categorical-data-annotation/#comment-3100</link>
		<dc:creator><![CDATA[Brendan O'Connor]]></dc:creator>
		<pubDate>Wed, 19 Nov 2008 20:45:32 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe.wordpress.com/?p=281#comment-3100</guid>
		<description><![CDATA[Very nice, thanks for pulling all this together -- I was starting to refer people to several different blog posts of yours for more information on the topic :)

What&#039;s the CVS project name?  I couldn&#039;t find it on the webpage listing all the project names, and it seems to be necessary to do a checkout.]]></description>
		<content:encoded><![CDATA[<p>Very nice, thanks for pulling all this together &#8212; I was starting to refer people to several different blog posts of yours for more information on the topic :)</p>
<p>What&#8217;s the CVS project name?  I couldn&#8217;t find it on the webpage listing all the project names, and it seems to be necessary to do a checkout.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: lingpipe</title>
		<link>http://lingpipe-blog.com/2008/11/17/white-paper-multilevel-bayesian-models-of-categorical-data-annotation/#comment-3094</link>
		<dc:creator><![CDATA[lingpipe]]></dc:creator>
		<pubDate>Tue, 18 Nov 2008 19:27:42 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe.wordpress.com/?p=281#comment-3094</guid>
		<description><![CDATA[I found fitting the dentistry data a challenge in the hierarchical setting with item-level effects because of the small number of coders per item.  There&#039;s a broad expanse of parameters that provide roughly the same deviance or log likelihood.  So I&#039;d suspect EM would have similar problems -- how&#039;d it work on the dentistry data?   

I might also be able to speed up the sampler by starting at the ML solution found by EM if it&#039;s fast and robust.

I really need to handle the varying panel situation in which not every coder annotates every item.  

Here&#039;s the link to Ken Beath&#039;s randomLCA package, which implements Qu, Tan and Kutner&#039;s (1996) random effects model 2LCR.  

http://cran.r-project.org/web/packages/randomLCA/index.html

As an aside, I find it confusing when doc has speculative comments about what might be coming next rather than sticking to what&#039;s implemented.]]></description>
		<content:encoded><![CDATA[<p>I found fitting the dentistry data a challenge in the hierarchical setting with item-level effects because of the small number of coders per item.  There&#8217;s a broad expanse of parameters that provide roughly the same deviance or log likelihood.  So I&#8217;d suspect EM would have similar problems &#8212; how&#8217;d it work on the dentistry data?   </p>
<p>I might also be able to speed up the sampler by starting at the ML solution found by EM if it&#8217;s fast and robust.</p>
<p>I really need to handle the varying panel situation in which not every coder annotates every item.  </p>
<p>Here&#8217;s the link to Ken Beath&#8217;s randomLCA package, which implements Qu, Tan and Kutner&#8217;s (1996) random effects model 2LCR.  </p>
<p><a href="http://cran.r-project.org/web/packages/randomLCA/index.html" rel="nofollow">http://cran.r-project.org/web/packages/randomLCA/index.html</a></p>
<p>As an aside, I find it confusing when doc has speculative comments about what might be coming next rather than sticking to what&#8217;s implemented.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Sharepoint Integration Again</title>
		<link>http://lingpipe-blog.com/2008/11/17/white-paper-multilevel-bayesian-models-of-categorical-data-annotation/#comment-3092</link>
		<dc:creator><![CDATA[Sharepoint Integration Again]]></dc:creator>
		<pubDate>Tue, 18 Nov 2008 11:29:35 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe.wordpress.com/?p=281#comment-3092</guid>
		<description><![CDATA[[...] * White Paper: Multilevel Bayesian Models of Categorical Data Annotation [...]]]></description>
		<content:encoded><![CDATA[<p>[...] * White Paper: Multilevel Bayesian Models of Categorical Data Annotation [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ken</title>
		<link>http://lingpipe-blog.com/2008/11/17/white-paper-multilevel-bayesian-models-of-categorical-data-annotation/#comment-3090</link>
		<dc:creator><![CDATA[Ken]]></dc:creator>
		<pubDate>Tue, 18 Nov 2008 05:36:44 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe.wordpress.com/?p=281#comment-3090</guid>
		<description><![CDATA[For a frequentist approach to this type of model my package randomLCA is available on CRAN. It still needs some more work but the vignette includes an analysis of the dentistry data. When I get a chance I&#039;ll read your paper and compare.]]></description>
		<content:encoded><![CDATA[<p>For a frequentist approach to this type of model my package randomLCA is available on CRAN. It still needs some more work but the vignette includes an analysis of the dentistry data. When I get a chance I&#8217;ll read your paper and compare.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

