<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: What&#8217;s Wrong with Probability Notation?</title>
	<atom:link href="http://lingpipe-blog.com/2009/10/13/whats-wrong-with-probability-notation/feed/" rel="self" type="application/rss+xml" />
	<link>http://lingpipe-blog.com/2009/10/13/whats-wrong-with-probability-notation/</link>
	<description>Natural Language Processing and Text Analytics</description>
	<lastBuildDate>Sat, 04 Feb 2012 20:56:48 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: Bob Carpenter</title>
		<link>http://lingpipe-blog.com/2009/10/13/whats-wrong-with-probability-notation/#comment-16068</link>
		<dc:creator><![CDATA[Bob Carpenter]]></dc:creator>
		<pubDate>Mon, 10 Oct 2011 20:42:20 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe-blog.com/?p=2724#comment-16068</guid>
		<description><![CDATA[That introduces several new problems. 

1.   Do we then use $latex \mbox{Pr}(A)$ for event probs?   

2.  And what about mixtures, like spike and slab or Dirichlet process which are part discrete and part continuous?

3. The objection to the cap/lowercase convention is about matrices and to a lesser extent Greek letters.  If we also want to capitalize matrices or bold them, we run into conflicting conventions.  Not all the Greek letters have easily distinguishable caps.

3.  While writing $latex p(X=x&#124;C=c)$ may clear up some confusions, it also runs head-on into the notation used for events, where $latex X = x$ is shorthand for the event $latex \{\omega \in \Omega&#124;\omega(X) = x\}$.  And obviously $latex p(X=x&#124;C=c) = 0$ if we&#039;re talking events and $latex X$ is continuous.

4.  I don&#039;t see how using conditionals cleans up the distinction between $latex p(x&#124;y)$ and $latex p(y&#124;x)$, which would be written in probability theory as $latex p_{X&#124;Y}(x&#124;y)$ and $latex p_{Y&#124;X}(y&#124;x)$.  This notation gets cumbersome when we have a dozen parameters.  

I&#039;ve never heard anyone say that the problem is purely Bayesian or frequentist -- this is just about probability theory, about which everyone is in agreement.  The frequentist/Bayesian debate is about what can be the object of a probability distribution, not how the laws of probability work.]]></description>
		<content:encoded><![CDATA[<p>That introduces several new problems. </p>
<p>1.   Do we then use <img src='http://s0.wp.com/latex.php?latex=%5Cmbox%7BPr%7D%28A%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mbox{Pr}(A)' title='&#92;mbox{Pr}(A)' class='latex' /> for event probs?   </p>
<p>2.  And what about mixtures, like spike and slab or Dirichlet process which are part discrete and part continuous?</p>
<p>3. The objection to the cap/lowercase convention is about matrices and to a lesser extent Greek letters.  If we also want to capitalize matrices or bold them, we run into conflicting conventions.  Not all the Greek letters have easily distinguishable caps.</p>
<p>3.  While writing <img src='http://s0.wp.com/latex.php?latex=p%28X%3Dx%7CC%3Dc%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p(X=x|C=c)' title='p(X=x|C=c)' class='latex' /> may clear up some confusions, it also runs head-on into the notation used for events, where <img src='http://s0.wp.com/latex.php?latex=X+%3D+x&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X = x' title='X = x' class='latex' /> is shorthand for the event <img src='http://s0.wp.com/latex.php?latex=%5C%7B%5Comega+%5Cin+%5COmega%7C%5Comega%28X%29+%3D+x%5C%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;{&#92;omega &#92;in &#92;Omega|&#92;omega(X) = x&#92;}' title='&#92;{&#92;omega &#92;in &#92;Omega|&#92;omega(X) = x&#92;}' class='latex' />.  And obviously <img src='http://s0.wp.com/latex.php?latex=p%28X%3Dx%7CC%3Dc%29+%3D+0&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p(X=x|C=c) = 0' title='p(X=x|C=c) = 0' class='latex' /> if we&#8217;re talking events and <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' /> is continuous.</p>
<p>4.  I don&#8217;t see how using conditionals cleans up the distinction between <img src='http://s0.wp.com/latex.php?latex=p%28x%7Cy%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p(x|y)' title='p(x|y)' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=p%28y%7Cx%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p(y|x)' title='p(y|x)' class='latex' />, which would be written in probability theory as <img src='http://s0.wp.com/latex.php?latex=p_%7BX%7CY%7D%28x%7Cy%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_{X|Y}(x|y)' title='p_{X|Y}(x|y)' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=p_%7BY%7CX%7D%28y%7Cx%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_{Y|X}(y|x)' title='p_{Y|X}(y|x)' class='latex' />.  This notation gets cumbersome when we have a dozen parameters.  </p>
<p>I&#8217;ve never heard anyone say that the problem is purely Bayesian or frequentist &#8212; this is just about probability theory, about which everyone is in agreement.  The frequentist/Bayesian debate is about what can be the object of a probability distribution, not how the laws of probability work.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Julien Diard</title>
		<link>http://lingpipe-blog.com/2009/10/13/whats-wrong-with-probability-notation/#comment-16023</link>
		<dc:creator><![CDATA[Julien Diard]]></dc:creator>
		<pubDate>Fri, 07 Oct 2011 12:25:08 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe-blog.com/?p=2724#comment-16023</guid>
		<description><![CDATA[Concerning your general issue with classical probabilistic notation: I could not agree more. However, I believe an elegant solution already exists, and was proposed by Jaynes. 

It&#039;s basis is simple: p(x), or P(x), is an object which does not exist. The correct tool in probabilities is conditional probabilities: one should always specify the preliminary knowledge that conditions the state of uncertainty about a quantity x. Therefore, p(X &#124; c) is the distribution about variable X under the assumption that c holds. If preliminary knowledge is different, say c&#039;, then p(X &#124; c&#039;) can be another mathematical distribution. Function p(. &#124; .), in this case, is not overloaded (contrary to what many reviewers of my papers asserted, but that&#039;s not the point. :) ).

Complement this with a few conventions, for instance, use p(. &#124; .) in the continuous case, P(. &#124; .) in the discrete case. Use capitalized symbols when referring to variables (i.e., domains), and small-capped symbols for values. So that P([X=x] &#124; c) (or P(X=x &#124; c), when you&#039;re lazy) is a probability value, whereas P(X &#124; c) is a probability distribution. And you should be all set. 

The use of right-hand side symbols to make probability distributions non-ambiguous does not even need to be tied to &quot;subjectivist&quot; stances about the meaning of probabilities as states of knowledge of agents. Indeed, it is also the basis of (purely Bayesian statistics or machine learning inspired) methods of model selection: both P(X &#124; c) and P(X &#124; c&#039;) being defined, a new variable C can then be introduced, with domain C={c, c&#039;}, and the model P(C &#124; D) P(X &#124; C D) can then be introduced to carry out model selection by computing P(C &#124; X D).]]></description>
		<content:encoded><![CDATA[<p>Concerning your general issue with classical probabilistic notation: I could not agree more. However, I believe an elegant solution already exists, and was proposed by Jaynes. </p>
<p>It&#8217;s basis is simple: p(x), or P(x), is an object which does not exist. The correct tool in probabilities is conditional probabilities: one should always specify the preliminary knowledge that conditions the state of uncertainty about a quantity x. Therefore, p(X | c) is the distribution about variable X under the assumption that c holds. If preliminary knowledge is different, say c&#8217;, then p(X | c&#8217;) can be another mathematical distribution. Function p(. | .), in this case, is not overloaded (contrary to what many reviewers of my papers asserted, but that&#8217;s not the point. :) ).</p>
<p>Complement this with a few conventions, for instance, use p(. | .) in the continuous case, P(. | .) in the discrete case. Use capitalized symbols when referring to variables (i.e., domains), and small-capped symbols for values. So that P([X=x] | c) (or P(X=x | c), when you&#8217;re lazy) is a probability value, whereas P(X | c) is a probability distribution. And you should be all set. </p>
<p>The use of right-hand side symbols to make probability distributions non-ambiguous does not even need to be tied to &#8220;subjectivist&#8221; stances about the meaning of probabilities as states of knowledge of agents. Indeed, it is also the basis of (purely Bayesian statistics or machine learning inspired) methods of model selection: both P(X | c) and P(X | c&#8217;) being defined, a new variable C can then be introduced, with domain C={c, c&#8217;}, and the model P(C | D) P(X | C D) can then be introduced to carry out model selection by computing P(C | X D).</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Julien Diard</title>
		<link>http://lingpipe-blog.com/2009/10/13/whats-wrong-with-probability-notation/#comment-16022</link>
		<dc:creator><![CDATA[Julien Diard]]></dc:creator>
		<pubDate>Fri, 07 Oct 2011 12:10:09 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe-blog.com/?p=2724#comment-16022</guid>
		<description><![CDATA[Leaving the sample space implicit really should be prohibited. It&#039;s like crossing the beams, it&#039;s bad. ;) 

I have read way too many papers where, for instance, Normal distributions were happily applied over R+, or even over circular spaces. Consider, oh, &quot;almost&quot; all of the robotics navigation and mapping literature of the 90&#039;s: positions x, y and orientations \theta of robots were put together in a single &quot;pose&quot; variable, a linear Gaussian models were applied directly on this 3D variable... :)]]></description>
		<content:encoded><![CDATA[<p>Leaving the sample space implicit really should be prohibited. It&#8217;s like crossing the beams, it&#8217;s bad. ;) </p>
<p>I have read way too many papers where, for instance, Normal distributions were happily applied over R+, or even over circular spaces. Consider, oh, &#8220;almost&#8221; all of the robotics navigation and mapping literature of the 90&#8242;s: positions x, y and orientations \theta of robots were put together in a single &#8220;pose&#8221; variable, a linear Gaussian models were applied directly on this 3D variable&#8230; :)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: lingpipe</title>
		<link>http://lingpipe-blog.com/2009/10/13/whats-wrong-with-probability-notation/#comment-7129</link>
		<dc:creator><![CDATA[lingpipe]]></dc:creator>
		<pubDate>Fri, 18 Jun 2010 19:47:02 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe-blog.com/?p=2724#comment-7129</guid>
		<description><![CDATA[Once you understand the notation, it&#039;s consistent.  That&#039;s not to say that some people don&#039;t use it inconsistently.  The problem for beginners is just how much the notation&#039;s overloaded.  

I&#039;m afraid it doesn&#039;t make sense to talk about the independence of random variables over different sample spaces.  Keep in mind that it&#039;s OK to have different outcomes in the same sample space -- the sample space itself is abstract.  For instance, if you have two discrete distros with outcomes {A,B} and {X,Y,Z}, then you can have a sample space with six points, {AX, AY, AZ, BX, BY, BZ}.   The value of the first variable is A for samples AX, AY and AZ, and B for samples BX, BY and BZ.  Similar, the value of the second variable is X for AX and BX, and so on.  

For instance, writing independence out in full, variables $latex X$ and $latex Y$ over a sample space are independent if and only if $latex p_{X,Y}(x,y) = p_X(x) \times p_Y(y)$ for all $latex x, y$.  The reason you need this to be over a single sample space is that you can&#039;t define the joint distribution $latex p_{X,Y}$ otherwise.

In practice, the sample space is rarely mentioned and random variables are defined by their distributions.  It&#039;s just assumed everything comes from the same sample space.  Often you define $latex p_{X,Y}$ by first defining $latex p_X$ and $latex p_Y$ and then stating that they&#039;re independent.]]></description>
		<content:encoded><![CDATA[<p>Once you understand the notation, it&#8217;s consistent.  That&#8217;s not to say that some people don&#8217;t use it inconsistently.  The problem for beginners is just how much the notation&#8217;s overloaded.  </p>
<p>I&#8217;m afraid it doesn&#8217;t make sense to talk about the independence of random variables over different sample spaces.  Keep in mind that it&#8217;s OK to have different outcomes in the same sample space &#8212; the sample space itself is abstract.  For instance, if you have two discrete distros with outcomes {A,B} and {X,Y,Z}, then you can have a sample space with six points, {AX, AY, AZ, BX, BY, BZ}.   The value of the first variable is A for samples AX, AY and AZ, and B for samples BX, BY and BZ.  Similar, the value of the second variable is X for AX and BX, and so on.  </p>
<p>For instance, writing independence out in full, variables <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=Y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='Y' title='Y' class='latex' /> over a sample space are independent if and only if <img src='http://s0.wp.com/latex.php?latex=p_%7BX%2CY%7D%28x%2Cy%29+%3D+p_X%28x%29+%5Ctimes+p_Y%28y%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_{X,Y}(x,y) = p_X(x) &#92;times p_Y(y)' title='p_{X,Y}(x,y) = p_X(x) &#92;times p_Y(y)' class='latex' /> for all <img src='http://s0.wp.com/latex.php?latex=x%2C+y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x, y' title='x, y' class='latex' />.  The reason you need this to be over a single sample space is that you can&#8217;t define the joint distribution <img src='http://s0.wp.com/latex.php?latex=p_%7BX%2CY%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_{X,Y}' title='p_{X,Y}' class='latex' /> otherwise.</p>
<p>In practice, the sample space is rarely mentioned and random variables are defined by their distributions.  It&#8217;s just assumed everything comes from the same sample space.  Often you define <img src='http://s0.wp.com/latex.php?latex=p_%7BX%2CY%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_{X,Y}' title='p_{X,Y}' class='latex' /> by first defining <img src='http://s0.wp.com/latex.php?latex=p_X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_X' title='p_X' class='latex' /> and <img src='http://s0.wp.com/latex.php?latex=p_Y&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_Y' title='p_Y' class='latex' /> and then stating that they&#8217;re independent.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Alan Mainwaring</title>
		<link>http://lingpipe-blog.com/2009/10/13/whats-wrong-with-probability-notation/#comment-7121</link>
		<dc:creator><![CDATA[Alan Mainwaring]]></dc:creator>
		<pubDate>Fri, 18 Jun 2010 04:58:12 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe-blog.com/?p=2724#comment-7121</guid>
		<description><![CDATA[Can somebody help me here, I started off trying to understand the concept of nested designs in ANOVA, then moved into trying to clarify what was meant by the term variate, bivariate, etc. Then the term Random variable as being a real valued function defined on a sample space (only dealing with discreet) at the moment. I then tried to sort out the term independent variables from the term statistically independent random variables. All of the definitions I have seen define two or more independent random variable over the SAME sample space. Why could we not have different random variables defined over different sample spaces (discreet ones). The more I think about probability theory the more confused I get. Something has got to be done about this to bring about some sort of simplicity and consistency in this very important area.
Hope there is someone out there who can help.]]></description>
		<content:encoded><![CDATA[<p>Can somebody help me here, I started off trying to understand the concept of nested designs in ANOVA, then moved into trying to clarify what was meant by the term variate, bivariate, etc. Then the term Random variable as being a real valued function defined on a sample space (only dealing with discreet) at the moment. I then tried to sort out the term independent variables from the term statistically independent random variables. All of the definitions I have seen define two or more independent random variable over the SAME sample space. Why could we not have different random variables defined over different sample spaces (discreet ones). The more I think about probability theory the more confused I get. Something has got to be done about this to bring about some sort of simplicity and consistency in this very important area.<br />
Hope there is someone out there who can help.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Problem with notation in applied Bayesian work</title>
		<link>http://lingpipe-blog.com/2009/10/13/whats-wrong-with-probability-notation/#comment-6828</link>
		<dc:creator><![CDATA[Problem with notation in applied Bayesian work]]></dc:creator>
		<pubDate>Fri, 07 May 2010 23:12:34 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe-blog.com/?p=2724#comment-6828</guid>
		<description><![CDATA[[...] One hurdle newcomers have to applied Bayesian work is understand the notation at work. Understanding that p(x) is not the same function as p(y). Typically these refer to the marginal density (or mass) function for x and y, respectively. Similarly p(x&#124;y) is not the same function as p(y&#124;x), but instead the first is the function describing the conditional density (or mass) function of x given y and the second is the conditional density (or mass) function of y given x. Attempts to rectify this notation seem to make the notation overly complicated and therefore, the differences are made implicit. For more discussion of this, please see this post. [...]]]></description>
		<content:encoded><![CDATA[<p>[...] One hurdle newcomers have to applied Bayesian work is understand the notation at work. Understanding that p(x) is not the same function as p(y). Typically these refer to the marginal density (or mass) function for x and y, respectively. Similarly p(x|y) is not the same function as p(y|x), but instead the first is the function describing the conditional density (or mass) function of x given y and the second is the conditional density (or mass) function of y given x. Attempts to rectify this notation seem to make the notation overly complicated and therefore, the differences are made implicit. For more discussion of this, please see this post. [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jason Rennie</title>
		<link>http://lingpipe-blog.com/2009/10/13/whats-wrong-with-probability-notation/#comment-5920</link>
		<dc:creator><![CDATA[Jason Rennie]]></dc:creator>
		<pubDate>Fri, 04 Dec 2009 20:42:55 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe-blog.com/?p=2724#comment-5920</guid>
		<description><![CDATA[Wouldn&#039;t they write $latex p_{X&#124;Y}(x&#124;y)$ when being careful?  I intentionally left-off the function arguments.

Thanks for the wordpress/latex tip :)]]></description>
		<content:encoded><![CDATA[<p>Wouldn&#8217;t they write <img src='http://s0.wp.com/latex.php?latex=p_%7BX%7CY%7D%28x%7Cy%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_{X|Y}(x|y)' title='p_{X|Y}(x|y)' class='latex' /> when being careful?  I intentionally left-off the function arguments.</p>
<p>Thanks for the wordpress/latex tip :)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: lingpipe</title>
		<link>http://lingpipe-blog.com/2009/10/13/whats-wrong-with-probability-notation/#comment-5919</link>
		<dc:creator><![CDATA[lingpipe]]></dc:creator>
		<pubDate>Fri, 04 Dec 2009 20:32:24 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe-blog.com/?p=2724#comment-5919</guid>
		<description><![CDATA[Indeed, that&#039;s how the notation is used to distinguish random variables $latex X,Y$ from regular old bound variables $latex x,y$.  But you typically only see this in careful discussions of probability theory or intro stats texts.  

In practical modeling papers, where there are parameters, matrices, etc., the upper-case/lower-case thing gets difficult to maintain.  And it&#039;s so rare to see anything other than $latex p_{X&#124;Y}(x&#124;y)$ that it seems awfully pedantic to include all those subscripts.  

The real kicker is that you almost never see random variables defined as maps $latex X:\Omega \rightarrow {\mathbb R}$.  In fact, you never see the sample space $latex \Omega$ even mentioned.  Instead, you&#039;ll see statements like &quot;assume $latex X$ is a random variable distributed as $latex \mbox{Binomial}(\theta,N)$.&quot;  ]]></description>
		<content:encoded><![CDATA[<p>Indeed, that&#8217;s how the notation is used to distinguish random variables <img src='http://s0.wp.com/latex.php?latex=X%2CY&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X,Y' title='X,Y' class='latex' /> from regular old bound variables <img src='http://s0.wp.com/latex.php?latex=x%2Cy&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='x,y' title='x,y' class='latex' />.  But you typically only see this in careful discussions of probability theory or intro stats texts.  </p>
<p>In practical modeling papers, where there are parameters, matrices, etc., the upper-case/lower-case thing gets difficult to maintain.  And it&#8217;s so rare to see anything other than <img src='http://s0.wp.com/latex.php?latex=p_%7BX%7CY%7D%28x%7Cy%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_{X|Y}(x|y)' title='p_{X|Y}(x|y)' class='latex' /> that it seems awfully pedantic to include all those subscripts.  </p>
<p>The real kicker is that you almost never see random variables defined as maps <img src='http://s0.wp.com/latex.php?latex=X%3A%5COmega+%5Crightarrow+%7B%5Cmathbb+R%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X:&#92;Omega &#92;rightarrow {&#92;mathbb R}' title='X:&#92;Omega &#92;rightarrow {&#92;mathbb R}' class='latex' />.  In fact, you never see the sample space <img src='http://s0.wp.com/latex.php?latex=%5COmega&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;Omega' title='&#92;Omega' class='latex' /> even mentioned.  Instead, you&#8217;ll see statements like &#8220;assume <img src='http://s0.wp.com/latex.php?latex=X&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='X' title='X' class='latex' /> is a random variable distributed as <img src='http://s0.wp.com/latex.php?latex=%5Cmbox%7BBinomial%7D%28%5Ctheta%2CN%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='&#92;mbox{Binomial}(&#92;theta,N)' title='&#92;mbox{Binomial}(&#92;theta,N)' class='latex' />.&#8221;</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jason Rennie</title>
		<link>http://lingpipe-blog.com/2009/10/13/whats-wrong-with-probability-notation/#comment-5917</link>
		<dc:creator><![CDATA[Jason Rennie]]></dc:creator>
		<pubDate>Fri, 04 Dec 2009 19:12:52 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe-blog.com/?p=2724#comment-5917</guid>
		<description><![CDATA[I sometimes wonder why we don&#039;t write:

$latex p_{X&#124;Y} = \frac{p_{Y&#124;X} \ p_X}{p_Y}$

This seems less heinous than $latex p(x&#124;y)$.  At least, I think it&#039;s more natural to imply bindings from underlying density measures than vice versa...

(sorry if this doesn&#039;t come out nicely latex-ified---I&#039;m no wordpress expert) [ed. I added the latex escape for you; all you needed was to put latex after the $ and before the first bit of LaTeX.]]]></description>
		<content:encoded><![CDATA[<p>I sometimes wonder why we don&#8217;t write:</p>
<p><img src='http://s0.wp.com/latex.php?latex=p_%7BX%7CY%7D+%3D+%5Cfrac%7Bp_%7BY%7CX%7D+%5C+p_X%7D%7Bp_Y%7D&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p_{X|Y} = &#92;frac{p_{Y|X} &#92; p_X}{p_Y}' title='p_{X|Y} = &#92;frac{p_{Y|X} &#92; p_X}{p_Y}' class='latex' /></p>
<p>This seems less heinous than <img src='http://s0.wp.com/latex.php?latex=p%28x%7Cy%29&amp;bg=ffffff&amp;fg=333333&amp;s=0' alt='p(x|y)' title='p(x|y)' class='latex' />.  At least, I think it&#8217;s more natural to imply bindings from underlying density measures than vice versa&#8230;</p>
<p>(sorry if this doesn&#8217;t come out nicely latex-ified&#8212;I&#8217;m no wordpress expert) [ed. I added the latex escape for you; all you needed was to put latex after the $ and before the first bit of LaTeX.]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: yolio</title>
		<link>http://lingpipe-blog.com/2009/10/13/whats-wrong-with-probability-notation/#comment-5703</link>
		<dc:creator><![CDATA[yolio]]></dc:creator>
		<pubDate>Thu, 22 Oct 2009 19:19:59 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe-blog.com/?p=2724#comment-5703</guid>
		<description><![CDATA[I agree that probability notation is tricky and maybe moreso than necessary. But I totally disagree that THIS is the obstacle to understanding probability that most people face.]]></description>
		<content:encoded><![CDATA[<p>I agree that probability notation is tricky and maybe moreso than necessary. But I totally disagree that THIS is the obstacle to understanding probability that most people face.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

