<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: Computational Linguistics Curriculum</title>
	<atom:link href="http://lingpipe-blog.com/2008/10/13/computational-linguistics-curriculum/feed/" rel="self" type="application/rss+xml" />
	<link>http://lingpipe-blog.com/2008/10/13/computational-linguistics-curriculum/</link>
	<description>Natural Language Processing and Text Analytics</description>
	<lastBuildDate>Sat, 04 Feb 2012 20:56:48 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: Bob Carpenter</title>
		<link>http://lingpipe-blog.com/2008/10/13/computational-linguistics-curriculum/#comment-2968</link>
		<dc:creator><![CDATA[Bob Carpenter]]></dc:creator>
		<pubDate>Tue, 28 Oct 2008 21:57:23 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe.wordpress.com/?p=225#comment-2968</guid>
		<description><![CDATA[Teo:

This is a great area for projects because there&#039;s so much free data.  Or you could create some of your own data.  Projects can be more or less linguistically oriented.  

At the simplest level, you could just throw an existing tool at a new corpus.  Say, our latent Dirichlet allocation clustering applied to some language or domain other than English news.  Or even simpler, topic extraction over time.

Or you could create your own named entity corpus using our annotation tool and use it to build a named entity extractor.

You could also follow my instructions and build a Chinese search engine, or a Thai or Turkish one for that matter.

Take a look through the ACL proceedings -- there are lots of student papers and poster write-ups in there.  I&#039;d aim at getting something you could submit to the ACL student session as a concrete goal.  Assuming you&#039;re going to write it in English, that is.

At the most complex level, you could try to take on a whole new problem or develop a new model for an existing problem.  That&#039;s much harder, though.]]></description>
		<content:encoded><![CDATA[<p>Teo:</p>
<p>This is a great area for projects because there&#8217;s so much free data.  Or you could create some of your own data.  Projects can be more or less linguistically oriented.  </p>
<p>At the simplest level, you could just throw an existing tool at a new corpus.  Say, our latent Dirichlet allocation clustering applied to some language or domain other than English news.  Or even simpler, topic extraction over time.</p>
<p>Or you could create your own named entity corpus using our annotation tool and use it to build a named entity extractor.</p>
<p>You could also follow my instructions and build a Chinese search engine, or a Thai or Turkish one for that matter.</p>
<p>Take a look through the ACL proceedings &#8212; there are lots of student papers and poster write-ups in there.  I&#8217;d aim at getting something you could submit to the ACL student session as a concrete goal.  Assuming you&#8217;re going to write it in English, that is.</p>
<p>At the most complex level, you could try to take on a whole new problem or develop a new model for an existing problem.  That&#8217;s much harder, though.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Teo D'Smyrni</title>
		<link>http://lingpipe-blog.com/2008/10/13/computational-linguistics-curriculum/#comment-2966</link>
		<dc:creator><![CDATA[Teo D'Smyrni]]></dc:creator>
		<pubDate>Tue, 28 Oct 2008 19:04:44 +0000</pubDate>
		<guid isPermaLink="false">http://lingpipe.wordpress.com/?p=225#comment-2966</guid>
		<description><![CDATA[Thanks, I got a lot of useful info here. Could you suggest a topic for a senior project in this area? Thanks in advance.]]></description>
		<content:encoded><![CDATA[<p>Thanks, I got a lot of useful info here. Could you suggest a topic for a senior project in this area? Thanks in advance.</p>
]]></content:encoded>
	</item>
</channel>
</rss>

