<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: The Powerset Blogstorm: 1 week later</title>
	<atom:link href="http://www.barneypell.com/2006/10/the-powerset-blogstorm-1-week-later/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.barneypell.com/2006/10/the-powerset-blogstorm-1-week-later/</link>
	<description></description>
	<lastBuildDate>Fri, 07 May 2010 13:32:55 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
	<item>
		<title>By: jshrager</title>
		<link>http://www.barneypell.com/2006/10/the-powerset-blogstorm-1-week-later/comment-page-1/#comment-46</link>
		<dc:creator>jshrager</dc:creator>
		<pubDate>Fri, 13 Oct 2006 00:29:15 +0000</pubDate>
		<guid isPermaLink="false">http://174.120.172.92/~barneype/?p=77#comment-46</guid>
		<description>I think that some important points have been missed by all parties in this discussion. First, keywords (more general keyword logics) are the way we have now of doing &quot;trick semantics&quot;. In order to understand this, one has to understand that search is a two-sided coin; there is the query, and there is the database. Let&#039;s take it for the moment that the database is The Web. The oft-overlooked point is that NEITHER a query NOR the database are expressed in pure semantics. Put in more plain terms: You don&#039;t write what you think, nor does the person who write the web page you&#039;re looking for; rather, you both use a surface code (NL) that bears a complex relationship to what what one &quot;really means&quot;. Keywords are a way that we have learned to see through this problem, and expert searches know how to do it. Suppose, for example, that you&#039;ve interested in when marsupials first diverged from mammals. I have several options: I could write: &quot;When did marsupials diverge from mammals?&quot; If these were just interpreted as keywords, then if there happens to be a paper that contained the same words, I might win. But let&#039;s assume that that&#039;s not the case. Instead, there might be a paper that discusses (perhaps over a paragraph): &quot;The evolution of kangaroos, and when they split off from the bears, their nearest mammalian counterparts.&quot; (I&#039;m making this up, I doubt that kangaroos and bears are closely related!) Here&#039;s where the two headed coin comes in: You need to not only understand my query in some semantics terms, but you also need to understand the databse (that is, the sentences in the web pages) in similar semantic terms. Now, expertise in keyword search usage is, I claim, NOT merely a matter of knowledge of natural language, but a specalized skill that involves NL, but also involves understanding how to flip that double-headed coin -- that is, how to take a query, in whatever terms, and create a new query that is likely to be able to find the appropriate things on the other side of the coin. This is not, I claim, the same as, nor even closely related to natural langauge processing, and indeed, I hypothesize that skill in NL is not highly correlated with it. (Everyone in my lab can speak several languages, but everyone in my lab comes to me because I can find things they can&#039;t on google!) My hypothesis is that the step from NL queries to semantics is actually NOT the critical step in this process. Rather it is the step from Query-semantics (now approximated by keywords) to the other side of the coin: DB-semantics (now approximated by index terms), and the figuring out how to do THIS is going to be the critical step. I have a theory of how to do this... which I&#039;ll tell you if you hire me! ;-)
Cheers,
&#039;Jeff
</description>
		<content:encoded><![CDATA[<p>I think that some important points have been missed by all parties in this discussion. First, keywords (more general keyword logics) are the way we have now of doing &#8220;trick semantics&#8221;. In order to understand this, one has to understand that search is a two-sided coin; there is the query, and there is the database. Let&#8217;s take it for the moment that the database is The Web. The oft-overlooked point is that NEITHER a query NOR the database are expressed in pure semantics. Put in more plain terms: You don&#8217;t write what you think, nor does the person who write the web page you&#8217;re looking for; rather, you both use a surface code (NL) that bears a complex relationship to what what one &#8220;really means&#8221;. Keywords are a way that we have learned to see through this problem, and expert searches know how to do it. Suppose, for example, that you&#8217;ve interested in when marsupials first diverged from mammals. I have several options: I could write: &#8220;When did marsupials diverge from mammals?&#8221; If these were just interpreted as keywords, then if there happens to be a paper that contained the same words, I might win. But let&#8217;s assume that that&#8217;s not the case. Instead, there might be a paper that discusses (perhaps over a paragraph): &#8220;The evolution of kangaroos, and when they split off from the bears, their nearest mammalian counterparts.&#8221; (I&#8217;m making this up, I doubt that kangaroos and bears are closely related!) Here&#8217;s where the two headed coin comes in: You need to not only understand my query in some semantics terms, but you also need to understand the databse (that is, the sentences in the web pages) in similar semantic terms. Now, expertise in keyword search usage is, I claim, NOT merely a matter of knowledge of natural language, but a specalized skill that involves NL, but also involves understanding how to flip that double-headed coin &#8212; that is, how to take a query, in whatever terms, and create a new query that is likely to be able to find the appropriate things on the other side of the coin. This is not, I claim, the same as, nor even closely related to natural langauge processing, and indeed, I hypothesize that skill in NL is not highly correlated with it. (Everyone in my lab can speak several languages, but everyone in my lab comes to me because I can find things they can&#8217;t on google!) My hypothesis is that the step from NL queries to semantics is actually NOT the critical step in this process. Rather it is the step from Query-semantics (now approximated by keywords) to the other side of the coin: DB-semantics (now approximated by index terms), and the figuring out how to do THIS is going to be the critical step. I have a theory of how to do this&#8230; which I&#8217;ll tell you if you hire me! <img src='http://www.barneypell.com/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' /><br />
Cheers,<br />
&#8216;Jeff</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: JoeDuck</title>
		<link>http://www.barneypell.com/2006/10/the-powerset-blogstorm-1-week-later/comment-page-1/#comment-45</link>
		<dc:creator>JoeDuck</dc:creator>
		<pubDate>Thu, 12 Oct 2006 15:04:30 +0000</pubDate>
		<guid isPermaLink="false">http://174.120.172.92/~barneype/?p=77#comment-45</guid>
		<description>I&#039;m bullish about your prospects Bernie but don&#039;t think that 20% poll means much.  More people than that simply *want* Google to falter.  But best of luck.
Also: We need MUCH better search paradigms, so hurry up dude!
</description>
		<content:encoded><![CDATA[<p>I&#8217;m bullish about your prospects Bernie but don&#8217;t think that 20% poll means much.  More people than that simply *want* Google to falter.  But best of luck.<br />
Also: We need MUCH better search paradigms, so hurry up dude!</p>
]]></content:encoded>
	</item>
</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Served from: www.barneypell.com @ 2012-02-07 06:03:48 -->
