<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>azeem.azhar &#187; Google</title>
	<atom:link href="http://azeemazhar.wordpress.com/tag/google/feed/" rel="self" type="application/rss+xml" />
	<link>http://azeemazhar.wordpress.com</link>
	<description>technology &#124; finance &#124; other stuff</description>
	<lastBuildDate>Tue, 10 Mar 2009 09:20:42 +0000</lastBuildDate>
	<generator>http://wordpress.com/</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<cloud domain='azeemazhar.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://www.gravatar.com/blavatar/ee94dceafa7bcd5b72456758b31b3d09?s=96&#038;d=http://s.wordpress.com/i/buttonw-com.png</url>
		<title>azeem.azhar &#187; Google</title>
		<link>http://azeemazhar.wordpress.com</link>
	</image>
			<item>
		<title>Search engines today do what they currently do 100%!</title>
		<link>http://azeemazhar.wordpress.com/2008/09/07/search-engines-today-do-what-they-currently-do-100/</link>
		<comments>http://azeemazhar.wordpress.com/2008/09/07/search-engines-today-do-what-they-currently-do-100/#comments</comments>
		<pubDate>Sun, 07 Sep 2008 23:41:37 +0000</pubDate>
		<dc:creator>azeemazhar</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Web search query]]></category>

		<guid isPermaLink="false">http://azeemazhar.wordpress.com/?p=100</guid>
		<description><![CDATA[Image via CrunchBase, source unknown Mike Arrington is rightly dumbfounded to hear that search is 90% done. Marissa Mayer said as a much to the LA Times.
From one perspective&#8211;where you look at search and the solution to it as being indexing document and making statistical connections from queries to documents&#8211;perhaps the game nearly is done. [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=azeemazhar.wordpress.com&blog=70033&post=100&subd=azeemazhar&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p><span class="zemanta-img" style="float:right;display:block;margin:1em;"><a href="http://www.crunchbase.com/company/google"><img src="http://www.crunchbase.com/assets/images/resized/0000/1315/1315v2-max-250x250.jpg" alt="Image representing Google as depicted in Crunc..." style="border:medium none;display:block;"></a><span class="zemanta-img-attribution" style="display:block;margin:1em 0 0;">Image via <a href="http://www.crunchbase.com">CrunchBase</a>, source unknown </span></span>Mike Arrington is rightly <a href="http://www.techcrunch.com/2008/09/07/is-search-really-90-solved/">dumbfounded to hear that search is</a> 90% done. <a href="http://latimesblogs.latimes.com/technology/2008/09/marissa-mayer-t.html">Marissa Mayer</a> said as a much to the LA Times.</p>
<p>From one perspective&#8211;where you look at search and the solution to it as being indexing document and making statistical connections from queries to documents&#8211;perhaps the game nearly is done. In other words, <strong>if you measure the success of search engines against the yardstick of what they already do, then I guess they are there.</strong></p>
<p>In the same way that the grand ocean liners were the pinnacle, nay the 95% done of transatlantic travel, until, er, the jet age. Or that Bessie, my trusty shire horse, was the best way of getting from London to Windsor on a rainy day.</p>
<p>This is setting up for a classic Clayton Christensen disruption where the way in which the problem is framed limits the way in which resources are allocated to crack the problem (from an R&amp;D stand point) and then suffocates it (from a commercialisation standpoint).</p>
<p>Forgetting for a moment what other companies are trying, start from the stand point of the end user.</p>
<p>I have been at internet user for seventeen years (nearly half my life), I remember the days of Archie, Veronica, Gopher (but no Jughead). And having worked with two search companies, even today I will struggle with certain types of search queries:<br />
<span id="more-100"></span><br />
Remember a search query is not the thing that Google/Altavista/Webcrawler does&#8211;which is to try to statistically match a string of words in a query to similar words in a document. A search query is a request for information, it is me as a human (or me as a cyborg or sentient machine) looking to find something out. The seach query&#8211;the bit I type in to the search bar or shout at my assistant is merely my attempt to communicate the representation of the request I have in my head. </p>
<p>And frankly there is no way that search is 90% solved when it comes to that customer problem. Just today, things I was looking for but couldn&#8217;t find.</p>
<ul>
<li><strong>I need to find someone to come round to my house and fix my bike.</strong> There I said it. But I sure as hell haven&#8217;t been able to Google, Gumtree or Craigslist it.
</li>
<li><strong>I can&#8217;t get Xgrid Controller to start on Leopard</strong>. I can<a href="http://www.google.co.uk/search?q=xgrid+controller+timeouts&amp;ie=utf-8&amp;oe=utf-8&amp;aq=t&amp;rls=org.mozilla:en-US:official&amp;client=firefox-a"> google it<br />
</a> but I can&#8217;t get anything useful. I know someone has some wisdom about this but where?
</li>
<li><strong>I need to find some cool dresses for my <a href="http://flickr.com/photos/36259387778@N01/2836955493/">daughter&#8217;s</a> first birthday party</strong> (for her, not me), and frankly the unthrilling prospect of an hours iterative searching through unfamiliar fashion blogs and kids clothes stores does not seem like a solution.
</li>
<li><strong>I wanted to find out the nutritional value of the food I ate at a party today</strong> for my food log. Randomly catered food, strange portions? Nadda di nadda from Google</li>
</ul>
<p>So from the perspective of solving the problem that search engines currently solve today, search is pretty much solved, better than 90%. But who are we kidding? What they currently solve today is some weird, sub-human information retrieval problem.</p>
<p>Yes. It is better than<a href="http://www.biblio-tech.com/html/z39_50.html"> Z39.50</a> searching. <img alt="" src="http://www.oclc.org/americalatina/pt/support/documentation/z3950/searchtips/images/Option1_01.gif" title="Z39 50 a librarians dream" class="alignright" height="442" width="589"></p>
<p>But let&#8217;s not get confused the relief that it isn&#8217;t completely atrocious with the reality that we just aren&#8217;t done yet. <a href="http://en.wikipedia.org/wiki/Learned_helplessness">Learned helplessness</a> is a horrible thing, and ten minutes observing someone trying to solve some real problems will find a large class of them is just unresolvable on today&#8217;s Internet. </p>
<p>But they will be resolved.</p>
<p>Who is interesting in this space?<br />
<a href="http://www.trueknowledge.com/">True Knowledge </a>(usual disclosures apply) : totally unique approach to solving the fact-oriented research problem.  (Ping me on <a href="http://twitter.com/azeem">Twitter</a> if you want a beta, give me your email address)</p>
<li><a href="http://www.freebase.com">Freebase</a> is another good example but has yet to crack the hard bit of figuring out what a user&#8217;s intent is and how to translate that to a query
</li>
<li><a href="http://www.twitter.com/">Twitter</a> has potential, because you can fling your request into the information stream and sometimes get a human to come back with you. <a href="http://www.twine.com/">Twine</a> might fall into the same category. Although I question whether Twitter can scale enough to handle this and whether we&#8217;ll ever understand Twine.
<p>The raft of semantic layer companies out there will no doubt help. Vertical search can also help, although someone has to help me find the right vertical search engine.  I doubt whether the visualisation guys are close to changing the paradigm although they might be a nice bit of polish.</p>
<p>In the meantime, I have four queries up top if anyone wants to help me with them&#8230;</li>
<div style="margin-top:10px;height:15px;" class="zemanta-pixie"><a class="zemanta-pixie-a" href="http://reblog.zemanta.com/zemified/fc517858-9e0c-45e6-aac1-50cf859734d9/" title="Zemified by Zemanta"><img style="border:medium none;float:right;" class="zemanta-pixie-img" src="http://img.zemanta.com/reblog_e.png?x-id=fc517858-9e0c-45e6-aac1-50cf859734d9" alt="Reblog this post [with Zemanta]"></a></div>
<img alt="" border="0" src="http://feeds.wordpress.com/1.0/categories/azeemazhar.wordpress.com/100/" /> <img alt="" border="0" src="http://feeds.wordpress.com/1.0/tags/azeemazhar.wordpress.com/100/" /> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/azeemazhar.wordpress.com/100/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/azeemazhar.wordpress.com/100/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/azeemazhar.wordpress.com/100/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/azeemazhar.wordpress.com/100/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/azeemazhar.wordpress.com/100/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/azeemazhar.wordpress.com/100/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/azeemazhar.wordpress.com/100/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/azeemazhar.wordpress.com/100/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/azeemazhar.wordpress.com/100/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/azeemazhar.wordpress.com/100/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=azeemazhar.wordpress.com&blog=70033&post=100&subd=azeemazhar&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://azeemazhar.wordpress.com/2008/09/07/search-engines-today-do-what-they-currently-do-100/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/3170a6fe67050cef735621d7aeb523f5?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">azeemazhar</media:title>
		</media:content>

		<media:content url="http://www.crunchbase.com/assets/images/resized/0000/1315/1315v2-max-250x250.jpg" medium="image">
			<media:title type="html">Image representing Google as depicted in Crunc...</media:title>
		</media:content>

		<media:content url="http://www.oclc.org/americalatina/pt/support/documentation/z3950/searchtips/images/Option1_01.gif" medium="image">
			<media:title type="html">Z39 50 a librarians dream</media:title>
		</media:content>

		<media:content url="http://img.zemanta.com/reblog_e.png?x-id=fc517858-9e0c-45e6-aac1-50cf859734d9" medium="image">
			<media:title type="html">Reblog this post [with Zemanta]</media:title>
		</media:content>
	</item>
	</channel>
</rss>