<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Sanjay Kairam &#187; social search</title>
	<atom:link href="http://www.sanjaykairam.com/blog/tag/social-search/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.sanjaykairam.com/blog</link>
	<description>Graduate Student &#38; Armchair Philosopher</description>
	<lastBuildDate>Thu, 19 Jan 2012 23:09:09 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Do You Know Jules Verne? What&#8217;s He Like?</title>
		<link>http://www.sanjaykairam.com/blog/2011/02/do-you-know-jules-verne-whats-he-like/</link>
		<comments>http://www.sanjaykairam.com/blog/2011/02/do-you-know-jules-verne-whats-he-like/#comments</comments>
		<pubDate>Thu, 03 Feb 2011 01:10:38 +0000</pubDate>
		<dc:creator>skairam</dc:creator>
				<category><![CDATA[/Matter]]></category>
		<category><![CDATA[aardvark]]></category>
		<category><![CDATA[credibility]]></category>
		<category><![CDATA[David Pogue]]></category>
		<category><![CDATA[MG Siegler]]></category>
		<category><![CDATA[online communities]]></category>
		<category><![CDATA[Pogue's Posts]]></category>
		<category><![CDATA[Q&A Sites]]></category>
		<category><![CDATA[Quora]]></category>
		<category><![CDATA[social]]></category>
		<category><![CDATA[social computing]]></category>
		<category><![CDATA[social internet]]></category>
		<category><![CDATA[social search]]></category>
		<category><![CDATA[TechCrunch]]></category>
		<category><![CDATA[web 2.0]]></category>
		<category><![CDATA[Web Startups]]></category>
		<category><![CDATA[wikipedia]]></category>

		<guid isPermaLink="false">http://www.sanjaykairam.com/blog/?p=280</guid>
		<description><![CDATA[The buzz around Q&#038;A startup Quora has been building steadily over the past couple of months. I measure this not only by the number of Follow messages received concerning people randomly sampled from my Facebook connections which are now flooding my inbox, but also by the heated debate that is developing about the site's usefulness, much of which is chronicled in this TechCrunch article about the "Quora Backlash Backlash". ]]></description>
			<content:encoded><![CDATA[<p>The buzz around Q&amp;A startup <a title="Quora - Home Page" href="http://quota.com" target="_blank">Quora</a> has been building steadily over the past couple of months. I measure this not only by the number of Follow messages now flooding my inbox concerning people randomly sampled from my Facebook connections, but also by the heated debate that is developing about the site&#8217;s usefulness, much of which was chronicled in this TechCrunch article about the &#8220;<a title="TechCrunch - Quora Backlash Backlash" href="http://techcrunch.com/2011/01/31/quora-quora-quora-quora-quora-quora-quora/" target="_blank">Quora Backlash Backlash</a>&#8220;.</p>
<p>Based on this post and other comments, it is fairly clear that MG Siegler is on &#8220;Team Quora&#8221;, calling Quora &#8220;a great source of information like Twitter and Facebook and blogs themselves.&#8221; Having been a Quora user for several months, I have found the quality of answers on the site to be extremely high. These have ranged from the much-celebrated cases of high-profile individuals answering questions about topics pertaining to them (e.g. Netflix CEO Reed Hastings answering the question &#8220;<a title="Quora - How Much Does Netflix Spend on Postage Each Year?" href="http://www.quora.com/Netflix/How-much-does-Netflix-spend-on-postage-each-year?" target="_blank">How much does Netflix spend on postage every year?</a>&#8220;) to the opportunities for creative individuals to answer questions in awesome and innovative ways (see Wavii programmer <a title="Erik Frey" href="http://fawx.com/" target="_blank">Erik Frey</a>&#8216;s answer to the question &#8220;<a title="Quora - Which animal has been used most frequently for a band name?" href="http://www.quora.com/Which-animal-has-been-used-most-frequently-for-a-band-name" target="_blank">Which animal has been used most frequently for a band name?</a>&#8220;). I have also personally asked a number of questions and gotten timely and high-quality answers.</p>
<p>However, the key question here is whether the site will continue to be as useful as more and more people join. While many might argue that including more subject matter experts can only improve the site, one must also remember that this increased signal is only useful when it can be separated from the increased noise. Right now, Quora is a bit like &lt;nerd alert&gt;Flynn&#8217;s cave dwelling in the Outlands, where the information contained is only safe as long as the masses can&#8217;t get to it&lt;/nerd alert&gt;; users can trust the answers they find because they often come paired with often famous or at least recognizable names and faces. Yahoo! Answers is a great example of how a Q&amp;A site can decrease in quality with respect to both the questions and answers as it opens up (for a quick, possibly NSFWish laugh, check out &#8220;<a title="11 Points - 11 Stupid..." href="http://www.11points.com/Web-Tech/11_Stupid_Questions_From_Yahoo_Answers_That_Have_Changed_My_Life" target="_blank">11 Stupid Questions from Yahoo Answers That Have Changed My Life</a>&#8220;) Even if you develop the most robust social answer-quality-checker imaginable, the presence of thousands of stupid questions tagged with topic tags that direct them to your inbox is going to turn a lot of the quality answer providers away from the site. In some ways, I felt that Aardvark fell into that trap as it grew more popular, and I now find myself answering a lot of questions that involved identifying rashes.</p>
<p>I had this question about the possible perils of mainstream adoption in mind, when reading a <a title="NY Times: Pogue's Posts - Quora Raises Questions" href="http://pogue.blogs.nytimes.com/2011/02/01/quora-raises-questions/" target="_blank">NY Times blog post</a> this week, in which David Pogue describes his first interactions with the site as &#8220;a descent into bafflement.&#8221; Among the parts of the site that he deems confusing are the login process, the task of adding connections and following topics, and the actual task of asking a question. For chronicling his confusion, however, he earned the following response from Siegler (<a title="ParisLemon - Is this Quora or the Laundromat?" href="http://parislemon.com/post/3065294483/is-this-quora-or-the-laundromat" target="_blank">on his personal blog</a>):</p>
<blockquote><p>Is this meant to be written from the perspective of a 95-year-old senile man?</p>
<p>Apparently, every site should be designed in a way so that it’s just  like every other site that failed before it on the Internet. Makes  perfect sense.</p>
<p>Prediction: he’ll love Quora in 12 months.</p></blockquote>
<p>Now, I would consider David Pogue to be a relatively tech-savvy individual. He&#8217;s been blogging about Web/technology topics for over 10 years now, and I generally find his posts to be pretty interesting and insightful. If he is having this much difficulty using the site, I think that his frustrations really do say something about Quora&#8217;s current potential to reach beyond the geek crowd into the general public . And on some level, I think that the desire to mock him for this reflects an underlying recognition that the Silicon Valley influentials who currently use the site wouldn&#8217;t actually benefit from &#8220;regular people&#8221; using it, since this could potentially spell the end of Quora&#8217;s usefulness. Perhaps a better use of time might be to consider how to make the site more user-friendly and how to maintain the quality as new users arrive.</p>
<p>Studying sites like <a title="Wikipedia - Home" href="http://en.wikipedia.org" target="_blank">Wikipedia</a> is a great way to examine how to maintain quality while scaling out to a broader audience. While it&#8217;s difficult to ascertain what actually makes Wikipedia work (<a title="Edge 2008: Kevin Kelly" href="http://www.edge.org/q2008/q08_6.html#kelly" target="_blank">in practice, if not in theory</a>), many of the qualities inherent in Wikipedia are those identified in a 2005 paper [1] by J.M. Leimeister and colleagues at the <a title="TUM Home" href="http://portal.mytum.de/welcome/" target="_blank">Technische Universität München</a> as factors which promote community success in an environment where trusted information is key. Some of these include exposing the identity of content providers, clearly establishing goals for the community, making member profiles available to other members, and providing various levels of anonymity, all of which are things which are built into Wikipedia&#8217;s core. Another feature recognized across the literature is recognition and rewards for contributors, something embodied in Wikipedia in the form of &#8220;<a title="Wikipedia - Barnstars" href="http://en.wikipedia.org/wiki/Wikipedia:Barnstars" target="_blank">barnstars</a>&#8220;. Quora does a great job of exposing identity (which will likely make it much better than Yahoo! Answers), but I don&#8217;t believe that it adequately addresses these other elements. I think that the addition of &#8220;moderators&#8221; or other custodian roles for hyper-motivated users could be the kind of thing that keeps the Quora community in check, and I would be eager to see them roll something like that out before opening the site up to the general public (I think it&#8217;s technically still in beta, no?)</p>
<p>Quora is a great source of information for me in some of the same ways that Facebook and Twitter are. The key difference is that in those media, I can control what I see and who it comes from. If you want to imagine the utility of those sites without such controls, just imagine trying to sift through the real-time Twitter stream &#8211; it&#8217;s mind-numbing, to be kind (unless of course you are a Justin Bieber fan, in which case you should be delighted). The goals of those who currently enjoy Quora&#8217;s usefulness should be to help maintain that usefulness as the site grows. I&#8217;m not saying that this will be an easy task, but if it isn&#8217;t accomplished, I don&#8217;t think that anyone is going to love the site in 12 months.</p>
<p>[1] <a title="ACM Digital Library" href="http://portal.acm.org/citation.cfm?id=1277723" target="_blank">Leimeister, J.M., Ebner, W., &amp; Krcmar, H. 2005: Design, implementation, and evaluation of trust-supporting components in virtual communities for patients. <em>Journal of Management Information Systems 21</em>, 4, 101-135.</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.sanjaykairam.com/blog/2011/02/do-you-know-jules-verne-whats-he-like/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Rise of GoogVark</title>
		<link>http://www.sanjaykairam.com/blog/2010/02/the-rise-of-googvark/</link>
		<comments>http://www.sanjaykairam.com/blog/2010/02/the-rise-of-googvark/#comments</comments>
		<pubDate>Thu, 11 Feb 2010 23:33:21 +0000</pubDate>
		<dc:creator>skairam</dc:creator>
				<category><![CDATA[/Matter]]></category>
		<category><![CDATA[aardvark]]></category>
		<category><![CDATA[business]]></category>
		<category><![CDATA[buzz]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[social search]]></category>

		<guid isPermaLink="false">http://www.sanjaykairam.com/blog/?p=143</guid>
		<description><![CDATA[So, in a seemingly inevitable, but nonetheless surprising move, Google has purchased Aardvark for $50 million. My last blog post was about Aardvark's recent paper describing their social search engine, which included allusions to the research paper which was responsible for the creation of Google, so the announcement seems timely.]]></description>
			<content:encoded><![CDATA[<p>So, in a seemingly inevitable, but nonetheless surprising move, <a title="TechCrunch" href="http://techcrunch.com/2010/02/11/google-acquires-aardvark-for-50-million/" target="_blank">Google has purchased Aardvark for $50 million</a>. My <a title="Sanjay Kairam - Commons Sense" href="http://www.sanjaykairam.com/blog/2010/02/anatomy-of-a-paper-about-a-large-scale-social-search-engine/">last blog post</a> was about Aardvark&#8217;s recent <a title="Aardvark Blog" href="http://blog.vark.com/?p=352" target="_blank">paper</a> describing their social search engine, which included allusions to the research paper which was <a title="Stanford InfoLab - PageRank" href="http://infolab.stanford.edu/~backrub/google.html">responsible for the creation of Google</a>, so the announcement seems timely.</p>
<p>Given Google&#8217;s recent social efforts (Twitter Search, Social Search, Google Buzz, etc.), I am curious to see what they will do with the Aardvark product &#8211; will it stand alone as it has or will it find its way into existing or new Google tools? I, for one, would love to see it integrated into Google&#8217;s main search. One consequence of <a title="Google Blog - Introducing Google Buzz" href="http://googleblog.blogspot.com/2010/02/introducing-google-buzz.html" target="_blank">Google&#8217;s recent launch of Buzz</a> is reminding people that Google has been collecting data on your social network for a while now. If Aardvark were integrated into your Google network, we&#8217;d have a out-of-the-box solution for social search (no messy profile-connecting or friend-inviting needed! To me, it seems like one of the biggest hurdles for most people in terms of social search or networking tools is the cost of building up their networks, so this would provide a quick and easy way around that.</p>
<p>What will GoogVark look like? I can&#8217;t say I&#8217;ve ever used the &#8220;I&#8217;m Feeling Lucky&#8221; button, but I do know that there are times when I can&#8217;t quite find the best answers through Google search, and I&#8217;d love to be able to seamlessly shift over to social search. I personally would love to see something like this (with an example supplied by Google itself!):</p>
<a href="http://www.sanjaykairam.com/blog/wp-content/uploads/2010/02/googvark.jpg"><img class="size-full wp-image-144" title="GoogVark" src="http://www.sanjaykairam.com/blog/wp-content/uploads/2010/02/googvark.jpg" alt="GoogVark" width="449" height="247" /></a>
<p>Would this make social search more inviting to you?</p>
<p><em>P.S. Congratulations to Aardvark&#8217;s founders over at The Mechanical Zoo &#8211; you guys deserve it!</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.sanjaykairam.com/blog/2010/02/the-rise-of-googvark/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Anatomy of a Paper about a Large-Scale Social Search Engine</title>
		<link>http://www.sanjaykairam.com/blog/2010/02/anatomy-of-a-paper-about-a-large-scale-social-search-engine/</link>
		<comments>http://www.sanjaykairam.com/blog/2010/02/anatomy-of-a-paper-about-a-large-scale-social-search-engine/#comments</comments>
		<pubDate>Fri, 05 Feb 2010 21:43:22 +0000</pubDate>
		<dc:creator>skairam</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[aardvark]]></category>
		<category><![CDATA[google]]></category>
		<category><![CDATA[PageRank]]></category>
		<category><![CDATA[papers]]></category>
		<category><![CDATA[research]]></category>
		<category><![CDATA[social]]></category>
		<category><![CDATA[social search]]></category>
		<category><![CDATA[the mechanical zoo]]></category>
		<category><![CDATA[WWW]]></category>

		<guid isPermaLink="false">http://www.sanjaykairam.com/blog/?p=135</guid>
		<description><![CDATA[Earlier this week, the team at Aardvark unveiled a new paper "The Anatomy of a Large-Scale Social Search Engine" which will be presented in April at WWW 2010. Inspired by and patterned after "The Anatomy of a Large-Scale Hypertextual Web Search Engine", which describes the PageRank algorithm which drives Google's search ranking system (which as Aardvark's blog points out, was also presented at WWW 12 years ago). The paper, by Aardvark's Damon Horowitz and Stanford's Sep Kamvar, focuses mostly on the architecture of the Aardvark system, from the external representations with which users interact to the internal ranking algorithms on which the system runs. Below, I present a short summary of what they report, focusing on the elements I found most interesting.]]></description>
			<content:encoded><![CDATA[<p>Earlier this week, the team at Aardvark unveiled a new paper &#8220;<a title="Aardvark Blog - Anatomy of a Large-Scale Social Search Engine" href="http://blog.vark.com/?p=352" target="_blank">The Anatomy of a Large-Scale Social Search Engine</a>&#8221; which will be presented in April at <a title="WWW2010 - Home" href="http://www2010.org/www/" target="_blank">WWW 2010</a>. Inspired by and patterned after &#8220;<a title="Stanford InfoLab - Google" href="http://infolab.stanford.edu/~backrub/google.html">The Anatomy of a Large-Scale Hypertextual Web Search Engine</a>&#8220;, which describes the <a title="Wikipedia - PageRank" href="http://en.wikipedia.org/wiki/PageRank" target="_blank">PageRank</a> algorithm which drives Google&#8217;s search ranking system (which as Aardvark&#8217;s blog points out, was also presented at WWW 12 years ago).</p>
<p>The paper, by Aardvark&#8217;s Damon Horowitz and Stanford&#8217;s Sep Kamvar, focuses mostly on the architecture of the Aardvark system, from the external representations with which users interact to the internal ranking algorithms on which the system runs. Below, I present a short summary of what they report, focusing on the elements I found most interesting:</p>
<p><strong>The Basic Model</strong>: Aardvark&#8217;s scoring function is similar to PageRank in that both utilize two primary, but somewhat independently considered components: <em>relevance</em> and <em>quality</em>.</p>
<ul>
<li><em>Relevance</em> in the Aardvark model pertains to the probability that a particular user <em>i</em> can answer the given question <em>q</em> based on the identified topics contained in <em>t</em>.</li>
<li><em>Quality</em> in the Aardvark model pertains to the overall probability that a user <em>i</em> can return a satisfactory answer to another user <em>j</em>, regardless of the question.</li>
</ul>
<p><strong>Indexing Topics:</strong> Aardvark computes the relevance score by calculating a distribution of knowledge over topics known by the user using the following sources (keyword-y sounding italicized terms are for convenience only and are not used in the paper):</p>
<ul>
<li><em>Explicit Prompting</em> at sign-up for three &#8220;starter&#8221; topics about which the user has expertise.</li>
<li><em>Social Prompting</em> of a user&#8217;s friends to provide topics about which they trust the user&#8217;s opinion.</li>
<li><em>Structured Parsing</em> of the online profile pages connected to Aardvark by the user (e.g. &#8220;Interests&#8221; on a Facebook profile).</li>
<li><em>Unstructured Parsing</em> of the users&#8217; online homepage, blog, or status updates using a linear SVM to extract overall subject area and a named entity extractor to extract more specific topics.</li>
</ul>
<p><strong>Indexing Connections:</strong> Aardvark computes the quality score by building a set of weighted connections between users using characteristics ranging from social proximity to similarities in demographics or behavior, such as:</p>
<ul>
<li><em>Social Connections</em> either in the form of explicitly defined &#8220;friend&#8221; connections or implicit &#8220;network&#8221; connections, such as both being part of the Stanford network.</li>
<li><em>Demographic Similarity</em>, which likely includes age, gender, and location based on profile information collected by Aardvark.</li>
<li><em>Profile Similarity</em>, which seems to include similar movies and other items which might be listed on other profiles, such as Facebook.</li>
<li><em>Vocabulary Match</em>, which they explain with the example of &#8220;IM Shortcuts&#8221; (i.e. I assume this means it is based on the language you use to interact with Aardvark, but I am unsure.)</li>
<li><em>Chattiness and Verbosity Match</em>, which relate to frequency and length of messages used when interacting with Aardvark.</li>
<li><em>Politeness Match</em>, which basically seems to mean whether or not say &#8220;Thanks!&#8221; or not.</li>
<li><em>Speed Match</em>, which is a measure of responsiveness to other users.</li>
</ul>
<p><strong>Analyzing Questions:</strong> While all of the other components are pre-computed, this part is computed at question time (obviously). The utilize a number of classifiers to classify the question and then a set of mappers to map the question to a set of topics, noting that &#8220;the role of the Question Analyzer&#8230;is simply to learn enough about the qeustion that it may be sent to appropriately interested and knowledgeable human answerers&#8221;. Here are the classifiers they list (with the names used in the paper):</p>
<ul>
<li><em>NonQuestionClassifier:</em> Determines if input is a valid question.</li>
<li><em>InappropriateQuestionClassifier:</em> Determines if input is obscene, spam, or otherwise unsuitable for asking.</li>
<li><em>TrivialQuestionClassifier:</em> Determines if input is a simple factual question (examples given: &#8220;What time is it now?&#8221;, &#8220;What is the weather?&#8221;). If so, the user gets an automatically generated answer via traditional web search.</li>
<li><em>LocationSensitiveClassifier:</em> Determines if the question contains location information; if it does, it passes that information along to the Routing Engine</li>
</ul>
<ul>
<li><em>KeywordMatchTopicMapper:</em> Checks for string matches against user profile topics (the mapper attempts to classify meaningful vs. spurious matches).</li>
<li><em>TaxonomyTopicMapper:</em> Classifies question text using an SVM trained on an &#8220;annotated corpus of several million questions&#8221; (<strong>where did they find that?</strong>)</li>
<li><em>SalientTermTopicMapper:</em> Extracts salient phrases using a noun-phrase chunker and tf-idf and finds &#8220;semantically similar user topics&#8221;.</li>
<li><em>UserTagTopicMapper:</em>Utilizes tags explicitly provided by the asker or other answerers and maps them to user topics.</li>
</ul>
<p>This description of the routing algorithm comprises the main function of the paper. After some more description of how users interact with the system, the authors provide some interesting data collected over the past several months of use (from the beta launch in March 2009 until October 2009).  Here&#8217;s a quick run-down of the more interesting facts that they presented:</p>
<ul>
<li><em>Strong User Growth: </em>As of October 2009, they reported 90,361 user accounts, and users appear to be remaining active (in the study period, over 1/2 the users actively generated content and over 2/3 of the users passively participated).</li>
</ul>
<div id="attachment_139" class="wp-caption aligncenter" style="width: 402px"><a href="http://www.sanjaykairam.com/blog/wp-content/uploads/2010/02/aardvarkusers.png"><img class="size-full wp-image-139" title="Aardvark User Growth" src="http://www.sanjaykairam.com/blog/wp-content/uploads/2010/02/aardvarkusers.png" alt="Aardvark User Growth" width="392" height="331" /></a><p class="wp-caption-text">User Growth on Aardvark (graph taken from the paper).</p></div>
<ul>
<li><em>Higher Query Contextualization:</em> Aardvark queries average 18.6 words in length while the average query length reported for web search is between 2.2 and 2.9 words (citing previous comparison and characterization studies).  They further state that &#8220;98.1% of questions are unique&#8221;, though I am unsure as to how exact they are being about matching (I am sure the question &#8220;What&#8217;s a great restaurant in SF&#8221; has been asked 1000 times in different forms). In addition, they report from manual scoring of 1000 randomly selected questions that 64.7% of questions asked have a subjective element, with advice about travel, restaurants, and products being specifically popular.</li>
<li><em>Fast, High-Quality Answers:</em> They report that 87.7% of questions get answers and 57.2% received an answer within 10 minutes. They report that 70.4% of answers receiving feedback are rated as &#8220;good&#8221; and only 15.5% are rated as &#8220;bad&#8221;. Interestingly, they observe a notable difference in feedback on answers from users within the asker&#8217;s social network (76% rated as food) and outside the asker&#8217;s network (68% rated as good).</li>
</ul>
<div id="attachment_138" class="wp-caption aligncenter" style="width: 503px"><a href="http://www.sanjaykairam.com/blog/wp-content/uploads/2010/02/aardvarkquestions.png"><img class="size-full wp-image-138" title="Aardvark Questions" src="http://www.sanjaykairam.com/blog/wp-content/uploads/2010/02/aardvarkquestions.png" alt="Aardvark Questions" width="493" height="229" /></a><p class="wp-caption-text">Questions on Aardvark (chart taken from the paper).</p></div>
<p>Overall, I really enjoyed reading this paper. After using Aardvark for over a year now, it was really interesting to get to peer inside and see how the system works, and a lot of great details were provided about the ranking engine.</p>
<p>One place where I feel that the authors missed the mark was in the cursory side-by-side evaluation which pitted Aardvark against Google for a set of 200 questions randomly selected from the Aardvark system. They report that 71.5% of the questions studied were answered successfully on Aardvark, while 70.5% of the questions were answered successfully on Google. This comparison seems mostly useless as the questions, having been pulled from the Aardvark system in the first place, are ones that were specifically chosen because they are better adapted to what is being called &#8216;social search&#8217;. This comparison left me desirous of more investigation into two main questions.<em> </em></p>
<p><em>&#8220;What makes a search engine &#8216;social&#8217; in the first place?&#8221;</em></p>
<p>The distinction between social and non-social is extremely murky, something Brynn and I discovered when working on our <a title="Sanjay Kairam - Cognitive Consequences of Social Search (PDF)" href="http://sanjaykairam.com/papers/evans-kairam-pirolli-inSubmission.pdf" target="_blank">Social Search paper</a>. It has been argued before (one small example <a title="Brynn Evans' Blog - Comment by Manas Tungare" href="http://brynnevans.com/blog/2009/01/30/why-social-search-wont-topple-google-anytime-soon/#comment-1933">here</a>) that Google&#8217;s PageRank algorithm is inherently social, as it aggregates information provided by people (links to one another) to rank results. However, it is clear that something seems categorically different between Google and what people perceive to be &#8216;social search&#8217;. When it comes down to it, even though everyone is excited about <a title="Google Blog - Search is getting more social" href="http://googleblog.blogspot.com/2010/01/search-is-getting-more-social.html" target="_blank">Google&#8217;s forays into &#8220;Social Search&#8221;</a>, there&#8217;s nothing all that fundamentally different about Google indexing your blog and your tweets than any other documents extant on the web.</p>
<p>To me, it seems that the key difference is really the change in the <strong>direction of interaction</strong>. While Google takes a query (question) and compares it against traces of discussion about that question from the past (web documents), systems perceived as &#8216;social&#8217; take a question and attempt to generate new answers in the future. This change in direction is what allows for the higher context that makes &#8216;social&#8217; search answers so much more rich (at least for some questions.)  Perhaps we need a different word to define this phenomenon &#8211; &#8216;real-time search&#8217; seems to get at it more, but has its own problems.  Perhaps something like &#8216;generative search&#8217;? I really don&#8217;t know.</p>
<p><em>&#8220;Why do we need a social search engine at all?&#8221;</em></p>
<p>This one seems like the best fodder for a follow-up study by Aardvark. While they do provide a rough breakdown of the types of questions asked on Aardvark (see pie chart above), I think that a comparison might have been much more interesting if they had looked at a variety of classes of user needs and had compared the relative efficacy of searching on Aardvark and a traditional search engine such as Google. It is clear that &#8216;social&#8217; will work much better for some needs and much worse for others, but up to this point, people who talk about social search always seem to use the same types of examples (travel, restaurants, and products, for instance). It would be great to get a clear idea over a wide range of needs and use cases where systems such as Aardvark can provide benefits over existing tools.</p>
<p>Anyways, for those of you interested in &#8216;social search&#8217; and search systems, I encourage you to read this paper and tell me your thoughts!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.sanjaykairam.com/blog/2010/02/anatomy-of-a-paper-about-a-large-scale-social-search-engine/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Social Search Researchers on Twitter</title>
		<link>http://www.sanjaykairam.com/blog/2009/02/social-search-researchers-on-twitter/</link>
		<comments>http://www.sanjaykairam.com/blog/2009/02/social-search-researchers-on-twitter/#comments</comments>
		<pubDate>Fri, 13 Feb 2009 20:06:27 +0000</pubDate>
		<dc:creator>skairam</dc:creator>
				<category><![CDATA[/Matter]]></category>
		<category><![CDATA[social search]]></category>
		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://sanjaykairam.com/blog/?p=30</guid>
		<description><![CDATA[A few days ago, I started passing around a sign-up form to help people interested in topics relating to Social Search find each other on Twitter.  The great thing about having millions of people on Twitter now is that you are virtually guaranteed to have other people tweeting about the things that interest you, and...]]></description>
			<content:encoded><![CDATA[<p>A few days ago, I started passing around a sign-up form to help people interested in topics relating to Social Search find each other on Twitter.  The great thing about having millions of people on Twitter now is that you are virtually guaranteed to have other people tweeting about the things that interest you, and this was such a great and easy way to find new interesting followers and build a community around a common interest.</p>
<p>I purposely kept the subject &#8220;Social Search&#8221; vague, because I hoped to cast a wider net and then decide who to follow more specifically later.  It&#8217;s turned out that I&#8217;ve already learned a ton of interesting stuff from the people here, so if you are interested in Social Search topics at all, I&#8217;d recommend that you join the list and give everyone else a follow!</p>
<p>I had trouble embedding the spreadsheet in the post, so click these links for the <a title="Social Search Research Sign-Up Form" href="http://spreadsheets.google.com/viewform?key=psNXPtbS1ZToq7CZZmI1JYA" target="_blank">sign-up form</a> and <a title="Social Search Researchers on Twitter" href="http://spreadsheets.google.com/ccc?key=psNXPtbS1ZToq7CZZmI1JYA" target="_blank">results list</a>.  Enjoy!</p>
<p>P.S. The list is obviously public as it was passed around on Twitter, but if anybody has any questions or concerns about my posting the list here, please let me know!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.sanjaykairam.com/blog/2009/02/social-search-researchers-on-twitter/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

