<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Sanjay Kairam &#187; mechanical turk</title>
	<atom:link href="http://www.sanjaykairam.com/blog/tag/mechanical-turk/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.sanjaykairam.com/blog</link>
	<description>Graduate Student &#38; Armchair Philosopher</description>
	<lastBuildDate>Thu, 19 Jan 2012 23:09:09 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>A Brief Overview of TurKit</title>
		<link>http://www.sanjaykairam.com/blog/2011/03/a-brief-overview-of-turkit/</link>
		<comments>http://www.sanjaykairam.com/blog/2011/03/a-brief-overview-of-turkit/#comments</comments>
		<pubDate>Fri, 11 Mar 2011 19:05:26 +0000</pubDate>
		<dc:creator>skairam</dc:creator>
				<category><![CDATA[/Matter]]></category>
		<category><![CDATA[crowds]]></category>
		<category><![CDATA[crowdsourcing]]></category>
		<category><![CDATA[HITs]]></category>
		<category><![CDATA[human computation]]></category>
		<category><![CDATA[human intelligence tasks]]></category>
		<category><![CDATA[mechanical turk]]></category>
		<category><![CDATA[mturk]]></category>
		<category><![CDATA[TurKit]]></category>

		<guid isPermaLink="false">http://www.sanjaykairam.com/blog/?p=292</guid>
		<description><![CDATA[These slides are from a presentation I gave in Sep Kamvar's Computational Methods in Data Mining (old website link here). In the presentation, I presented TurKit, a programming framework created by Greg Little and others at MIT that allows for programmatic iteration over tasks in Mechanical Turk. Essentially, that means that instead of the familiar paradigm of sending out a bunch of HITs and waiting for the responses, TurKit will ping AMT for answers and these answers can be used in future HITs. This allows for the use of an "improve and vote" loop, where Turkers continually improve on and validate the work of other Turkers. They had some impressive results in the paper, getting fairly high quality responses to a wide range of tasks (including image labeling, handwriting recognition, and brainstorming) for under $0.50.]]></description>
			<content:encoded><![CDATA[<p>These slides are from a presentation I gave in<a title="Sep Kamvar - Home Page" href="http://kamvar.org/" target="_blank"> Sep Kamvar</a>&#8216;s Computational Methods in Data Mining (old website link <a title="Stanford CME 340 - Computation Methods in Data Mining" href="http://kamvar.org/cme340/" target="_blank">here</a>). In the presentation, I presented TurKit, a programming framework created by <a title="MIT CSAIL - Greg Little" href="http://people.csail.mit.edu/glittle/" target="_blank">Greg Little</a> and others at MIT that allows for programmatic iteration over tasks in <a title="Amazon Mechanical Turk - Welcome" href="https://www.mturk.com/mturk/welcome" target="_blank">Mechanical Turk</a>. Essentially, that means that instead of the familiar paradigm of sending out a bunch of HITs and waiting for the responses, TurKit will ping AMT for answers and these answers can be used in future HITs. This allows for the use of an &#8220;improve and vote&#8221; loop, where Turkers continually improve on and validate the work of other Turkers. They had some impressive results in the paper, getting fairly high quality responses to a wide range of tasks (including image labeling, handwriting recognition, and brainstorming) for under $0.50.</p>
<p>The presentation ends with a quick intro in the JavaScript code (from the Iterative Text Improvement example on the <a title="TurKit - Home Page" href="http://groups.csail.mit.edu/uid/turkit/" target="_blank">TurKit website</a>) and some hopefully helpful information to know when using the Java application that you can download to try out TurKit. If I survive the end of the quarter, I hope to get a post up with some TurKit tutorial tips and lessons learned. If you have questions about TurKit, let me know, and I&#8217;ll try to get them answered in the next post!</p>
<div id="__ss_7235176" style="width: 425px;">
<p><strong><a title="TurKit: Tools for Iterative Tasks on Mechanical Turk [Little, et al. 2010]" href="http://www.slideshare.net/skairam/turkit-tools-for-iterative-tasks-on-mechanical-turk-little-et-al-2010">TurKit: Tools for Iterative Tasks on Mechanical Turk [Little 2010]</a></strong><object id="__sse7235176" width="425" height="355" classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowFullScreen" value="true" /><param name="allowScriptAccess" value="always" /><param name="src" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=turkitpresowebsite-110311124935-phpapp01&amp;stripped_title=turkit-tools-for-iterative-tasks-on-mechanical-turk-little-et-al-2010&amp;userName=skairam" /><param name="allowfullscreen" value="true" /><param name="allowscriptaccess" value="always" /><embed id="__sse7235176" width="425" height="355" type="application/x-shockwave-flash" src="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=turkitpresowebsite-110311124935-phpapp01&amp;stripped_title=turkit-tools-for-iterative-tasks-on-mechanical-turk-little-et-al-2010&amp;userName=skairam" allowFullScreen="true" allowScriptAccess="always" allowfullscreen="true" allowscriptaccess="always" /></object></p>
<div style="padding: 5px 0 12px;">View more <a href="http://www.slideshare.net/">presentations</a> from <a href="http://www.slideshare.net/skairam">Sanjay Kairam</a>.</div>
</div>
]]></content:encoded>
			<wfw:commentRss>http://www.sanjaykairam.com/blog/2011/03/a-brief-overview-of-turkit/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Can You Spot The Experts? Tagging and Expertise</title>
		<link>http://www.sanjaykairam.com/blog/2010/03/can-you-spot-the-experts-tagging-and-expertise/</link>
		<comments>http://www.sanjaykairam.com/blog/2010/03/can-you-spot-the-experts-tagging-and-expertise/#comments</comments>
		<pubDate>Wed, 24 Mar 2010 16:00:53 +0000</pubDate>
		<dc:creator>skairam</dc:creator>
				<category><![CDATA[/Matter]]></category>
		<category><![CDATA[delicious]]></category>
		<category><![CDATA[domain expertise]]></category>
		<category><![CDATA[expertise]]></category>
		<category><![CDATA[mechanical turk]]></category>
		<category><![CDATA[mturk]]></category>
		<category><![CDATA[research]]></category>
		<category><![CDATA[tag]]></category>
		<category><![CDATA[tagging]]></category>

		<guid isPermaLink="false">http://www.sanjaykairam.com/blog/?p=166</guid>
		<description><![CDATA[I decided to try a little Mechanical Turk study to see if I could spot some differences between tags generated by experts and those generated by novices. I had each Turker read 1 of 5 web pages (on the topic of "enterprise 2.0 mashups") and enter 5 tags which they thought would be useful for bookmarking the page (either for themselves or others). I also asked them to rate how familiar they were with the subject matter ("Not at All", "Slightly Familiar", "Somewhat Familiar", and "I am an Expert")...]]></description>
			<content:encoded><![CDATA[<p>Recently, I&#8217;ve been reading some papers about identifying and harnessing expertise in tagging communities such as <a href="http://www.delicious.com">Delicious</a>&#8211;some of the research that I have come across have looked at topics such as:</p>
<ul>
<li>Identifying the features that underlie &#8220;tag quality&#8221; (e.g. <a href="www.grouplens.org/system/files/group07-sen.pdf">Sen, et al. (2007)</a>, <a href="portal.acm.org/ft_gateway.cfm?id=1531676&amp;type=pdf">Zhang, et al. (2009)</a>)</li>
<li>Topic-based approaches for information retrieval from tagged collections (e.g. <a href="www.cse.psu.edu/~dzhou/papers/www08-tags.pdf">Zhou, et al. (2008)</a>)</li>
<li>Graph-based algorithms for ranking based on user tags (e.g. <a href="www.kde.cs.uni-kassel.de/hotho/pub/.../seach2006hotho_eswc.pdf">Hotho, et al. (2006)</a>, <a href="www.michael-noll.com/.../telling-experts-from-spammers-expertise-ranking-in-folksonomies/">Noll, et al. (2009)</a>)</li>
</ul>
<p>I decided to try a little Mechanical Turk study to see if I could spot some differences between tags generated by experts and those generated by novices. I had each Turker read 1 of 5 web pages (on the topic of &#8220;enterprise 2.0 mashups&#8221;) and enter 5 tags which they thought would be useful for bookmarking the page (either for themselves or others). I also asked them to rate how familiar they were with the subject matter (&#8220;Not at All&#8221;, &#8220;Slightly Familiar&#8221;, &#8220;Somewhat Familiar&#8221;, and &#8220;I am an Expert&#8221;).</p>
<p>As a game, I thought it would be interesting to post some of the responses to see how easy it was to identify which tags were generated by people who rated themselves as &#8220;experts&#8221; vs. &#8220;non-experts&#8221;. I took all of the tags generated by each expertise group, cleaned them up for minor spelling mistakes and typos (e.g., &#8220;applciation&#8221; &gt; &#8220;application&#8221;) and generated a tag cloud using <a href="http://www.wordle.net/">Wordle</a>, where the tag size corresponds to the frequency of use of that word (all other factors, such as positioning and color, are purely stylistic).</p>
<p>For the following URL &#8211; <a href="http://www.soamag.com/I18/0508-1.php">http://www.soamag.com/I18/0508-1.php</a> &#8211; can you identify which tag cloud belongs to which of these groups: &#8220;Not at All (Familiar)&#8221;, &#8220;Slightly Familiar&#8221;, and &#8220;Somewhat Familiar&#8221; (there was a 4th category of &#8220;I am an Expert&#8221;, but nobody rating this URL classified themselves this way):</p>
<div id="attachment_167" class="wp-caption aligncenter" style="width: 460px"><a href="http://www.sanjaykairam.com/blog/wp-content/uploads/2010/03/URL3-Expertise1.jpg"><img class="size-medium wp-image-167" title="Tag Cloud 1" src="http://www.sanjaykairam.com/blog/wp-content/uploads/2010/03/URL3-Expertise1-300x150.jpg" alt="Tag Cloud 1" width="450" height="224" /></a><p class="wp-caption-text">Tag Cloud 1 (N = 17)</p></div>
<div id="attachment_168" class="wp-caption aligncenter" style="width: 460px"><a href="http://www.sanjaykairam.com/blog/wp-content/uploads/2010/03/URL3-Expertise2.jpg"><img class="size-medium wp-image-168" title="Tag Cloud 2" src="http://www.sanjaykairam.com/blog/wp-content/uploads/2010/03/URL3-Expertise2-300x95.jpg" alt="Tag Cloud 2" width="450" height="142" /></a><p class="wp-caption-text">Tag Cloud 2 (N = 16)</p></div>
<div id="attachment_170" class="wp-caption aligncenter" style="width: 460px"><a href="http://www.sanjaykairam.com/blog/wp-content/uploads/2010/03/URL3-Expertise0.jpg"><img class="size-medium wp-image-170" title="Tag Cloud 3" src="http://www.sanjaykairam.com/blog/wp-content/uploads/2010/03/URL3-Expertise0-300x139.jpg" alt="Tag Cloud 3" width="450" height="207" /></a><p class="wp-caption-text">Tag Cloud 3 (N = 14)</p></div>
<p>If you have any idea which tag cloud is which, please feel free to post your guess in the comments! I&#8217;d be extremely curious to see why people guessed the way that they did. I am actually currently in the process of having some Turkers do the same thing; if you are curious about the answers, come back for my follow-up post where I post the correct answers, as well as the results of the Mechanical Turk evaluation of the tag cloud.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.sanjaykairam.com/blog/2010/03/can-you-spot-the-experts-tagging-and-expertise/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>What Makes Web Sites Credible? 10 Years Later</title>
		<link>http://www.sanjaykairam.com/blog/2010/01/what-makes-web-sites-credible-10-years-later/</link>
		<comments>http://www.sanjaykairam.com/blog/2010/01/what-makes-web-sites-credible-10-years-later/#comments</comments>
		<pubDate>Fri, 22 Jan 2010 22:04:54 +0000</pubDate>
		<dc:creator>skairam</dc:creator>
				<category><![CDATA[/Matter]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[credibility]]></category>
		<category><![CDATA[design]]></category>
		<category><![CDATA[mechanical turk]]></category>
		<category><![CDATA[mturk]]></category>
		<category><![CDATA[research]]></category>
		<category><![CDATA[survey]]></category>

		<guid isPermaLink="false">http://www.sanjaykairam.com/blog/?p=118</guid>
		<description><![CDATA[Last month, I read a study by B. J. Fogg and others from the Persuasive Technology Lab at Stanford, entitled "What Makes Web Sites Credible? A Report on a Large Quantitative Study". The paper described an early effort to systematically determine how different elements of web sites affect people's perceptions of credibility (defined roughly as the intersection of trustworthiness and expertise). The original study design had 1400 participants completing a survey which presented them with 51 web site elements and asked them to rate how much more or less each element would affect the believability of a web site. The two questions I hope to answer, roughly, are "What has changed in the past 10 years about how people assess web site credibility?" and "Is there a cheaper, yet effective, way to do a study like this?". The results have implications for website design.]]></description>
			<content:encoded><![CDATA[<p>Last month, I read a study by <a title="B.J. Fogg - Home Page" href="http://www.bjfogg.com/" target="_blank">B. J. Fogg</a> and others from the <a title="Stanford University - Persuasive Technology Lab" href="http://captology.stanford.edu/" target="_blank">Persuasive Technology Lab at Stanford</a>, entitled &#8220;<a title="Paper Link (PDF)" href="http://www.google.com/url?sa=t&amp;source=web&amp;ct=res&amp;cd=1&amp;ved=0CAsQFjAA&amp;url=http%3A%2F%2Fcaptology.stanford.edu%2Fpdf%2Fp61-fogg.pdf&amp;ei=ASBaS5W-G4TYsgPIu7HNBA&amp;usg=AFQjCNFfC0AVUNm3lt3gOVxE-DCbs5pI-A&amp;sig2=mkyXRpQBujCx6-j0MMFJ3Q" target="_blank">What Makes Web Sites Credible? A Report on a Large Quantitative Study</a>&#8220;. The paper described an early effort to systematically determine how different elements of web sites affect people&#8217;s perceptions of <a title="Wikipedia - Credibility" href="http://en.wikipedia.org/wiki/Credibility" target="_blank">credibility</a> (defined roughly as the intersection of trustworthiness and expertise). The original study design had 1400 participants completing a survey which presented them with 51 web site elements and asked them to rate how much more or less each element would affect the believability of a web site.</p>
<p>Upon reading this study, I noticed two things:</p>
<p>1) The original experiment solicited participants by offering to donate $10 to charity for their time; resulting in a net cost of at least $14K, raising the second question: <em>Given the growth of sites like <a title="Amazon Mechanical Turk" href="http://www.mturk.com" target="_blank">Mechanical Turk</a>, can we get comparable results for less money?</em></p>
<p>2) The data was originally collected in December <span style="text-decoration: line-through;">2009</span> 1999, which is almost exactly 10 years ago. Content on the web and our interactions with it have changed a great deal since then &#8211; I mean, in 1999, <a title="Google - Corporate Milestones" href="http://www.google.com/corporate/history.html" target="_blank">Google had only existed for a year</a>, blogs were still relatively uncommon (<a title="LiveJournal - Wikipedia" href="http://en.wikipedia.org/wiki/LiveJournal" target="_blank">LiveJournal had just started</a>), and <a title="Wikipedia - Historical Overview by Year" href="http://en.wikipedia.org/wiki/History_of_Wikipedia#Historical_overview_by_year" target="_blank">Wikipedia didn&#8217;t even exist yet</a>! This raised the question: <em>Given that the average Internet user now has a far greater amount of experience navigating the Web, should we expect the responses to be different 10 years later?</em></p>
<p>I decided to explore both of these questions through a survey on Mechanical Turk; in December 2009, I posted a HIT to Mechanical Turk replicating the original study as closely as possible. The 51 statements about web site elements were available from Fogg&#8217;s original paper, the same 7-point Likert scale was used, and the survey items were randomized.  I paid $0.05 per HIT, and I ended up getting 327 responses, with none thrown out due to quality.</p>
<p>An initial, high-level examination of the responses showed that they actually matched the 1999 data fairly well.  Average Likert ratings for each item correlated highly with average ratings for the same items in the 1999 data with R^2 = 0.96.  One difference in the data was that answers were compressed (closer to 0) overall, so for the purposes of comparison, I transformed the 2009 data using the transformation (1.1677X &#8211; 0.0003) to match the 1999 data (note that this has no effect on the correlation coefficient).</p>
<p>The first analysis in Fogg, et al. was to separate the various elements into 7 &#8220;scales&#8221; using factor analysis (<em>Real-World Feel, Ease of Use, Expertise, Trustworthiness, Tailoring, Commercial Implications,</em> and <em>Amateurism</em>).  Below, I present comparisons for items in each scale.  I highlighted items that deviated from the 1999 values by more than 0.25 (without the original data, I couldn&#8217;t do much more in-depth comparison), but this might give some rough idea of which elements are <strong>more</strong> important now than they were in 1999 (red) and which elements are <strong>less</strong> important (purple):</p>
<div id="attachment_120" class="wp-caption aligncenter" style="width: 500px"><a href="http://www.sanjaykairam.com/blog/wp-content/uploads/2010/01/Slide1.jpg"><img class="size-full wp-image-120" title="Real-World Feel Scale" src="http://www.sanjaykairam.com/blog/wp-content/uploads/2010/01/Slide1.jpg" alt="Real-World Feel Scale" width="490" height="211" /></a><p class="wp-caption-text">Elements related to the &quot;Real-World&quot; feel scale are rated similarly from 1999 to 2009.</p></div>
<div id="attachment_121" class="wp-caption aligncenter" style="width: 498px"><a href="http://www.sanjaykairam.com/blog/wp-content/uploads/2010/01/Slide2.jpg"><img class="size-full wp-image-121" title="&quot;Ease of Use&quot; Scale" src="http://www.sanjaykairam.com/blog/wp-content/uploads/2010/01/Slide2.jpg" alt="&quot;Ease of Use&quot; Scale" width="488" height="209" /></a><p class="wp-caption-text">One difference is that people seem more critical of long download times than in the past.</p></div>
<div id="attachment_122" class="wp-caption aligncenter" style="width: 500px"><a href="http://www.sanjaykairam.com/blog/wp-content/uploads/2010/01/Slide3.jpg"><img class="size-full wp-image-122" title="Expertise Scale" src="http://www.sanjaykairam.com/blog/wp-content/uploads/2010/01/Slide3.jpg" alt="Expertise Scale" width="490" height="304" /></a><p class="wp-caption-text">For some reason, it seems that displaying an award helps your site&#39;s credibility more than 10 years ago, and that providing a lot of news stories matters less.</p></div>
<div id="attachment_123" class="wp-caption aligncenter" style="width: 501px"><a href="http://www.sanjaykairam.com/blog/wp-content/uploads/2010/01/Slide4.jpg"><img class="size-full wp-image-123" title="Trustworthiness Scale" src="http://www.sanjaykairam.com/blog/wp-content/uploads/2010/01/Slide4.jpg" alt="Trustworthiness Scale" width="491" height="285" /></a><p class="wp-caption-text">Stating your policy on content and ending in &quot;.org&quot; are more important to people now - could this be a cultural shift in response to sites like Wikipedia? Overall, it seems as if links out to other sites matter less for credibility now.</p></div>
<div id="attachment_124" class="wp-caption aligncenter" style="width: 503px"><a href="http://www.sanjaykairam.com/blog/wp-content/uploads/2010/01/Slide5.jpg"><img class="size-full wp-image-124" title="Tailoring Scale" src="http://www.sanjaykairam.com/blog/wp-content/uploads/2010/01/Slide5.jpg" alt="Tailoring Scale" width="493" height="195" /></a><p class="wp-caption-text">Email confirmations are more important now than in 1999.</p></div>
<div id="attachment_125" class="wp-caption aligncenter" style="width: 501px"><a href="http://www.sanjaykairam.com/blog/wp-content/uploads/2010/01/Slide6.jpg"><img class="size-full wp-image-125" title="Commercial Implications" src="http://www.sanjaykairam.com/blog/wp-content/uploads/2010/01/Slide6.jpg" alt="Commercial Implications" width="491" height="356" /></a><p class="wp-caption-text">Outside advertisements and an e-commerce focus matter more for credibility now than in the past. People are paying less attention to the commercial purpose of sites, as well as the number of ads and their integration with content (Google is a favorite site for many, after all.)</p></div>
<div id="attachment_126" class="wp-caption aligncenter" style="width: 502px"><a href="http://www.sanjaykairam.com/blog/wp-content/uploads/2010/01/Slide7.jpg"><img class="size-full wp-image-126" title="Amateurisum Scale" src="http://www.sanjaykairam.com/blog/wp-content/uploads/2010/01/Slide7.jpg" alt="Amateurism Scale" width="492" height="367" /></a><p class="wp-caption-text">People seem to notice domain name mismatches now (more public knowledge of phishing/identity theft now?), but less attention to multi-lingual sites.</p></div>
<div id="attachment_127" class="wp-caption aligncenter" style="width: 501px"><a href="http://www.sanjaykairam.com/blog/wp-content/uploads/2010/01/Slide8.jpg"><img class="size-full wp-image-127" title="Other Elements" src="http://www.sanjaykairam.com/blog/wp-content/uploads/2010/01/Slide8.jpg" alt="Other Elements" width="491" height="181" /></a><p class="wp-caption-text">People seem more willing to trust free financial sites now than in the past.</p></div>
<p>Anyways, as we might have guessed, the answers from 2009 seem to match the answers from 1999 pretty well.  The elements that made things highly credible or highly &#8216;un-credible&#8217; in the past seem to have remained constant, and those which didn&#8217;t matter then seem not to matter too much now.  Some interesting elements are noted in the captions with some rough conjectures as to why some of them might be trending the way they are.</p>
<p>The rest of the Fogg paper focused on characterizing differences in scale responses due to demographic differences between participants, but I found that part of the study less convincing as they averaged over the positively and negatively phrased elements on each scale (which I think makes the interpretation somewhat confusing).</p>
<p>Now, back to our two questions:</p>
<p>1) It looks as if using Mechanical Turk, we were able to get reasonable answers that fairly closely matched those from the original study.  Total price tag: 327 response * $0.05 = $16.35 paid to participants (plus Amazon&#8217;s cut), which made this study about $13,980 cheaper than the original one.</p>
<p>2) We see above a few small changes in what makes web sites credible, but overall, people are looking at the same things, meaning that we should continue taking the same factors into account when designing websites. I&#8217;ve made some guesses as to why these elements may have changed over time, but I&#8217;m curious to hear what you think, so leave a comment!</p>
]]></content:encoded>
			<wfw:commentRss>http://www.sanjaykairam.com/blog/2010/01/what-makes-web-sites-credible-10-years-later/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>A MTurk Exploration of Activity Stream Usage</title>
		<link>http://www.sanjaykairam.com/blog/2009/06/a-mturk-exploration-of-activity-stream-usage/</link>
		<comments>http://www.sanjaykairam.com/blog/2009/06/a-mturk-exploration-of-activity-stream-usage/#comments</comments>
		<pubDate>Mon, 15 Jun 2009 23:02:02 +0000</pubDate>
		<dc:creator>skairam</dc:creator>
				<category><![CDATA[/Matter]]></category>
		<category><![CDATA[activity streams]]></category>
		<category><![CDATA[facebook]]></category>
		<category><![CDATA[friendfeed]]></category>
		<category><![CDATA[mechanical turk]]></category>
		<category><![CDATA[mturk]]></category>
		<category><![CDATA[news feeds]]></category>
		<category><![CDATA[presentation]]></category>
		<category><![CDATA[research]]></category>
		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://sanjaykairam.com/blog/?p=51</guid>
		<description><![CDATA[These are some slides from a presentation I gave on some Mechanical Turk data I collected about how people are using Activity Streams.  Specifically, I was interested in what tools people were using, what they were using them for, how these tools might be improved, and how people had been using these tools to collaborate/coordinate.  Here's what I found...]]></description>
			<content:encoded><![CDATA[<p>These are some slides from a presentation I gave on some Mechanical Turk data I collected about how people are using Activity Streams (also called News Feeds).  Specifically, I was interested in what tools people were using, what they were using them for, how these tools might be improved, and how people had been using these tools to collaborate/coordinate.  Here&#8217;s what I found:</p>
<div id="__ss_1588047" style="width: 425px; text-align: left;"><a style="font:14px Helvetica,Arial,Sans-serif;display:block;margin:12px 0 3px 0;text-decoration:underline;" title="An Exploration of Activity Stream Usage via Mechanical Turk" href="http://www.slideshare.net/skairam/an-exploration-of-activity-stream-usage-via-mechanical-turk?type=presentation">An Exploration of Activity Stream Usage via Mechanical Turk</a><object width="425" height="355" data="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=parcactivitystreamsmturksurveypresentation-090615165608-phpapp01&amp;stripped_title=an-exploration-of-activity-stream-usage-via-mechanical-turk" type="application/x-shockwave-flash"><param name="allowFullScreen" value="true" /><param name="allowScriptAccess" value="always" /><param name="src" value="http://static.slidesharecdn.com/swf/ssplayer2.swf?doc=parcactivitystreamsmturksurveypresentation-090615165608-phpapp01&amp;stripped_title=an-exploration-of-activity-stream-usage-via-mechanical-turk" /><param name="allowfullscreen" value="true" /></object></p>
<div style="font-size: 11px; font-family: tahoma,arial; height: 26px; padding-top: 2px;">View more <a style="text-decoration:underline;" href="http://www.slideshare.net/">PDF documents</a> from <a style="text-decoration:underline;" href="http://www.slideshare.net/skairam">Sanjay Kairam</a>.</div>
</div>
<p>The data collected and the major points were fairly straightforward:</p>
<p><strong>Participant Demographics:</strong></p>
<ul>
<li><strong>Age:</strong> Mean = 25.6, SD = 8.0</li>
<li><strong>Education:</strong> Almost all were mid-college or post-college (and about 1/6 post-graduate study).</li>
<li><strong>Usage:</strong> Most (56/78) reported specifically personal usage, and only 2 subjects reported specifically professional usage (14 indicated both, however).</li>
</ul>
<p><strong>Tools Used:</strong></p>
<ul>
<li>Vast majority listed <strong>Facebook</strong> (61/78) &#8211; this was unsurprising (also, Facebook Stream listed as first example of an &#8220;activity stream&#8221; in survey instructions.)</li>
<li>Wide <strong>Twitter</strong> usage (41/78) was surprising, however.  Past experiencing with polling for Twitter-related topics on MTurk had resulted in low yield.  Perhaps this is due to the crazy upswing in Twitter sign-ups over the past few months?</li>
<li>Other than <strong>MySpace</strong> (16/78), tools such as LinkedIn, Yammer, FriendFeed, and others were barely listed, indicating either that these tools are not widely used or that people do not consider some of these to be activity streams.</li>
</ul>
<p><strong>Functions Served:</strong></p>
<ul>
<li><strong>Note:</strong> These responses were loosely categorized by me &#8211; this was not intended to be a rigorous academic study, but rather a glimpse into usage of these tools.</li>
<li><strong>Status</strong>(33/78), <strong>Communication</strong>(32/78), and <strong>Information</strong>(19/78) were listed as the most common functions served.</li>
<li>Responses also demonstrated a wide variety of usage, however, including some less anticipated uses such as <strong>Journaling</strong>.  Perhaps this speaks somewhat to the flexibility of these tools and the ability that users have to adapt them to their own needs.</li>
</ul>
<p><strong>Feature Requests / Improvements:</strong></p>
<ul>
<li>These are exact quotes from participants (again, loosely grouped into categories by me &#8211; no cross-coding was done).</li>
</ul>
<p>As you can see, the &#8220;Summary&#8221; was really just a reminder about what people said regarding potential improvements, but I thought this was really the most interesting part.  It&#8217;s interesting that most of the things that people asked for were things that are either available or which could be easily made available by new activity stream client applications, so there may be a lot of low-hanging fruit out there for application developers.</p>
<p>I&#8217;d be curious to see if any of you has done (or seen) similar research regarding Twitter, Facebook, or other activity streams (whether on MTurk or otherwise) and if you found similar or different trends.  If you are interested in clarification, more details, or discussion about any of the points brought up here, the comments section awaits.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.sanjaykairam.com/blog/2009/06/a-mturk-exploration-of-activity-stream-usage/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>Smartsheet Automating Data Collection Through Mechanical Turk</title>
		<link>http://www.sanjaykairam.com/blog/2009/02/smartsheet-automating-data-collection-through-mechanical-turk/</link>
		<comments>http://www.sanjaykairam.com/blog/2009/02/smartsheet-automating-data-collection-through-mechanical-turk/#comments</comments>
		<pubDate>Thu, 12 Feb 2009 18:25:50 +0000</pubDate>
		<dc:creator>skairam</dc:creator>
				<category><![CDATA[/Matter]]></category>
		<category><![CDATA[data collection]]></category>
		<category><![CDATA[mechanical turk]]></category>
		<category><![CDATA[mturk]]></category>
		<category><![CDATA[smartsheet]]></category>
		<category><![CDATA[smartsourcing]]></category>

		<guid isPermaLink="false">http://sanjaykairam.com/blog/?p=26</guid>
		<description><![CDATA[Smartsheet has just launched their &#8220;Smartsourcing&#8221; product which allows you to automatically build data-collection tasks in a &#8220;Smartsheet&#8221; (a spreadsheet) and outsource them as HIT&#8217;s on Mechanical Turk.  I just watched the demo video over at Smartsheet, and I have to say that the integration looks pretty slick. The ability to outsource data collection and...]]></description>
			<content:encoded><![CDATA[<p><a title="Smartsheet" href="http://www.smartsheet.com/" target="_blank">Smartsheet</a> has just launched their &#8220;Smartsourcing&#8221; product which allows you to automatically build data-collection tasks in a &#8220;Smartsheet&#8221; (a spreadsheet) and outsource them as HIT&#8217;s on <a title="Wikipedia - Mechanical Turk" href="http://en.wikipedia.org/wiki/Amazon_Mechanical_Turk" target="_blank">Mechanical Turk</a>.  I just watched the <a title="Smartsourcing Information and Demo Video" href="http://www.smartsheet.com/product/smartsourcing" target="_blank">demo video</a> over at Smartsheet, and I have to say that the integration looks pretty slick.</p>
<p>The ability to outsource data collection and have it populate a formatted spreadsheet in near real-time is fairly impressive (speed will probably differ on attractiveness of the task).  Often, the steps required to set up a MTurk task can take as long as the data collection itself, and afterwards you are left exporting to an unformatted Excel sheet.  Making sure to name data-labels correctly, populating data fields to randomize among workers, and other tasks can further complicate things; judging from the demo video, Smartsheet seems to handle these parts of the process fairly easily.</p>
<p>Unfortunately, good service doesn&#8217;t come free, and using the Smartsourcing feature requires signing up for a paid-account as well as the Smartsourcing fee on top of that, so I didn&#8217;t get the chance to try it out, but I would be extremely curious to hear about anyone&#8217;s experience with the service if they do try it.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.sanjaykairam.com/blog/2009/02/smartsheet-automating-data-collection-through-mechanical-turk/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
	</channel>
</rss>

