<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Menarik</title>
	<atom:link href="http://www.menarik.co.uk/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.menarik.co.uk</link>
	<description>my kind of interesting</description>
	<lastBuildDate>Wed, 23 Nov 2011 21:11:03 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>Stack Overview</title>
		<link>http://www.menarik.co.uk/2011/11/stack-overview/</link>
		<comments>http://www.menarik.co.uk/2011/11/stack-overview/#comments</comments>
		<pubDate>Wed, 23 Nov 2011 21:11:03 +0000</pubDate>
		<dc:creator>Paul Matthews</dc:creator>
				<category><![CDATA[Papers Reviewed]]></category>
		<category><![CDATA[hci]]></category>
		<category><![CDATA[qa]]></category>
		<category><![CDATA[websci]]></category>

		<guid isPermaLink="false">http://www.menarik.co.uk/?p=589</guid>
		<description><![CDATA[Review of some research into the popular programming Q&#038;A, Stack Overflow]]></description>
			<content:encoded><![CDATA[<p>The hugely popular <a href="http://stackoverflow.com/" title="http://stackoverflow.com/">Stack Overflow</a> site provides answers to questions on a range of programming topics and has become a trusted resource for both students and professionals needing to improve their code, solve a problem, or get advice on an implementation. The owners also provide a regular <a href="http://blog.stackoverflow.com/category/cc-wiki-dump/">dump of the entire site </a>, allowing analysis of users and their interaction through posing and responding to questions, and voting on others&#8217; posts. Below are three recent studies each investigating aspects of this Q&#038;A site.</p>
<div id="attachment_612" class="wp-caption aligncenter" style="width: 759px"><a href="http://www.menarik.co.uk/wp-content/uploads/2011/11/Selection_0581.png"><img src="http://www.menarik.co.uk/wp-content/uploads/2011/11/Selection_0581.png" alt="Some Stack Overflow questions" title="Selection_058" width="749" height="267" class="size-full wp-image-612" /></a><p class="wp-caption-text">Some Stack Overflow questions</p></div>
<p>The Mamykina paper uses a nice mixed methods approach to look at both the interaction patterns on the site and the motivation of users and community owners. Some key design aspects highlighted are that the community managers are themselves respected domain experts, and that they have taken an evolutionary, collaborative approach to the development of the platform. In terms of site activity, the authors note that a high proportion of questions receive at least one answer (92%) and that answers are received and accepted quickly (median 11 minutes to be received, 21 minutes for an answer to be accepted). Whereas some users only ask  (23%) and others only answer(20%), a good number do both (21%). They go on to recognise distinct user profiles in the community: activists, shooting stars, low-profile users and lurkers/visitors. Activists and shooting stars &#8211; active for a short period &#8211; together provide a large proportion of the answers despite representing a low proportion of the community (figure).</p>
<div id="attachment_618" class="wp-caption aligncenter" style="width: 598px"><a href="http://www.menarik.co.uk/wp-content/uploads/2011/11/stack_graphic.png"><img src="http://www.menarik.co.uk/wp-content/uploads/2011/11/stack_graphic.png" alt="bar char if user types" title="stack_graphic" width="588" height="339" class="size-full wp-image-618" /></a><p class="wp-caption-text">Answer behaviours (data from manykina et al)</p></div>
<p>Mamykina et al also note how the gaming aspects to the site &#8211; reputation scores and badges &#8211; clearly add to the site&#8217;s stickiness and appeal, though are not without potential problems, including the tendency for answers to be provided quickly in a rush for the accompanying reputation points.  </p>
<p>Truede et als&#8217; study is qualitative, looking at the types of questions that are asked and the tags applied to them. They propose a coding of programming language; framework; environment; domain and non-functional for tag categories and question types including how-tos; discrepancies; environments; errors; decision and conceptual. They found code review questions to be the most frequently answered satisfactorily (ie with an &#8220;Accepted&#8221; answer in 92% of cases studied), proposing that this is the easiest category to answer being the most concrete and self-contained, usually containing a code snippet for answerers to critique and correct.</p>
<p>Kumar et al looked at the Stack Overflow platform as an example of a &#8220;two-sided market&#8221;, where there are two distinct user groups &#8211; questioners and answerers &#8211; and a network effect between them that helps to determine its attractiveness to new adopters. Using a data extract, they derived &#8220;attachment curves&#8221; for the platform, defined as the probability that a user of one type or another will join the platform. They found a strong &#8220;cross-side&#8221; network effect, where the rate of questioners joining the network was strongly influenced by the presence of answerers, but that this effect was asymmetric and answerers grew more slowly in the presence of many questioners.</p>
<p>These papers together provide some good insights into how this online community has grown and been a great success &#8211; there is a symbiotic and sustainable balance between questioners and providers, groups both being served by the interactions, to the huge benefit of the third group, the search engine visitor. We also see how some question types and categories work more successfully, and how rewards might encourage speed and quantity but may also compromise quality. </p>
<p>REFERENCES</p>
<p>KUMAR, R., LIFTSHITS, Y. and TOMKINS, A., 2010. <strong>Evolution of Two-Sided Markets</strong>, Third ACM International Conference on Web Search and Data Mining, February 3-6 2010<br />
<a href="http://research.yahoo.com/pub/3015" title="http://research.yahoo.com/pub/3015">http://research.yahoo.com/pub/3015</a></p>
<p>MAMYKINA, L., MANOIM, B., MITTAL, M., HRIPCSAK, G. and HARTMANN, B., 2011. <strong>Design lessons from the fastest q&#038;a site in the west</strong>, Proceedings of the 2011 annual conference on Human factors in computing systems, 2011, ACM pp2857-2866<br />
<a href="http://bid.berkeley.edu/files/papers/mamykina-stackoverflow-chi2011.pdf" title="http://bid.berkeley.edu/files/papers/mamykina-stackoverflow-chi2011.pdf">http://bid.berkeley.edu/files/papers/mamykina-stackoverflow-chi2011.pdf</a></p>
<p>TREUDE, C., BARZILAY, O. and STOREY, M.-., 2011. <strong>How do programmers ask and answer questions on the web?</strong>: NIER track, Software Engineering (ICSE), 2011 33rd International Conference on, 2011, pp804-807</p>
]]></content:encoded>
			<wfw:commentRss>http://www.menarik.co.uk/2011/11/stack-overview/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Online community social typology</title>
		<link>http://www.menarik.co.uk/2011/07/social-typology/</link>
		<comments>http://www.menarik.co.uk/2011/07/social-typology/#comments</comments>
		<pubDate>Wed, 06 Jul 2011 17:24:28 +0000</pubDate>
		<dc:creator>Paul Matthews</dc:creator>
				<category><![CDATA[Papers Reviewed]]></category>
		<category><![CDATA[network analysis]]></category>
		<category><![CDATA[online communities]]></category>
		<category><![CDATA[websci]]></category>

		<guid isPermaLink="false">http://www.menarik.co.uk/?p=540</guid>
		<description><![CDATA[A selection of studies aiming to characterise user profiles in online communities through structural and/or content based analysis (image:flickr/accidental hedonist)]]></description>
			<content:encoded><![CDATA[<p>COBB, N.K., GRAHAM, A.L. and ABRAMS, D.B., 2010. Social Network Structure of a Large Online Community for Smoking Cessation. American Journal of Public Health, 100</p>
<p>PFEIL, U., SVANGSTU, K., ANG, C.S. and ZAPHIRIS, P., 2011. Social Roles in an Online Support Community for Older People. International Journal of Human-Computer Interaction, 27(4), pp. 323-347</p>
<p>WELSER, H.T., GLEAVE, E., FISCHER, D. and SMITH, M., 2007. <a href="http://www.cmu.edu/joss/content/articles/volume8/Welser/">Visualizing the signatures of social roles in online discussion groups</a>. Journal of Social Structure, 8</p>
<p>This set of studies looked at data from online support and discussion communities with the aim of understanding network structures and the parts played by types of community members. Cobb et al used network analysis alone in their study of a community of smokers looking to give up (QuitNet). Pfeil et al attempted to integrate content and structural analysis in their analysis of the SeniorNet community for older people to discuss health issues. Welser et al sought to correlate structural characteristics with predominant interaction behaviours in three sets of Usenet data (two technical discussion groups and one on kites!).</p>
<div id="attachment_546" class="wp-caption alignleft" style="width: 219px"><a href="http://www.menarik.co.uk/wp-content/uploads/2011/07/barabeke.jpg"><img src="http://www.menarik.co.uk/wp-content/uploads/2011/07/barabeke-209x300.jpg" alt="woman smoking" title="barabeke" width="209" height="300" class="size-medium wp-image-546" /></a><p class="wp-caption-text">Image: Flickr/barabeke</p></div>
<p>Cobb et al. start out by comparing weakly connected with strongly/densely connected members &#8211; on the basis of blog post and private message connections. They find more of the strong or densely connected core are women, and that they are older than the weakly connected core. They are also less likely to have lapsed in their attempts to give up. The study goes on to identify three significant groups:<strong> Key Players</strong> (50 users of the 800,000 registered and of the 7500 or so active in the study period), <strong>Newcomers</strong> (800) and <strong>Integrators</strong> (750). Key players were 80% female, with an average age of 49 and a high level of activity in terms of posts, connections and reciprocal messaging. Newcomers were all smokers (avg age 39) who joined the site during the study. Integrators were those who formed ties with the newcomers by initiating contact. The authors note a healthy heterogeneity to the network, with abstainers mixed with smokers &#8211; thereby providing the normative influence needed for quitters to keep it up.</p>
<p>Pfeil et al used a coding scheme to tag forum posts in a much smaller community (only 30 or so active members analysed) according to: self-disclosure, community, factual, slightly-off and support oriented content. The tagging scheme yielded seven types of user based on the predominance of these post types. They also undertook a regular equivalence analysis (REGE algorithm) of the network, which yielded six groups showing distinct interaction patterns. In their discussion they attempt to integrate the two groupings, though note that:</p>
<p>&#8220;At first glance, it appears that there is no overt connection between the results of the structural approach (social network analysis) and the behavioral approach (content analysis). The groups identified by the two approaches contain different members. However, at closer look, some relationships can be found.&#8221;</p>
<p>The groups they arrive at are:-</p>
<ul>
<li><strong>Moderating supporter</strong>: Givers and receivers of support, active in welcoming new members and posting frequently at length;</li>
<li><strong>Central supporter</strong>: Frequent posters proving support and strongly connected, but less active in welcoming new members;</li>
<li><strong>Active member</strong>: Regular posters mostly chatting about themselves;</li>
<li><strong>Passive member</strong>: Less central. Did not provide support but did show self-disclosure</li>
<li><strong>Technical expert</strong>: Short messages on technical issues.</li>
<li><strong>Visitor</strong>: Remote positions, limited engagement.</li>
</ul>
<p>Pfeil et als conclusions &#8211; that moderating supporters should be recognised and cultivated &#8211; are not especially surprising or new, though they do make an interesting observation about building trust in order to encourage self-disclosure.</p>
<p>Welser et al&#8217;s study focussed on <strong>the answer person</strong> &#8220;a participant in an online discussion group whose primary mode of interaction is the provision of helpful, informative responses to other group members’ questions.&#8221; Answer people are important for contributing without expectation of recompense and have high community value. For this reason the authors wanted to see what structural aspects of communities correlated well with answer people. They found a characteristic pattern of a sparse, star shaped egocentric network with connections to isolates, few of whom were connected to each other (see <a href="http://www.cmu.edu/joss/content/articles/volume8/Welser/images/image016.jpg">their example</a>). Their analysis found that &#8220;authorlines&#8221; (thread contribution patterns), neighbour&#8217;s degree distribution and local network characteristics were good predictors of answer people. </p>
<p>The studies are methodologically interesting, with shortcomings of a purely structural approach noted by Cobb et al &#8211; for example, it is impossible to discern the influence of the community on more passive members. And what are the motivations of the key players? Pfeil et al tried to triangulate but found it difficult, suggesting that one-dimensional categorisation of users may be an over simplification. </p>
<p>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.menarik.co.uk/2011/07/social-typology/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Quora: Social epistemology, credibility &amp; the QA interface</title>
		<link>http://www.menarik.co.uk/2011/01/social-epistemology-credibility-and-social-qa-interface/</link>
		<comments>http://www.menarik.co.uk/2011/01/social-epistemology-credibility-and-social-qa-interface/#comments</comments>
		<pubDate>Tue, 11 Jan 2011 00:00:59 +0000</pubDate>
		<dc:creator>Paul Matthews</dc:creator>
				<category><![CDATA[Analysis]]></category>
		<category><![CDATA[credibility]]></category>
		<category><![CDATA[information literacy]]></category>
		<category><![CDATA[qa]]></category>
		<category><![CDATA[quora]]></category>
		<category><![CDATA[social epistemology]]></category>

		<guid isPermaLink="false">http://www.menarik.co.uk/?p=452</guid>
		<description><![CDATA[A view on the epistemic value of the content and interface of the social QA site
Img: Flickr/coldtaxi]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.quora.com/">Quora</a> is a newish social question answering site which is gaining popularity. Common reaction from new users seems to be : &#8220;<a href="http://www.quora.com/What-is-Quora-1">what&#8217;s this?</a>&#8221; &#8220;<a href="http://www.quora.com/How-Is-Quora-Different-From-X">do we need another answers site?</a>&#8220;, and &#8220;<a href="http://www.quora.com/How-do-you-turn-off-Quora-email-notifications">how can I stop these e-mail notifications?</a>&#8220;. Despite fears that Quora may become populated with lame questions and lamer answers, not to mention spammers, there may also be grounds for optimism, as Quora implements a number of interface features with the potential to encourage the aggregation of good social knowledge, perhaps to a greater extent than other sites including Wikipedia and Yahoo Answers.</p>
<p><img alt="Image:flickr/coldtaxi" src="http://www.menarik.co.uk/wp-content/uploads/2011/01/handsup1.jpg" class="alignleft" width="240" style="float:left;padding-right:20px;" /></p>
<p>Alvin Goldman&#8217;s 1999 book <a href="http://www.librarything.com/work/6304505/book/49515055">Knowledge in a Social World</a><sup>1</sup> provides a framework for the evaluation of social knowledge practices<sup>2</sup>. My interpretation of his model toward the QA context is that it comprises:</p>
<p>1. Allowance for personal interest in the topic : &#8220;sensitivity to relative amounts of interest should play a modest role in assessing .. epistemic credentials&#8221;;<br />
2. Need to know: Not everybody in a community needs to know everything, instead knowledge should be available to those who need it most;<br />
3. The mechanics of the knowledge exchange practice itself, which involves: the inference practice of the &#8220;credal agent&#8221; (user in this case); the speech practices of the speaker (answerer) and the communication-control practice (the interface and its affordances + community moderation)<br />
4. Accomodation of different types of question. For questions where no agreed &#8220;true&#8221; answer exists, it might be treated in a <a href="http://en.wikipedia.org/wiki/Karl_Popper">Popperian</a> manner &#8211; you cannot know what is true but you can reject specific hypotheses. For this to work, however, a full range of hypotheses need to be represented.</p>
<p>How does Quora look against this framework and other related thinking in credibility and the epistemology of testimony? In terms of interest and need-to-know, the site limits your view of the community to just those parts you have shown an interest in. While this may lead to a confusing experience for those coming new to the homepage, it is perhaps a deliberate social design decision to create specialist areas and multiple entry points, as<a href="http://www.ribbonfarm.com/2010/12/07/socratic-fishing-in-lake-quora/"> venkat argues in this post</a>. </p>
<p>As far as practices are concerned, some of this will be down to norms that evolve and change with the expansion of the community of users.  User inference practices need to be further studied, though it is likely that personal epistemologies &#8211; a personally held understanding of what knowledge is &#8211; and a desire for quick answers or &#8220;epistemic closure&#8221; will continue to be significant, though competing factors. That is, the need for closure may hinder the development of a richer understanding of knowledge as contested and nuanced. In providing responses, the motivation of answerers may be altruistic, competitive or toward self-promotion, though the relative effect of these are perhaps less significant when the community as a whole can vote answers up or down. And this collaborative filtering, while flawed, is at the heart of the consensus model that the site hopes to implement. Notably, Quora does not use the &#8220;Accepted answer&#8221; feature that other sites use which places emphasis on a particular answer.</p>
<p>Other interface features seem to be geared toward enhancing the user&#8217;s ability to evaluate answers. In terms of your qualification to answer on a topic, Quora allows you to provide your experience <em>per topic</em> rather than overall for your profile. This could provide important &#8211; though of course manipulable &#8211; cues to credibility. Evidence has shown that the closer testimony is to first-hand experience, the more important it is to us.</p>
<p>Criticism of<a href="http://www.menarik.co.uk/2010/07/the-social-epistemology-of-wikipedia/"> Wikipedia in terms of credible testimony</a><sup>3</sup> has been that the masking of author identity makes it very difficult to assess. As this question reveals, <a href="http://www.quora.com/Do-I-have-to-use-my-real-name-on-Quora">Quora insists on you providing your real name</a>, although you may also opt to answer particular questions anonymously.</p>
<p>It has been noted that the discussion pages in Wikipedia provide the real story behind the knowledge presented on the page itself, and that the user should use these to help understand the debate and range (or lack) of views presented. By having both edited summary and discussion together in the page, Quora potentially provides more transparency in this. The edited summary should serve as a useful overview when woven together from a number of answers &#8211; providing the opportunity for &#8220;braided learning&#8221;<sup>4</sup>. </p>
<p>A further spur to credibility which Quora is trying to encourage is for users to expose their reasoning and argument rather than simply providing pithy answers, thereby avoiding a &#8220;black box effect&#8221; which leaves important gaps in the explanation and requires a leap of faith on the part of the information consumer<sup>5</sup>.</p>
<p><img alt="Dawn and Pete" src="http://www.menarik.co.uk/wp-content/uploads/2011/01/petedawn.jpg" class="alignleft" width="240" style="float:right;padding-left:20px;" /></p>
<p>None of this is to underplay quality issues and potential for abuse and manipulation, which exists as much in Quora as other QA systems.  The question is whether Quora can develop a large and loyal enough community to enable the ongoing accretion, filtering and refinement of good quality knowledge. It&#8217;s potential positive impact is significant &#8211; as a species we are predisposed to trust others on topics that we are not well equipped or primed to understand. Friendly, tailored advice caters to this. But this credibility is not without risk: while working on this I saw <a href="http://www.bbc.co.uk/programmes/b007nf70">Gavin and Stacey</a>, the one where Pete asked a social QA site &#8220;Should Pete divorce Dawn?&#8221; &#8211; the answer persuades them to give their marriage another shot. The complete lack of context and detail given in the question and their blind faith in the answer was the joke, but it also makes a point about how we may rate a stranger&#8217;s advice to a life-changing extent!</p>
<p><strong>Footnotes</strong><br />
1. Goldman, A.I. 1999, <em>Knowledge in a Social World</em>, Oxford.<br />
2. More recently, Goldman has recognised several shades of social epistemology, and suggests that the internet should be approached with a <a href="http://www.librarything.com/work/5625701/book/61722175">systems-oriented social epistemology</a>, where the individual components should be understood but their interactions and the emergent effect of their aggregation are also significant.<br />
3. Magnus, P.D. 2009, &#8220;On Trusting WIKIPEDIA&#8221;, <em>Episteme</em>, vol. 6, no. 1, pp. 74-90.<br />
4. Preston, C.J. 2008, &#8220;Braided learning: an emerging process observed in e-communities of practice&#8221;, <em>International Journal of Web Based Communities</em>, vol. 4, pp. 220-243(24).<br />
5. Rieh, S.Y. &#038; Danielson, D.R. 2007, &#8220;Credibility: A multidisciplinary framework&#8221;, <em>Annual Review of Information Science and Technology</em>, vol. 41, no. 1, pp. 307-364.</p>
<p>Images:<br />
Hands <a href="http://www.flickr.com/photos/coldtaxi/">Flickr/coldtaxi</a></p>
]]></content:encoded>
			<wfw:commentRss>http://www.menarik.co.uk/2011/01/social-epistemology-credibility-and-social-qa-interface/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The evolution of reason</title>
		<link>http://www.menarik.co.uk/2010/09/the-evolution-of-reason/</link>
		<comments>http://www.menarik.co.uk/2010/09/the-evolution-of-reason/#comments</comments>
		<pubDate>Wed, 29 Sep 2010 14:45:07 +0000</pubDate>
		<dc:creator>Paul Matthews</dc:creator>
				<category><![CDATA[Papers Reviewed]]></category>
		<category><![CDATA[argumentation]]></category>
		<category><![CDATA[bias]]></category>
		<category><![CDATA[social epistemology]]></category>

		<guid isPermaLink="false">http://www.menarik.co.uk/?p=430</guid>
		<description><![CDATA[The origin of reasoning is in preparing to persuade or believe others, this paper argues]]></description>
			<content:encoded><![CDATA[<p>Mercier, H. 2010, &#8220;<a href="http://www.springerlink.com/content/b6x2q79j450g3721/">The Social Origins of Folk Epistemology</a>&#8220;, Review of Philosophy and Psychology</p>
<p>In this paper, Mercier argues that an evolutionary account of reasoning that derives from social argumentation is more compelling than the classic view which frames it as a crown jewel of individual cognition. He sees reasoning as a necessary outcome of human communication, where the responder employs sophisticated coherence checking of newly received facts (tending towards conserving existing knowledge), and the sender employs techniques to select the most convincing premises to persuade the receiver (while striving to avoid an irreversible loss of trust).</p>
<p>Mercier draws support for this theory from the observations that:</p>
<ol>
<li><strong>Confirmation and disconfirmation bias are a common and robust experimental finding</strong>, with people finding support for their side of an issue much more readily than the other side;
</li>
<li><strong>Individual reasoning doesn&#8217;t necessarily lead to better decisions</strong>, and here Mercier cites evidence for poorer decisions when people are asked to consciously and explicitly consider their reasons first. </li>
<li><strong>Individual reasoning tends toward options that are easier to justify</strong>, and this may lead people away from normative answers toward those that would be easier to defend if called upon to do so.</ol>
<p>Individual reasoning is therefore &#8220;a tool of (anticipated) persuasion&#8221;.  Groups can perform better, on the other hand, perhaps due to a fairer balance between evaluation and persuasion. Here Mercier cites the &#8220;truth wins&#8221; and &#8220;assembly bonus effects&#8221;, where groups converge on correct answers, even where no individual members have these answers to begin with. In ideal social argumentation, the net effect can approach unbiased reasoning, because the type of biases named above provide the means to divide labour (each person only needs to find arguments for their own position and these are judged by the rest of the group on their merits).</p>
<p>Mercier does not always explain the evidence that he draws on very clearly, and it is tempting to wonder if he is exhibiting confirmation bias himself. Clearly neither the shortcomings of individual reasoning nor the merits of group decision making should be over-generalised. That said, there seems intuitively (i.e before engaging reason!?) to be much value in his approach in seeking a social, relationist origin to this cognitive capability. These ideas seem to fit well with that of enactive cognition &#8211; that individual cognitive structures are a consequence of a structurally coupled social system &#8211; and social epistemology, that social evidence should be considered a valid and primary source of knowledge.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.menarik.co.uk/2010/09/the-evolution-of-reason/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>The Social Epistemology of Wikipedia</title>
		<link>http://www.menarik.co.uk/2010/07/the-social-epistemology-of-wikipedia/</link>
		<comments>http://www.menarik.co.uk/2010/07/the-social-epistemology-of-wikipedia/#comments</comments>
		<pubDate>Wed, 21 Jul 2010 14:41:00 +0000</pubDate>
		<dc:creator>Paul Matthews</dc:creator>
				<category><![CDATA[Papers Reviewed]]></category>
		<category><![CDATA[philosophy]]></category>
		<category><![CDATA[social epistemology]]></category>
		<category><![CDATA[testimony]]></category>
		<category><![CDATA[wikipedia]]></category>

		<guid isPermaLink="false">http://www.menarik.co.uk/?p=405</guid>
		<description><![CDATA[Notes from a special issue of Episteme on the epistemology of mass collaboration. In the set of articles reviewed, philosophers consider Wikipedia from the perspective of theories of testimony and truth.
]]></description>
			<content:encoded><![CDATA[<p>Papers from <em>Episteme</em> 6 (1) The Epistemology of Mass Collaboration <a href="http://www.euppublishing.com/toc/epi/6/1">http://www.euppublishing.com/toc/epi/6/1</a></p>
<p>Tollefsen, D.P. (2009) WIKIPEDIA and the Epistemology of Testimony. <em>Episteme</em> 6 (1), 8-24.<br />
Wray, K.B. (2009) The Epistemic Cultures of Science and WIKIPEDIA: A Comparison. <em>Episteme</em> 6 (1), 38-51.<br />
Sanger, L.M. (2009) The Fate of Expertise after WIKIPEDIA. <em>Episteme</em> 6 (1), 52-73.<br />
Magnus, P.D. (2009) On Trusting WIKIPEDIA.<em> Episteme</em> 6 (1), 74-90.</p>
<p>In this set of articles, philosophers considered the implications of Wikipedia on theories of testimony, taking various positions on its role and significance as a knowledge source. A common theme is what our use of a collaborative but mostly anonymous system tells us about human credulity. A further thread is how many of the cues we normally use to assess credibility are not available in Wikipedia. Although there still ways to determine how accurate its articles are, we don&#8217;t actually tend to use them in practice. In taking the philosophical tack, the authors join up our current knowledge seeking habits with traditions within philosophy, but also remind us of the huge power and reach of Wikipedia.</p>
<p>In the first article, Tollefsen claims that Wikipedia  can count as testimony on the assurance principle &#8211; that the authors are providing some assurance about the correctness of the content. Moreover, it has many qualities of group testimony as distinct from the aggregation of individual testimonies. She notes that the more well established articles represent a balanced view of the topic (as we&#8217;ll see this is perhaps overidealistic and is not entirely corroborated by the other authors.),  seen in a way as the settling of the group mind on an issue. As we don&#8217;t know the individuals or groups involved in authorship, we are also to some extent placing our trust in a system. </p>
<p>Tollefsen largely supports an &#8216;antireductionist&#8217; approach to justification, where you you are justified in believing others as long as you have no reason to believe them insincere. In the case of Wikipedia, however, she concludes that default entitlement to trust does not hold, and that we need to treat Wikipedia more cautiously, like &#8220;talking to a child&#8221;. Later on, she notes that we monitor testimony by checking it against our background beliefs, and this is how we may choose to accept the content of articles.</p>
<p>Wray takes the approach of comparing the process behind wiki articles to the practices of scientists, and comes out more in favour of science as a sound knowledge making practice. He observes how hard it is to authenticate online claims, quoting Shapin that &#8220;trust is ubiquitous in knowledge societies&#8221;. He further notes that the reliability of wikipedia is based on &#8216;invisible hand justification&#8217; &#8211; in that there is no one person in charge of quality assurance on an article. </p>
<p>Wray sees wikipedians as having less to lose than collaborating scientists and therefore less incentive to be honest and conscientious. That said, he does go on to note that the rush to publish in science is a knowledge anti-pattern and may lead to as many accuracy issues as the rush to post online, which he casts as a form of gossip.</p>
<p>Wray compares egoism and non-egoism as justification in testimony. An egoist approach would say that you need to know the character of the person testifying, and a non-egoist would say it is just enoughto be told someone else believes something. But he concludes that neither extreme really holds in Wikipedia &#8211; we dont know the reputation of the authors nor if they really believe what they tell us.</p>
<p>Sanger, a philosopher as well as one of the originators of Wikipedia discusses whether experts still needed now that non-experts can so effectively aggregate knowledge. He concludes, however, that the role of the expert has been too far downplayed in Wikipedia, as experts may be pushed out or ignored. </p>
<p>In conflict with Tollefsen&#8217;s views about the &#8220;balanced&#8221; article emerging over time, Sanger claims that article quality may be a function of  the persistance and aggression of the article&#8217;s followers, not just the lifetime of the article. In content wars, experts may be driven off by aggressive amateurs with more time on their hands.</p>
<p>Magnus asks what it means to trust Wikipedia as a knowledge source. He notes that in the famous (though controversial) Nature study, Wikipedia proved more variable and slightly less accurate than Brittanica, though perhaps in Wikipedia omissions are more common than inaccuracy.</p>
<p>Magnus provides a useful list of cues that we may use in assessing written testimony including:  authority, plausibility of style and content, calibration and sampling with usual sources. He notes that these come undone on Wikipedia. Authority is hard to determine &#8211; and not guaranteed with referencing. Style can be edited out. There is a real risk of plausible falsehoods in the content. Calibration may only occur on widely known, as opposed to more specialist portions.</p>
<p>Magnus offers some solutions: Follow sources, interpret the palimpsest &#8211; the evolution of the article &#8211; from the discuss pages, and link to the dated version of the page. All things that are possible, but even academics probably don&#8217;t do.</p>
<p>While the more cautious views on how we &#8220;should&#8221; treat Wikipedia presented in this journal are coming from experienced academics, I am fairly sure that the position of the average user &#8211; including students and teachers &#8211; is much more toward anti-egoism and anti-reductionism. Thomas Reid&#8217;s principle of credulity &#8211; that we are predisposed to trust our fellow man &#8211;  certainly holds in the large part, especially as often there is not much at stake if we are taking on false beliefs. As the need for correctness increases, however, then so does the need to spend time (and cognitive resources) in assessing credibility.</p>
<p>One area that I felt was not well covered in these papers was the tendency for online information to be recycled. The verifiability principle mentioned by Sanger works well, though often means that large sections are lifted directly from pre-existing sources on the topic, many of which are available online, many of which are written by &#8220;experts&#8221;. I expect that there is research available on the extent to which Wikipedia articles are largely paraphrased prior texts, and will report it here if I come across it.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.menarik.co.uk/2010/07/the-social-epistemology-of-wikipedia/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>ISKO event &#8211; digital cultural heritage</title>
		<link>http://www.menarik.co.uk/2010/06/isko-event-digital-cultural-heritage/</link>
		<comments>http://www.menarik.co.uk/2010/06/isko-event-digital-cultural-heritage/#comments</comments>
		<pubDate>Thu, 10 Jun 2010 14:31:50 +0000</pubDate>
		<dc:creator>Paul Matthews</dc:creator>
				<category><![CDATA[Meetings]]></category>
		<category><![CDATA[crowdsourcing]]></category>
		<category><![CDATA[digital heritage]]></category>
		<category><![CDATA[isko]]></category>

		<guid isPermaLink="false">http://www.menarik.co.uk/?p=384</guid>
		<description><![CDATA[Some notes from a set of talks at UCL on 9th June.]]></description>
			<content:encoded><![CDATA[<p><strong><a href="http://www.iskouk.org/cultural_heritage_jun2010.htm">ISKO Seeing is Believing</a> : New technologies for cultural heritage</strong></p>
<p>9 Jun 2010, UCL</p>
<p>A set of interesting talks from researchers, museum staff and a technology supplier on digital heritage. Common themes were enhancing interaction and engagement with collections through digital media and the crowdsourcing of transcription work with its attendant risks and benefits. Crowdsourcing is seen having particular potential at a time when it is hard to secure funding for digitisation and online projects.</p>
<p><strong>1. David Arnold, University of Brighton</strong></p>
<p>David Arnold presented on advances in 3D object representations for heritage collections. Not really my area this, so I only picked up a few points:</p>
<ul>
<li><a href="http://www.3d-coform.eu/">3D Coform Project</a> – part of big EU project with museums. </li>
<li>Phased adoption of technology – first phase just reproduces what you do manually. Subsequent phases only possible with tech.</li>
<li>Every time we digitise, we create potential legacy problems with the data</li>
</ul>
<p>Tools: </p>
<ul>
<li>triangulated laser scanning,  KULeuvan ARC 3D tool</li>
<li>Sourceforce FOSS mesh processing</li>
</ul>
<p>Example: <a href="http://www.publicsculpturesofsussex.co.uk/">publicscupturesofsussex.co.uk</a> – map + 3d pics</p>
<p>Metadata: 3D COFORM adopted CIDOC-CRM &#038; extension to “Digital Provenance”</p>
<p><strong>2. Andy Hudson-Smith, UCL (cf <a href="http://digitalurban.blogspot.com/">Digitalurban blog</a>)</strong></p>
<p>Andy spoke about a new project aimed to attach memories to objects: Tales of Things and electronic Memory (TOTeM). Site is:  <a href="http://www.talesofthings.com">talesofthings.com</a> </p>
<p><a href="http://www.talesofthings.com/"><img src="http://www.menarik.co.uk/wp-content/uploads/2010/06/tot_ss-300x268.png" alt="tales of things website" title="tot_ss" width="300" height="268" class="alignleft size-medium wp-image-392" /></a></p>
<p>You take a photo of an object/building etc.. of personal significance to you, upload it to their website and get a QR code sticker. API under development. </p>
<p>The object is then tagged and you can add your memories to it via the web. Object also tweets on updates. Audioboo for the audio tagging.</p>
<p>Apps: talking Oxfam objects (what happened to my old trousers?), artist adds QR to art to record people&#8217;s reactions to his art.</p>
<p>Plan to make QR code out of mosaic tiles for Shoreditch church.</p>
<p>“<a href="http://journal.media-culture.org.au/index.php/mcjournal/article/viewArticle/209/0">Tagging is Connecting</a>” paper M/C journal</p>
<p>Follow project at  <a href="twitter.com/talesofthings">twitter.com/talesofthings </a></p>
<p>Alzheimers link – triggering memories in sufferers.</p>
<p><strong>3. Melissa Terras, UCL</strong></p>
<p>Melissa spoke about her &#8220;Transcribe Bentham&#8221; Project &#038; Crowdsourcing the transcriptions.</p>
<p>Jeremy Bentham – large amount of papers &#038; material on him at UCL. </p>
<p>Crowdsourcing roots. Often those outside institutions took lead in exploiting new tech. Old idea – e.g. metal detectors revolutionising archeology with uptake by enthusiasts. </p>
<p>Holley, R D-Lib Magazine. “<a href="http://www.dlib.org/dlib/march10/holley/03holley.html">Crowdsourcing: how and why should libraries do it</a>?”</p>
<p>Crowdsourcing successes:</p>
<ul>
<li>Galaxy Zoo – 60 million galaxies classified</li>
<li>Australian Newspapers Digitisation Program www.nla.gov.au/ndp. 9000+ members thousands of lines corrected</li>
<li>V &#038; A – crowdsourcing best crops of images from collections. 14,000 to date.</li>
<li>Family search indexing of family trees www.familysearch.org</li>
</ul>
<p>and many more..</p>
<p>Learning from crowdsourcing experience:</p>
<ul>
<li>Majority of work is done by 10% of users</li>
<li>Personal interest &#038; reward – use of cognitive surplus</li>
<li>
Pensioners, disabled, terminally ill particularly keen</li>
<li>Builds up IT expertise, “addictive”, rewards = ranking</li>
<p>.</ul>
<p>Rose Holley&#8217;s checklist (on dlib). Launching system in July. </p>
<p><a href="www.ucl.ac.uk/transcribe-bentham">www.ucl.ac.uk/transcribe-bentham</a></p>
<p>..system will be based on mediawiki</p>
<p>Issues :</p>
<ul>
<li>linking transcription, catalogue &#038; image</li>
<li>Undoing vandalism</li>
<li>Quality control</li>
</ul>
<p>#TranscriBentham</p>
<p><strong>4. Fiona Romeo, National Maritime Museum </strong></p>
<p>Fiona also presented a range of NMM digital projects including crowdsourding of transcripts also. </p>
<p>“Your Paintings” project with BBC. Public Catalogue Foundation.</p>
<p><a href="http://www.flickr.com/photos/nationalmaritimemuseum/">NMM photos on Flickr</a>. Get foreign language tags as well as semantics (content related) . Some additional annotations. Get questions, which helps guide information design. Indicommons project, make it easier to share content.</p>
<p>Naval-history.net Naval history community – connected with them and got them to work on a set of scans – CC sharealike licenced.</p>
<p>NMM have a new wing under development dedicated to digital interaction and a number of interesting ideas for social sharing and enhancement of objects  in the collection.</p>
<p><strong>5. Sascha Curzon, Gallery Systems</strong></p>
<p>Sascha presented the <a href="http://www.emuseum.com/">eMuseum</a> federated search system, a nonprofit project which enables museums and galleries to expose  their catalogues, enabling end users to  search across collections. The aggregated data will also be available as XML and RDF.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.menarik.co.uk/2010/06/isko-event-digital-cultural-heritage/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Asking the audience</title>
		<link>http://www.menarik.co.uk/2010/06/group-wisdom/</link>
		<comments>http://www.menarik.co.uk/2010/06/group-wisdom/#comments</comments>
		<pubDate>Thu, 03 Jun 2010 21:56:49 +0000</pubDate>
		<dc:creator>Paul Matthews</dc:creator>
				<category><![CDATA[Books Reviewed]]></category>
		<category><![CDATA[groupthink]]></category>
		<category><![CDATA[social epistemology]]></category>
		<category><![CDATA[wisdom]]></category>

		<guid isPermaLink="false">http://www.menarik.co.uk/?p=345</guid>
		<description><![CDATA[Bites from the 2006 book "Infotopia" by Cass Sunstein on the aggregation of knowledge]]></description>
			<content:encoded><![CDATA[<p>SUNSTEIN, C.R., 2006. <em>Infotopia: How many minds produce knowledge</em>. New York: Oxford</p>
<p><img src="http://www.menarik.co.uk/wp-content/uploads/2010/06/infotopia-199x300.jpg" alt="infotopia" title="infotopia" width="120" class="alignleft size-medium wp-image-348" /></p>
<p>I only discovered this book recently but it contains some interesting insights into group decision making, providing a range of evidence to support &mdash; often counterintuitive &mdash; observations about the best conditions for knowledge creation. As such, it relates well to <a href="http://fas-philosophy.rutgers.edu/goldman/">Alvin Goldman&#8217;s</a> &#8220;veritistic selection practices&#8221; &mdash; social design aimed at maximising the truth value of aggregated opinion.</p>
<p>Produced at a time when prediction markets, blogs and wikis were attracting growing interest, it is the book&#8217;s analysis of the more traditional social collaborative processes that stood out for me.</p>
<p>A central idea early in the book is the Condorcet Jury Theorem, which states that &mdash; on binary and numerical judgements at least &mdash; a group is more likely to be correct than an individual under the conditions that:-</p>
<ol>
<li>Majority rule is used</li>
<li>Each individual is more likely than not to be correct</li>
</ol>
<p>Notably, only some of the group need to know the correct answer, as the rest will answer randomly preserving the effect. This is redolent of Millionaire&#8217;s &#8220;Ask the Audience&#8221; option, which another book <a href="http://www.amazon.co.uk/Wisdom-Crowds-Many-Smarter-Than/dp/0349116059/">The Wisdom of Crowds</a> claimed is correct <b>91%</b> of the time, more often than the phone a friend option. <a href="http://www.smoking-bun.com/?p=9">Some warn against</a>  calling this mathematical effect &#8220;wisdom&#8221;, however.</p>
<p><a href="http://www.menarik.co.uk/wp-content/uploads/2010/06/wwtbam_audience_response_4.jpg"><img src="http://www.menarik.co.uk/wp-content/uploads/2010/06/wwtbam_audience_response_4-300x225.jpg" alt="ask the audience screenshot" title="wwtbam_audience_response_4" width="300" height="225" class="alignleft size-medium wp-image-401" /></a></p>
<p>Sunstein goes on to provide a range of evidence supporting the observation that deliberation does not lead necessarily lead to more accurate outcomes than polled, statistical samples. This may be due to:-</p>
<ol>
<li>&#8220;<strong>hidden profiles</strong>&#8220;, or accurate information available to a group which is not accessed in deliberation (perhaps because only a few members of the group possess it, or are reluctant to go against the grain and challenge the majority view);</li>
<li> The &#8220;<strong>common knowledge effect</strong>&#8220;, where shared knowledge has more of an influence on the group&#8217;s decision than information held individually;</li>
<li><strong>Informational</strong> and <strong>reputational cascades</strong>, where individuals change their view or the willingness to speak out after hearing the views of others;</li>
<li><strong>Group polarisation</strong>, where members of a group end up with more extreme views &mdash; in line with their initial tendencies &mdash; after debate.</li>
</ol>
<p>What this all mean for knowledge creation on the social web? Perhaps that giving scores and weighting to reputation, while establishing authority and expertise, may also dissuade or detract from informed dissent, and that such dissent may also warrant recognition. </p>
<p>In <a href="http://www.ethanzuckerman.com/blog/2006/11/30/cass-sunsteins-infotopia/">another review of the book</a>, Ethan Zuckerman noted:-</p>
<p>&#8220;A large number of the most interesting projects taking place on the internet use strategies to aggregate information from multiple users to create new knowledge &#8230;. analyzing these systems in terms of their effectiveness in getting people to reveal hidden knowledge is, in my opinion, an excellent framework for evaluation.&#8221;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.menarik.co.uk/2010/06/group-wisdom/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>information systems research: describe, critique or design?</title>
		<link>http://www.menarik.co.uk/2010/02/information-systems-research-describe-critique-or-design/</link>
		<comments>http://www.menarik.co.uk/2010/02/information-systems-research-describe-critique-or-design/#comments</comments>
		<pubDate>Thu, 11 Feb 2010 13:58:44 +0000</pubDate>
		<dc:creator>Paul Matthews</dc:creator>
				<category><![CDATA[Research]]></category>
		<category><![CDATA[is]]></category>
		<category><![CDATA[theory]]></category>

		<guid isPermaLink="false">http://www.menarik.co.uk/?p=313</guid>
		<description><![CDATA[At what level does IS research work best? This post argues for more direct involvement in design from the research community.
(image flickr/talios)]]></description>
			<content:encoded><![CDATA[<p>I sometimes find it hard to see the practical value of some IS research.  Researchers argue for a well-supported theoretical angle, come up with a pragmatic method and may present interesting observations and insights into the particular system that has been investigated. When it comes to design recommendations arising from the research, however, these often seem trite or added as an afterthought. </p>
<p>If theory and research are meant to inform practice, recommendations need to be current, relevant,  and actionable. This can be hard to achieve in a domain that develops so quickly. Checkland (1998) observed a while ago how it is very hard for theory to keep up and be informative, thereby failing to match the &#8220;ideal&#8221; dynamic:</p>
<div id="attachment_320" class="wp-caption aligncenter" style="width: 510px"><img src="http://www.menarik.co.uk/wp-content/uploads/2010/02/checkland_cycle.png" alt="Checkland&#039;s 'ideal' dynamic for theory informing practice" title="checkland_cycle" width="500"  class="size-full wp-image-320" /><p class="wp-caption-text">Checkland's ideal for theory informing practice</p></div>
<p>Whilst a range of research-theoretical stances can be taken, justified and accepted by the research community, only some seem to be close enough to practice to usefully inform it. This is where the researchers are either experimenting directly through design, or are involved enough to really engage with the language, problems and possible solutions in the domain. So I would favour interpretive or action research approaches in this regard:</p>
<div id="attachment_326" class="wp-caption aligncenter" style="width: 510px"><img src="http://www.menarik.co.uk/wp-content/uploads/2010/02/levels1.png" alt="Levels of engagement and theoretical stance (based on Aakhus &amp; Jackson (2005))" title="levels" width="500"  class="size-full wp-image-326" /><p class="wp-caption-text">Levels of engagement and theoretical stance (based on Aakhus &#038; Jackson (2005))</p></div>
<p>Whilst the more descriptive, long view can still be valuable, it is hard to <em>realise</em> this value unless the knowledge gained is acted on by those implementing new designs and approaches. As they are the ones reading the research, this should perhaps be the research community itself in the first instance. Design improvement can then be further communicated through demonstrators, but these need to be compelling enough to be taken seriously.</p>
<p><strong>References</strong></p>
<p>1. Aakhus, M. and Jackson, S., 2005. Technology, Interaction and Design. In: Fitch, K.L. and Sanders, R.E.( eds) <em>Handbook of Language and Social Interaction</em>. LEA, pp. 411-433<br />
2. Checkland, P .and Holwell ,S.  1998.  <em>Information, Systems and Information Systems: Making Sense of the Field</em>  John Wiley &#038; Sons, Inc. New York, NY, USA</p>
]]></content:encoded>
			<wfw:commentRss>http://www.menarik.co.uk/2010/02/information-systems-research-describe-critique-or-design/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Book Review: Metadata and Semantics</title>
		<link>http://www.menarik.co.uk/2010/01/book-review-metadata-and-semantics/</link>
		<comments>http://www.menarik.co.uk/2010/01/book-review-metadata-and-semantics/#comments</comments>
		<pubDate>Thu, 14 Jan 2010 15:18:04 +0000</pubDate>
		<dc:creator>Paul Matthews</dc:creator>
				<category><![CDATA[Books Reviewed]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[ontology]]></category>
		<category><![CDATA[semantics]]></category>

		<guid isPermaLink="false">http://www.menarik.co.uk/?p=295</guid>
		<description><![CDATA[A review of the proceedings of the 2nd International Conference on Metadata and Semantics Research, held in Corfu in October 2007

(Image:flickr:dullhunk)]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.springer.com/book/978-0-387-77744-3"><img alt="" src="http://www.springer.com/cda/content/image/cda_displayimage.jpg?SGWID=0-0-16-475462-0" title="Cover" class="alignleft" width="95" height="145" /></a><br />
Metadata and Semantics<br />
<a href="http://www.springer.com/book/978-0-387-77744-3">http://www.springer.com/book/978-0-387-77744-3</a><br />
Sicilia, M.; Lytras, M. (eds)<br />
2009 Springer 549pp</p>
<p><em><strong>Originally reviewed for the <a href="http://irsg.bcs.org/index.php">BCS Information Retrieval Specialists Group</a> Newsletter &#8220;<a href="http://irsg.bcs.org/display_informers.php">The Informer</a>&#8220;</strong></em></p>
<p>This volume contains the proceedings of the <strong>2nd International Conference on Metadata and Semantics Research</strong>, held in Corfu in October 2007. The conference had a 50% acceptance rate, with most of the presented papers being published in the book.</p>
<p>Despite the fact that many papers present only interim or early research work, the volume provides a fascinating overview of a cross section of metadata and semantics research in both pure form and as applied in a variety of domains, including those of cultural heritage, education and agriculture. In addition to the advances in these individual fields and domains, one is able to determine some general trends and patterns running through the work presented. Perhaps overriding all is the sense that many researchers and archivists are struggling with the problem of balancing the complexity of describing information resources within a domain adequately, while at the same time ensuring that discrete sets of knowledge can actually be successfully linked and that systems which can guide the user in retrieval and discovery can be usefully developed on top of these structures.</p>
<p>Immediately apparent as one reads the volume is the dominance of semantic web approaches to metadata and knowledge modelling, influencing migrations from stand-alone formats and conventions to those based on semantic web standards such as RDF and OWL.  Sections are devoted to semantic web applications and to ontology engineering.  In several places one sees how such approaches can provide the missing semantic aspect to rather more well-established syntactic rules and begin to enable reasoning and knowledge discovery through the use of ontologies. In <strong>Semantic Application Profiles: A Means to Enhance Knowledge Discovery in Domain Metadata Models</strong>, for instance, Koutsomitropoulos and colleagues show how the CIDOC Conceptual Reference Model for cultural heritage can be represented in OWL and used to discover related works (using a painting example).  In <strong>Capturing MPEG-7 Semantics</strong>, Dasiopoulou et al, argue that a semantic model is needed to enhance and disambiguate multimedia mark-up enabled by the MPEG-7 format.  Also very  much worthy of mention in this regard is good the article by Voss, <strong>Encoding changing country codes for the Semantic Web with ISO 3166 and SKOS</strong> which demonstrates with a grounded example how changes and versioning can be accommodated using the RDF based Simple Knowledge Organisation System.  This is clearly paramount when considering the provision of globally applicable yet cohesive and adaptive data sets.</p>
<p>The complexity of and unwieldiness of ontologies has been an issue in the past and several of the papers address this by providing either automated means of deriving ontology structures or reviewing tools used (or needed)  for ontology and semantic web data manipulation.  </p>
<p>In the first category,  <strong>Ensemble Learning of Economic Taxonomy Relations from Modern Greek Corpora</strong> by Kermanidis shows how a number of clustering algorithms can be applied in combination to arrive at quite a high level of accuracy in deriving relations, based only on the learning sets themselves.  In <strong>A new Formal Concept Analysis-based learning approach to Ontology Building</strong>, Jia and colleagues promote the FCA approach in providing a lattice of concepts rather than the hierarchy given by other popular clustering methods, which they argue as being more “true to life”. Jia et al then show how the derived ontology can then be used to provide a query refinement/expansion method. </p>
<p>In terms of user tools, Cardoso provides a useful state of the art in use of software to manipulate RDF data and ontologies in <strong>The Semantic Web : A mythical story or solid reality</strong> ,  and Enokksson et al address the problem of the remote and in-place modification of RDF in <strong>An RDF Modification Protocol based on the needs of Editing Tools</strong>.</p>
<p>While there are a number of papers that demonstrate derivations and developments of specialised or extended ontologies and metadata schemes, I found the more pragmatic papers to be more valuable and progressive for the way they manage to remain in touch with the ultimate aim of their work. A nice balance between the convenience of machine cataloguing and the need for oversight of human cataloguers is struck in Whitelaw and Collins’ <strong>Pragmatic support for taxonomy-based annotation of structured digital documents</strong>. The authors describe the derivation of metadata for Open University course documents (based on IEEE Learning Object Metadata) using a series of more- or less automated approaches and then having these evaluated by library cataloguing staff.  The optimal solution was found to be where terms are suggested from the vocabulary based on document content, but where the staff member can click through to the definition before confirming the use of the term as metadata. The authors also begin to look at the cost benefit ratio of collecting more detailed metadata to a slight increase in cataloguing effort. While these authors don’t get as far as exploring the efficiency of retrieval of documents, this is handled in Cervera et als’ <strong>Quality Metrics in Learning Objects</strong> which stresses the reusability objective and notes that quality metrics are crucial to the navigation of resources. Notably, Cervera et al describe the need for socially based rankings and ratings to be incorporated into the metadata scheme. </p>
<p>A further example of attempts to enhance the usability and comparability of metadata is provided by Nilsson et al in <strong>Formalizing Dublin Core Application Profiles: Description Set Profiles and Graph Constraints</strong>. A number of practical measures for introducing conventions for how metadata elements are used and coded are presented, including standard templates and the use of a wiki to provide for easy human editing of profiles which can then also be exported at XML, thereby bridging the gap between qualitative and more structured usage guides.</p>
<p>Against this general background of attempts to standardise, enable interoperability between collections and provide enhanced semantic infrastructures you have the inherent messiness of human decisions as to meaning and the complexity of the encoding act itself, something that is probably not discussed enough in this community but which is covered in Scifleet et al’s <strong>The Human Art of Encoding: Markup as Documentary Practice</strong>. The authors note that metadata cataloguing is socially situated and may often be idiosyncratic, though the extent to which this may affect quality and comparability is poorly known. They report on a programme of research to study this variety in the practice of participants in the Text Encoding Initiative (TEI).  Unfortunately their full findings are not covered in this volume but are well worth following up for those interested in this important question.</p>
<p>In summary, this is an interesting and varied read, with enough “current state of the art” review papers to interest the non-specialist as well as some good fodder for those already wrestling with metadata,  semantics and knowledge representation issues within particular domains.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.menarik.co.uk/2010/01/book-review-metadata-and-semantics/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Online Information 2009 &#8211; A Twitter Timeline</title>
		<link>http://www.menarik.co.uk/2009/12/online-information-2009-a-twitter-timeline/</link>
		<comments>http://www.menarik.co.uk/2009/12/online-information-2009-a-twitter-timeline/#comments</comments>
		<pubDate>Wed, 09 Dec 2009 18:42:16 +0000</pubDate>
		<dc:creator>Paul Matthews</dc:creator>
				<category><![CDATA[Meetings]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[#online09]]></category>
		<category><![CDATA[conferences]]></category>
		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://www.menarik.co.uk/?p=269</guid>
		<description><![CDATA[An interactive timeline of tweets from Onine Information 2009]]></description>
			<content:encoded><![CDATA[<p>It was disappointing to miss <a href="http://www.online-information.co.uk/index.html">Online Information</a> this year, so I have been trying to catch up via the Twitter stream. Fortunately the #online09 tweets have been archived to <a href="http://twapperkeeper.com/online09/">Twapper Keeper</a> (via <a href="http://twitter.com/briankelly">@briankelly</a>). This is great, as the site makes them downloadable. </p>
<p>To see them in the context of the conference programme, I converted the tweets and the event schedule  to XML suitable for display in a <a href="http://www.simile-widgets.org/timeline/">Simile Timeline</a>. Click the screenshot below to access the resulting <a href="http://www.menarik.co.uk/tweetline">timeline mashup</a> (nb tweets take a few seconds to load).</p>
<p> <a href="http://www.menarik.co.uk/tweetline"><img src="http://www.menarik.co.uk/wp-content/uploads/2009/12/tweetline_cap.png" alt="tweetline_cap" title="tweetline_cap" width="500" height="253" class="aligncenter size-full wp-image-271" style="clear:both;" /></a></p>
<p><strong>What would have made this exercise easier, and the conference tweets easier to track</strong>:</p>
<ul>
<li>If the conference programme had been published in a more mashable format eg iCal / XML</li>
<li>If there was a way to identify a particular conference track (a sub hashtag? &#8211; or is that overkill?)</li>
<li>A way to distinguish critical comments, reportage, and general flim-flam. Though perhaps that is part of the fun!</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.menarik.co.uk/2009/12/online-information-2009-a-twitter-timeline/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
	</channel>
</rss>
<!-- WP Super Cache is installed but broken. The path to wp-cache-phase1.php in wp-content/advanced-cache.php must be fixed! -->
