<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Menarik &#187; Paul Matthews</title>
	<atom:link href="http://www.menarik.co.uk/author/paulusm/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.menarik.co.uk</link>
	<description>my kind of interesting</description>
	<lastBuildDate>Tue, 10 Aug 2010 05:58:59 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>The Social Epistemology of Wikipedia</title>
		<link>http://www.menarik.co.uk/2010/07/the-social-epistemology-of-wikipedia/</link>
		<comments>http://www.menarik.co.uk/2010/07/the-social-epistemology-of-wikipedia/#comments</comments>
		<pubDate>Wed, 21 Jul 2010 14:41:00 +0000</pubDate>
		<dc:creator>Paul Matthews</dc:creator>
				<category><![CDATA[Papers Reviewed]]></category>
		<category><![CDATA[philosophy]]></category>
		<category><![CDATA[social epistemology]]></category>
		<category><![CDATA[testimony]]></category>
		<category><![CDATA[wikipedia]]></category>

		<guid isPermaLink="false">http://www.menarik.co.uk/?p=405</guid>
		<description><![CDATA[Notes from a special issue of Episteme on the epistemology of mass collaboration. In the set of articles reviewed, philosophers consider Wikipedia from the perspective of theories of testimony and truth.
]]></description>
			<content:encoded><![CDATA[<p>Papers from <em>Episteme</em> 6 (1) The Epistemology of Mass Collaboration <a href="http://www.euppublishing.com/toc/epi/6/1">http://www.euppublishing.com/toc/epi/6/1</a></p>
<p>Tollefsen, D.P. (2009) WIKIPEDIA and the Epistemology of Testimony. <em>Episteme</em> 6 (1), 8-24.<br />
Wray, K.B. (2009) The Epistemic Cultures of Science and WIKIPEDIA: A Comparison. <em>Episteme</em> 6 (1), 38-51.<br />
Sanger, L.M. (2009) The Fate of Expertise after WIKIPEDIA. <em>Episteme</em> 6 (1), 52-73.<br />
Magnus, P.D. (2009) On Trusting WIKIPEDIA.<em> Episteme</em> 6 (1), 74-90.</p>
<p>In this set of articles, philosophers considered the implications of Wikipedia on theories of testimony, taking various positions on its role and significance as a knowledge source. A common theme is what our use of a collaborative but mostly anonymous system tells us about human credulity. A further thread is how many of the cues we normally use to assess credibility are not available in Wikipedia. Although there still ways to determine how accurate its articles are, we don&#8217;t actually tend to use them in practice. In taking the philosophical tack, the authors join up our current knowledge seeking habits with traditions within philosophy, but also remind us of the huge power and reach of Wikipedia.</p>
<p>In the first article, Tollefsen claims that Wikipedia  can count as testimony on the assurance principle &#8211; that the authors are providing some assurance about the correctness of the content. Moreover, it has many qualities of group testimony as distinct from the aggregation of individual testimonies. She notes that the more well established articles represent a balanced view of the topic (as we&#8217;ll see this is perhaps overidealistic and is not entirely corroborated by the other authors.),  seen in a way as the settling of the group mind on an issue. As we don&#8217;t know the individuals or groups involved in authorship, we are also to some extent placing our trust in a system. </p>
<p>Tollefsen largely supports an &#8216;antireductionist&#8217; approach to justification, where you you are justified in believing others as long as you have no reason to believe them insincere. In the case of Wikipedia, however, she concludes that default entitlement to trust does not hold, and that we need to treat Wikipedia more cautiously, like &#8220;talking to a child&#8221;. Later on, she notes that we monitor testimony by checking it against our background beliefs, and this is how we may choose to accept the content of articles.</p>
<p>Wray takes the approach of comparing the process behind wiki articles to the practices of scientists, and comes out more in favour of science as a sound knowledge making practice. He observes how hard it is to authenticate online claims, quoting Shapin that &#8220;trust is ubiquitous in knowledge societies&#8221;. He further notes that the reliability of wikipedia is based on &#8216;invisible hand justification&#8217; &#8211; in that there is no one person in charge of quality assurance on an article. </p>
<p>Wray sees wikipedians as having less to lose than collaborating scientists and therefore less incentive to be honest and conscientious. That said, he does go on to note that the rush to publish in science is a knowledge anti-pattern and may lead to as many accuracy issues as the rush to post online, which he casts as a form of gossip.</p>
<p>Wray compares egoism and non-egoism as justification in testimony. An egoist approach would say that you need to know the character of the person testifying, and a non-egoist would say it is just enoughto be told someone else believes something. But he concludes that neither extreme really holds in Wikipedia &#8211; we dont know the reputation of the authors nor if they really believe what they tell us.</p>
<p>Sanger, a philosopher as well as one of the originators of Wikipedia discusses whether experts still needed now that non-experts can so effectively aggregate knowledge. He concludes, however, that the role of the expert has been too far downplayed in Wikipedia, as experts may be pushed out or ignored. </p>
<p>In conflict with Tollefsen&#8217;s views about the &#8220;balanced&#8221; article emerging over time, Sanger claims that article quality may be a function of  the persistance and aggression of the article&#8217;s followers, not just the lifetime of the article. In content wars, experts may be driven off by aggressive amateurs with more time on their hands.</p>
<p>Magnus asks what it means to trust Wikipedia as a knowledge source. He notes that in the famous (though controversial) Nature study, Wikipedia proved more variable and slightly less accurate than Brittanica, though perhaps in Wikipedia omissions are more common than inaccuracy.</p>
<p>Magnus provides a useful list of cues that we may use in assessing written testimony including:  authority, plausibility of style and content, calibration and sampling with usual sources. He notes that these come undone on Wikipedia. Authority is hard to determine &#8211; and not guaranteed with referencing. Style can be edited out. There is a real risk of plausible falsehoods in the content. Calibration may only occur on widely known, as opposed to more specialist portions.</p>
<p>Magnus offers some solutions: Follow sources, interpret the palimpsest &#8211; the evolution of the article &#8211; from the discuss pages, and link to the dated version of the page. All things that are possible, but even academics probably don&#8217;t do.</p>
<p>While the more cautious views on how we &#8220;should&#8221; treat Wikipedia presented in this journal are coming from experienced academics, I am fairly sure that the position of the average user &#8211; including students and teachers &#8211; is much more toward anti-egoism and anti-reductionism. Thomas Reid&#8217;s principle of credulity &#8211; that we are predisposed to trust our fellow man &#8211;  certainly holds in the large part, especially as often there is not much at stake if we are taking on false beliefs. As the need for correctness increases, however, then so does the need to spend time (and cognitive resources) in assessing credibility.</p>
<p>One area that I felt was not well covered in these papers was the tendency for online information to be recycled. The verifiability principle mentioned by Sanger works well, though often means that large sections are lifted directly from pre-existing sources on the topic, many of which are available online, many of which are written by &#8220;experts&#8221;. I expect that there is research available on the extent to which Wikipedia articles are largely paraphrased prior texts, and will report it here if I come across it.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.menarik.co.uk/2010/07/the-social-epistemology-of-wikipedia/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Once upon a life: Jeanette Winterson</title>
		<link>http://www.menarik.co.uk/2010/06/once-upon-a-life-jeanette-winterson/</link>
		<comments>http://www.menarik.co.uk/2010/06/once-upon-a-life-jeanette-winterson/#comments</comments>
		<pubDate>Thu, 17 Jun 2010 16:45:56 +0000</pubDate>
		<dc:creator>Paul Matthews</dc:creator>
				<category><![CDATA[Asides]]></category>

		<guid isPermaLink="false">http://www.menarik.co.uk/?p=403</guid>
		<description><![CDATA[Jeanette Winterson, talking about opening her own shop instead of selling out to a coffee chain: 
&#8220;The coffee offer forced me to focus on what I would really like to happen, instead of either doing nothing or passively accepting what someone else would like to happen&#8221;
]]></description>
			<content:encoded><![CDATA[<p>Jeanette Winterson, <a href="http://www.guardian.co.uk/lifeandstyle/2010/jun/13/once-upon-a-life-jeanette-winterson">talking about opening her own shop</a> instead of selling out to a coffee chain: </p>
<p>&#8220;The coffee offer forced me to focus on what I would really like to happen, instead of either doing nothing or passively accepting what someone else would like to happen&#8221;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.menarik.co.uk/2010/06/once-upon-a-life-jeanette-winterson/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>ISKO event &#8211; digital cultural heritage</title>
		<link>http://www.menarik.co.uk/2010/06/isko-event-digital-cultural-heritage/</link>
		<comments>http://www.menarik.co.uk/2010/06/isko-event-digital-cultural-heritage/#comments</comments>
		<pubDate>Thu, 10 Jun 2010 14:31:50 +0000</pubDate>
		<dc:creator>Paul Matthews</dc:creator>
				<category><![CDATA[Meetings]]></category>
		<category><![CDATA[crowdsourcing]]></category>
		<category><![CDATA[digital heritage]]></category>
		<category><![CDATA[isko]]></category>

		<guid isPermaLink="false">http://www.menarik.co.uk/?p=384</guid>
		<description><![CDATA[Some notes from a set of talks at UCL on 9th June.]]></description>
			<content:encoded><![CDATA[<p><strong><a href="http://www.iskouk.org/cultural_heritage_jun2010.htm">ISKO Seeing is Believing</a> : New technologies for cultural heritage</strong></p>
<p>9 Jun 2010, UCL</p>
<p>A set of interesting talks from researchers, museum staff and a technology supplier on digital heritage. Common themes were enhancing interaction and engagement with collections through digital media and the crowdsourcing of transcription work with its attendant risks and benefits. Crowdsourcing is seen having particular potential at a time when it is hard to secure funding for digitisation and online projects.</p>
<p><strong>1. David Arnold, University of Brighton</strong></p>
<p>David Arnold presented on advances in 3D object representations for heritage collections. Not really my area this, so I only picked up a few points:</p>
<ul>
<li><a href="http://www.3d-coform.eu/">3D Coform Project</a> – part of big EU project with museums. </li>
<li>Phased adoption of technology – first phase just reproduces what you do manually. Subsequent phases only possible with tech.</li>
<li>Every time we digitise, we create potential legacy problems with the data</li>
</ul>
<p>Tools: </p>
<ul>
<li>triangulated laser scanning,  KULeuvan ARC 3D tool</li>
<li>Sourceforce FOSS mesh processing</li>
</ul>
<p>Example: <a href="http://www.publicsculpturesofsussex.co.uk/">publicscupturesofsussex.co.uk</a> – map + 3d pics</p>
<p>Metadata: 3D COFORM adopted CIDOC-CRM &#038; extension to “Digital Provenance”</p>
<p><strong>2. Andy Hudson-Smith, UCL (cf <a href="http://digitalurban.blogspot.com/">Digitalurban blog</a>)</strong></p>
<p>Andy spoke about a new project aimed to attach memories to objects: Tales of Things and electronic Memory (TOTeM). Site is:  <a href="http://www.talesofthings.com">talesofthings.com</a> </p>
<p><a href="http://www.talesofthings.com/"><img src="http://www.menarik.co.uk/wp-content/uploads/2010/06/tot_ss-300x268.png" alt="tales of things website" title="tot_ss" width="300" height="268" class="alignleft size-medium wp-image-392" /></a></p>
<p>You take a photo of an object/building etc.. of personal significance to you, upload it to their website and get a QR code sticker. API under development. </p>
<p>The object is then tagged and you can add your memories to it via the web. Object also tweets on updates. Audioboo for the audio tagging.</p>
<p>Apps: talking Oxfam objects (what happened to my old trousers?), artist adds QR to art to record people&#8217;s reactions to his art.</p>
<p>Plan to make QR code out of mosaic tiles for Shoreditch church.</p>
<p>“<a href="http://journal.media-culture.org.au/index.php/mcjournal/article/viewArticle/209/0">Tagging is Connecting</a>” paper M/C journal</p>
<p>Follow project at  <a href="twitter.com/talesofthings">twitter.com/talesofthings </a></p>
<p>Alzheimers link – triggering memories in sufferers.</p>
<p><strong>3. Melissa Terras, UCL</strong></p>
<p>Melissa spoke about her &#8220;Transcribe Bentham&#8221; Project &#038; Crowdsourcing the transcriptions.</p>
<p>Jeremy Bentham – large amount of papers &#038; material on him at UCL. </p>
<p>Crowdsourcing roots. Often those outside institutions took lead in exploiting new tech. Old idea – e.g. metal detectors revolutionising archeology with uptake by enthusiasts. </p>
<p>Holley, R D-Lib Magazine. “<a href="http://www.dlib.org/dlib/march10/holley/03holley.html">Crowdsourcing: how and why should libraries do it</a>?”</p>
<p>Crowdsourcing successes:</p>
<ul>
<li>Galaxy Zoo – 60 million galaxies classified</li>
<li>Australian Newspapers Digitisation Program www.nla.gov.au/ndp. 9000+ members thousands of lines corrected</li>
<li>V &#038; A – crowdsourcing best crops of images from collections. 14,000 to date.</li>
<li>Family search indexing of family trees www.familysearch.org</li>
</ul>
<p>and many more..</p>
<p>Learning from crowdsourcing experience:</p>
<ul>
<li>Majority of work is done by 10% of users</li>
<li>Personal interest &#038; reward – use of cognitive surplus</li>
<li>
Pensioners, disabled, terminally ill particularly keen</li>
<li>Builds up IT expertise, “addictive”, rewards = ranking</li>
<p>.</ul>
<p>Rose Holley&#8217;s checklist (on dlib). Launching system in July. </p>
<p><a href="www.ucl.ac.uk/transcribe-bentham">www.ucl.ac.uk/transcribe-bentham</a></p>
<p>..system will be based on mediawiki</p>
<p>Issues :</p>
<ul>
<li>linking transcription, catalogue &#038; image</li>
<li>Undoing vandalism</li>
<li>Quality control</li>
</ul>
<p>#TranscriBentham</p>
<p><strong>4. Fiona Romeo, National Maritime Museum </strong></p>
<p>Fiona also presented a range of NMM digital projects including crowdsourding of transcripts also. </p>
<p>“Your Paintings” project with BBC. Public Catalogue Foundation.</p>
<p><a href="http://www.flickr.com/photos/nationalmaritimemuseum/">NMM photos on Flickr</a>. Get foreign language tags as well as semantics (content related) . Some additional annotations. Get questions, which helps guide information design. Indicommons project, make it easier to share content.</p>
<p>Naval-history.net Naval history community – connected with them and got them to work on a set of scans – CC sharealike licenced.</p>
<p>NMM have a new wing under development dedicated to digital interaction and a number of interesting ideas for social sharing and enhancement of objects  in the collection.</p>
<p><strong>5. Sascha Curzon, Gallery Systems</strong></p>
<p>Sascha presented the <a href="http://www.emuseum.com/">eMuseum</a> federated search system, a nonprofit project which enables museums and galleries to expose  their catalogues, enabling end users to  search across collections. The aggregated data will also be available as XML and RDF.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.menarik.co.uk/2010/06/isko-event-digital-cultural-heritage/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Asking the audience</title>
		<link>http://www.menarik.co.uk/2010/06/group-wisdom/</link>
		<comments>http://www.menarik.co.uk/2010/06/group-wisdom/#comments</comments>
		<pubDate>Thu, 03 Jun 2010 21:56:49 +0000</pubDate>
		<dc:creator>Paul Matthews</dc:creator>
				<category><![CDATA[Books Reviewed]]></category>
		<category><![CDATA[groupthink]]></category>
		<category><![CDATA[social epistemology]]></category>
		<category><![CDATA[wisdom]]></category>

		<guid isPermaLink="false">http://www.menarik.co.uk/?p=345</guid>
		<description><![CDATA[Bites from the 2006 book "Infotopia" by Cass Sunstein on the aggregation of knowledge]]></description>
			<content:encoded><![CDATA[<p>SUNSTEIN, C.R., 2006. <em>Infotopia: How many minds produce knowledge</em>. New York: Oxford</p>
<p><img src="http://www.menarik.co.uk/wp-content/uploads/2010/06/infotopia-199x300.jpg" alt="infotopia" title="infotopia" width="120" class="alignleft size-medium wp-image-348" /></p>
<p>I only discovered this book recently but it contains some interesting insights into group decision making, providing a range of evidence to support &mdash; often counterintuitive &mdash; observations about the best conditions for knowledge creation. As such, it relates well to <a href="http://fas-philosophy.rutgers.edu/goldman/">Alvin Goldman&#8217;s</a> &#8220;veritistic selection practices&#8221; &mdash; social design aimed at maximising the truth value of aggregated opinion.</p>
<p>Produced at a time when prediction markets, blogs and wikis were attracting growing interest, it is the book&#8217;s analysis of the more traditional social collaborative processes that stood out for me.</p>
<p>A central idea early in the book is the Condorcet Jury Theorem, which states that &mdash; on binary and numerical judgements at least &mdash; a group is more likely to be correct than an individual under the conditions that:-</p>
<ol>
<li>Majority rule is used</li>
<li>Each individual is more likely than not to be correct</li>
</ol>
<p>Notably, only some of the group need to know the correct answer, as the rest will answer randomly preserving the effect. This is redolent of Millionaire&#8217;s &#8220;Ask the Audience&#8221; option, which another book <a href="http://www.amazon.co.uk/Wisdom-Crowds-Many-Smarter-Than/dp/0349116059/">The Wisdom of Crowds</a> claimed is correct <b>91%</b> of the time, more often than the phone a friend option. <a href="http://www.smoking-bun.com/?p=9">Some warn against</a>  calling this mathematical effect &#8220;wisdom&#8221;, however.</p>
<p><a href="http://www.menarik.co.uk/wp-content/uploads/2010/06/wwtbam_audience_response_4.jpg"><img src="http://www.menarik.co.uk/wp-content/uploads/2010/06/wwtbam_audience_response_4-300x225.jpg" alt="ask the audience screenshot" title="wwtbam_audience_response_4" width="300" height="225" class="alignleft size-medium wp-image-401" /></a></p>
<p>Sunstein goes on to provide a range of evidence supporting the observation that deliberation does not lead necessarily lead to more accurate outcomes than polled, statistical samples. This may be due to:-</p>
<ol>
<li>&#8220;<strong>hidden profiles</strong>&#8220;, or accurate information available to a group which is not accessed in deliberation (perhaps because only a few members of the group possess it, or are reluctant to go against the grain and challenge the majority view);</li>
<li> The &#8220;<strong>common knowledge effect</strong>&#8220;, where shared knowledge has more of an influence on the group&#8217;s decision than information held individually;</li>
<li><strong>Informational</strong> and <strong>reputational cascades</strong>, where individuals change their view or the willingness to speak out after hearing the views of others;</li>
<li><strong>Group polarisation</strong>, where members of a group end up with more extreme views &mdash; in line with their initial tendencies &mdash; after debate.</li>
</ol>
<p>What this all mean for knowledge creation on the social web? Perhaps that giving scores and weighting to reputation, while establishing authority and expertise, may also dissuade or detract from informed dissent, and that such dissent may also warrant recognition. </p>
<p>In <a href="http://www.ethanzuckerman.com/blog/2006/11/30/cass-sunsteins-infotopia/">another review of the book</a>, Ethan Zuckerman noted:-</p>
<p>&#8220;A large number of the most interesting projects taking place on the internet use strategies to aggregate information from multiple users to create new knowledge &#8230;. analyzing these systems in terms of their effectiveness in getting people to reveal hidden knowledge is, in my opinion, an excellent framework for evaluation.&#8221;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.menarik.co.uk/2010/06/group-wisdom/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>information systems research: describe, critique or design?</title>
		<link>http://www.menarik.co.uk/2010/02/information-systems-research-describe-critique-or-design/</link>
		<comments>http://www.menarik.co.uk/2010/02/information-systems-research-describe-critique-or-design/#comments</comments>
		<pubDate>Thu, 11 Feb 2010 13:58:44 +0000</pubDate>
		<dc:creator>Paul Matthews</dc:creator>
				<category><![CDATA[Research]]></category>
		<category><![CDATA[is]]></category>
		<category><![CDATA[theory]]></category>

		<guid isPermaLink="false">http://www.menarik.co.uk/?p=313</guid>
		<description><![CDATA[At what level does IS research work best? This post argues for more direct involvement in design from the research community.
(image flickr/talios)]]></description>
			<content:encoded><![CDATA[<p>I sometimes find it hard to see the practical value of some IS research.  Researchers argue for a well-supported theoretical angle, come up with a pragmatic method and may present interesting observations and insights into the particular system that has been investigated. When it comes to design recommendations arising from the research, however, these often seem trite or added as an afterthought. </p>
<p>If theory and research are meant to inform practice, recommendations need to be current, relevant,  and actionable. This can be hard to achieve in a domain that develops so quickly. Checkland (1998) observed a while ago how it is very hard for theory to keep up and be informative, thereby failing to match the &#8220;ideal&#8221; dynamic:</p>
<div id="attachment_320" class="wp-caption aligncenter" style="width: 510px"><img src="http://www.menarik.co.uk/wp-content/uploads/2010/02/checkland_cycle.png" alt="Checkland&#039;s 'ideal' dynamic for theory informing practice" title="checkland_cycle" width="500"  class="size-full wp-image-320" /><p class="wp-caption-text">Checkland's ideal for theory informing practice</p></div>
<p>Whilst a range of research-theoretical stances can be taken, justified and accepted by the research community, only some seem to be close enough to practice to usefully inform it. This is where the researchers are either experimenting directly through design, or are involved enough to really engage with the language, problems and possible solutions in the domain. So I would favour interpretive or action research approaches in this regard:</p>
<div id="attachment_326" class="wp-caption aligncenter" style="width: 510px"><img src="http://www.menarik.co.uk/wp-content/uploads/2010/02/levels1.png" alt="Levels of engagement and theoretical stance (based on Aakhus &amp; Jackson (2005))" title="levels" width="500"  class="size-full wp-image-326" /><p class="wp-caption-text">Levels of engagement and theoretical stance (based on Aakhus &#038; Jackson (2005))</p></div>
<p>Whilst the more descriptive, long view can still be valuable, it is hard to <em>realise</em> this value unless the knowledge gained is acted on by those implementing new designs and approaches. As they are the ones reading the research, this should perhaps be the research community itself in the first instance. Design improvement can then be further communicated through demonstrators, but these need to be compelling enough to be taken seriously.</p>
<p><strong>References</strong></p>
<p>1. Aakhus, M. and Jackson, S., 2005. Technology, Interaction and Design. In: Fitch, K.L. and Sanders, R.E.( eds) <em>Handbook of Language and Social Interaction</em>. LEA, pp. 411-433<br />
2. Checkland, P .and Holwell ,S.  1998.  <em>Information, Systems and Information Systems: Making Sense of the Field</em>  John Wiley &#038; Sons, Inc. New York, NY, USA</p>
]]></content:encoded>
			<wfw:commentRss>http://www.menarik.co.uk/2010/02/information-systems-research-describe-critique-or-design/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Book Review: Metadata and Semantics</title>
		<link>http://www.menarik.co.uk/2010/01/book-review-metadata-and-semantics/</link>
		<comments>http://www.menarik.co.uk/2010/01/book-review-metadata-and-semantics/#comments</comments>
		<pubDate>Thu, 14 Jan 2010 15:18:04 +0000</pubDate>
		<dc:creator>Paul Matthews</dc:creator>
				<category><![CDATA[Books Reviewed]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[ontology]]></category>
		<category><![CDATA[semantics]]></category>

		<guid isPermaLink="false">http://www.menarik.co.uk/?p=295</guid>
		<description><![CDATA[A review of the proceedings of the 2nd International Conference on Metadata and Semantics Research, held in Corfu in October 2007

(Image:flickr:dullhunk)]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.springer.com/book/978-0-387-77744-3"><img alt="" src="http://www.springer.com/cda/content/image/cda_displayimage.jpg?SGWID=0-0-16-475462-0" title="Cover" class="alignleft" width="95" height="145" /></a><br />
Metadata and Semantics<br />
<a href="http://www.springer.com/book/978-0-387-77744-3">http://www.springer.com/book/978-0-387-77744-3</a><br />
Sicilia, M.; Lytras, M. (eds)<br />
2009 Springer 549pp</p>
<p><em><strong>Originally reviewed for the <a href="http://irsg.bcs.org/index.php">BCS Information Retrieval Specialists Group</a> Newsletter &#8220;<a href="http://irsg.bcs.org/display_informers.php">The Informer</a>&#8220;</strong></em></p>
<p>This volume contains the proceedings of the <strong>2nd International Conference on Metadata and Semantics Research</strong>, held in Corfu in October 2007. The conference had a 50% acceptance rate, with most of the presented papers being published in the book.</p>
<p>Despite the fact that many papers present only interim or early research work, the volume provides a fascinating overview of a cross section of metadata and semantics research in both pure form and as applied in a variety of domains, including those of cultural heritage, education and agriculture. In addition to the advances in these individual fields and domains, one is able to determine some general trends and patterns running through the work presented. Perhaps overriding all is the sense that many researchers and archivists are struggling with the problem of balancing the complexity of describing information resources within a domain adequately, while at the same time ensuring that discrete sets of knowledge can actually be successfully linked and that systems which can guide the user in retrieval and discovery can be usefully developed on top of these structures.</p>
<p>Immediately apparent as one reads the volume is the dominance of semantic web approaches to metadata and knowledge modelling, influencing migrations from stand-alone formats and conventions to those based on semantic web standards such as RDF and OWL.  Sections are devoted to semantic web applications and to ontology engineering.  In several places one sees how such approaches can provide the missing semantic aspect to rather more well-established syntactic rules and begin to enable reasoning and knowledge discovery through the use of ontologies. In <strong>Semantic Application Profiles: A Means to Enhance Knowledge Discovery in Domain Metadata Models</strong>, for instance, Koutsomitropoulos and colleagues show how the CIDOC Conceptual Reference Model for cultural heritage can be represented in OWL and used to discover related works (using a painting example).  In <strong>Capturing MPEG-7 Semantics</strong>, Dasiopoulou et al, argue that a semantic model is needed to enhance and disambiguate multimedia mark-up enabled by the MPEG-7 format.  Also very  much worthy of mention in this regard is good the article by Voss, <strong>Encoding changing country codes for the Semantic Web with ISO 3166 and SKOS</strong> which demonstrates with a grounded example how changes and versioning can be accommodated using the RDF based Simple Knowledge Organisation System.  This is clearly paramount when considering the provision of globally applicable yet cohesive and adaptive data sets.</p>
<p>The complexity of and unwieldiness of ontologies has been an issue in the past and several of the papers address this by providing either automated means of deriving ontology structures or reviewing tools used (or needed)  for ontology and semantic web data manipulation.  </p>
<p>In the first category,  <strong>Ensemble Learning of Economic Taxonomy Relations from Modern Greek Corpora</strong> by Kermanidis shows how a number of clustering algorithms can be applied in combination to arrive at quite a high level of accuracy in deriving relations, based only on the learning sets themselves.  In <strong>A new Formal Concept Analysis-based learning approach to Ontology Building</strong>, Jia and colleagues promote the FCA approach in providing a lattice of concepts rather than the hierarchy given by other popular clustering methods, which they argue as being more “true to life”. Jia et al then show how the derived ontology can then be used to provide a query refinement/expansion method. </p>
<p>In terms of user tools, Cardoso provides a useful state of the art in use of software to manipulate RDF data and ontologies in <strong>The Semantic Web : A mythical story or solid reality</strong> ,  and Enokksson et al address the problem of the remote and in-place modification of RDF in <strong>An RDF Modification Protocol based on the needs of Editing Tools</strong>.</p>
<p>While there are a number of papers that demonstrate derivations and developments of specialised or extended ontologies and metadata schemes, I found the more pragmatic papers to be more valuable and progressive for the way they manage to remain in touch with the ultimate aim of their work. A nice balance between the convenience of machine cataloguing and the need for oversight of human cataloguers is struck in Whitelaw and Collins’ <strong>Pragmatic support for taxonomy-based annotation of structured digital documents</strong>. The authors describe the derivation of metadata for Open University course documents (based on IEEE Learning Object Metadata) using a series of more- or less automated approaches and then having these evaluated by library cataloguing staff.  The optimal solution was found to be where terms are suggested from the vocabulary based on document content, but where the staff member can click through to the definition before confirming the use of the term as metadata. The authors also begin to look at the cost benefit ratio of collecting more detailed metadata to a slight increase in cataloguing effort. While these authors don’t get as far as exploring the efficiency of retrieval of documents, this is handled in Cervera et als’ <strong>Quality Metrics in Learning Objects</strong> which stresses the reusability objective and notes that quality metrics are crucial to the navigation of resources. Notably, Cervera et al describe the need for socially based rankings and ratings to be incorporated into the metadata scheme. </p>
<p>A further example of attempts to enhance the usability and comparability of metadata is provided by Nilsson et al in <strong>Formalizing Dublin Core Application Profiles: Description Set Profiles and Graph Constraints</strong>. A number of practical measures for introducing conventions for how metadata elements are used and coded are presented, including standard templates and the use of a wiki to provide for easy human editing of profiles which can then also be exported at XML, thereby bridging the gap between qualitative and more structured usage guides.</p>
<p>Against this general background of attempts to standardise, enable interoperability between collections and provide enhanced semantic infrastructures you have the inherent messiness of human decisions as to meaning and the complexity of the encoding act itself, something that is probably not discussed enough in this community but which is covered in Scifleet et al’s <strong>The Human Art of Encoding: Markup as Documentary Practice</strong>. The authors note that metadata cataloguing is socially situated and may often be idiosyncratic, though the extent to which this may affect quality and comparability is poorly known. They report on a programme of research to study this variety in the practice of participants in the Text Encoding Initiative (TEI).  Unfortunately their full findings are not covered in this volume but are well worth following up for those interested in this important question.</p>
<p>In summary, this is an interesting and varied read, with enough “current state of the art” review papers to interest the non-specialist as well as some good fodder for those already wrestling with metadata,  semantics and knowledge representation issues within particular domains.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.menarik.co.uk/2010/01/book-review-metadata-and-semantics/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Online Information 2009 &#8211; A Twitter Timeline</title>
		<link>http://www.menarik.co.uk/2009/12/online-information-2009-a-twitter-timeline/</link>
		<comments>http://www.menarik.co.uk/2009/12/online-information-2009-a-twitter-timeline/#comments</comments>
		<pubDate>Wed, 09 Dec 2009 18:42:16 +0000</pubDate>
		<dc:creator>Paul Matthews</dc:creator>
				<category><![CDATA[Meetings]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[#online09]]></category>
		<category><![CDATA[conferences]]></category>
		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://www.menarik.co.uk/?p=269</guid>
		<description><![CDATA[An interactive timeline of tweets from Onine Information 2009]]></description>
			<content:encoded><![CDATA[<p>It was disappointing to miss <a href="http://www.online-information.co.uk/index.html">Online Information</a> this year, so I have been trying to catch up via the Twitter stream. Fortunately the #online09 tweets have been archived to <a href="http://twapperkeeper.com/online09/">Twapper Keeper</a> (via <a href="http://twitter.com/briankelly">@briankelly</a>). This is great, as the site makes them downloadable. </p>
<p>To see them in the context of the conference programme, I converted the tweets and the event schedule  to XML suitable for display in a <a href="http://www.simile-widgets.org/timeline/">Simile Timeline</a>. Click the screenshot below to access the resulting <a href="http://www.menarik.co.uk/tweetline">timeline mashup</a> (nb tweets take a few seconds to load).</p>
<p> <a href="http://www.menarik.co.uk/tweetline"><img src="http://www.menarik.co.uk/wp-content/uploads/2009/12/tweetline_cap.png" alt="tweetline_cap" title="tweetline_cap" width="500" height="253" class="aligncenter size-full wp-image-271" style="clear:both;" /></a></p>
<p><strong>What would have made this exercise easier, and the conference tweets easier to track</strong>:</p>
<ul>
<li>If the conference programme had been published in a more mashable format eg iCal / XML</li>
<li>If there was a way to identify a particular conference track (a sub hashtag? &#8211; or is that overkill?)</li>
<li>A way to distinguish critical comments, reportage, and general flim-flam. Though perhaps that is part of the fun!</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://www.menarik.co.uk/2009/12/online-information-2009-a-twitter-timeline/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>The Bristol Community Twitterati</title>
		<link>http://www.menarik.co.uk/2009/11/bristol-community-twitterati/</link>
		<comments>http://www.menarik.co.uk/2009/11/bristol-community-twitterati/#comments</comments>
		<pubDate>Fri, 27 Nov 2009 14:54:07 +0000</pubDate>
		<dc:creator>Paul Matthews</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[network analysis]]></category>
		<category><![CDATA[social graph]]></category>
		<category><![CDATA[twitter]]></category>

		<guid isPermaLink="false">http://www.menarik.co.uk/?p=227</guid>
		<description><![CDATA[An analysis of the Bristol "twolar system"..]]></description>
			<content:encoded><![CDATA[<p>Last week I downloaded a partial dataset of Twitter users reporting their location as Bristol. I started from the Bristol Social Media group&#8217;s &#8220;<a href="http://twitter.com/brrism">brrism</a>&#8221; ID and retrieved friends, followed by friends of their friends. This resulted in a set of 793 Bristol-based IDs, with 9241 friend links between them.</p>
<p>The graphic below shows the Bristolian IDs with the most friends who are also located in Bristol. The most highly connected accounts are in red towards the centre. The highest number of Bristolian friends was 183 (<a href="http://twitter.com/bristolnews">bristolnews</a>), with the average number 11, though stats like this of course ignore the fact that many people don&#8217;t report their location (or do report it as lat/long). An initial look shows that &#8220;traditional&#8221; media corporate &ndash; and individual &ndash; IDs are well linked in, but that social and creative orgs. and those backed by online community sites are also among the most popular.</p>
<p><img src="http://www.menarik.co.uk/wp-content/uploads/2009/11/brizzle_twitterati.png" alt="Subgraph of most connected IDs" title="brizzle_twitterati" width="750"  class="size-full wp-image-237" /></p>
<p>When I get time, the plan is to categorise IDs by sector and by individual / corporate (though there is a large number of in-betweens). There are also signs that the accounts are organised into four distinct clusters and it will be interesting to see if there is any discernible basis to this.</p>
<p>Here is a <a href="http://www.wordle.net">Wordle</a> of the user bios. Not too many surprises here &ndash; a good overview of Bristol&#8217;s online creative community &ndash; though the prominence of tea was interesting (23 mentions) in comparison to coffee (8) !</p>
<p><img src="http://www.menarik.co.uk/wp-content/uploads/2009/11/biowordle.png" alt="biowordle" title="biowordle" width="823" height="537" class="aligncenter size-full wp-image-243" /></p>
]]></content:encoded>
			<wfw:commentRss>http://www.menarik.co.uk/2009/11/bristol-community-twitterati/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Social epistemology as a foundation for information services</title>
		<link>http://www.menarik.co.uk/2009/07/social-epistemology-and-lis/</link>
		<comments>http://www.menarik.co.uk/2009/07/social-epistemology-and-lis/#comments</comments>
		<pubDate>Fri, 03 Jul 2009 10:19:45 +0000</pubDate>
		<dc:creator>Paul Matthews</dc:creator>
				<category><![CDATA[Papers Reviewed]]></category>
		<category><![CDATA[epistemology]]></category>
		<category><![CDATA[LIS]]></category>

		<guid isPermaLink="false">http://www.menarik.co.uk/?p=211</guid>
		<description><![CDATA[An article arguing for social epistemology as a foundation and theoretical framework for information science.]]></description>
			<content:encoded><![CDATA[<p>Fallis, D., 2006. Social epistemology and information science. Annual Review of Information Science and Technology, 40(1), pp. 475-519</p>
<p>In this chapter of the Annual Review, Fallis argues for social epistemology as the best philosophical underpinning to information science.  Providing access to society&#8217;s knowledge through recorded information is argued as the main reason for information services.  Interestingly, social epistemology was first coined in information science (by Shera) , though was also developed as an idea independently within philosophy. </p>
<p>Fallis covers the debates within philosophy as to what constitutes knowledge, and how these have been viewed by information scientists, most of whom have naturally taken more pragmatic perspectives. He also notes the different (classic v revolutionary) views of social epistemology, with the former a social extension of the established idea of knowledge as justified true belief, but the latter taking the social constructivist line. A further useful distinction made is between social epistemology as a normative project &mdash; the goal of which is to maximise knowledge acquisition (or at least maximise access to authoritative information) &mdash; and the sociology of knowledge, which is about understanding and describing how social factors contribute to knowledge making.</p>
<p>In outlining debates about necessity of the truth condition, Fallis notes that librarians will help the user try to discover the &#8220;truth&#8221;, which the user in turn sees as the key objective of using the service. That said, it is acknowledged that the collection owner cannot possibly know whether all the facts contained therein are true.</p>
<p>Some interesting and important conclusions come from acknowledging the centrality of social epistemology in this way:</p>
<ol>
<li>That epistemic principles should guide collection organisation (e.g. by type of claim and evidence offered);</li>
<li>That information providers should collocate a range of viewpoints on a topic to allow the seeker to better determine the truth (a principle taken from Mill&#8217;s <em>On Liberty</em>);</li>
<li>That any aspect of an information service that enables the social acquisition of knowledge is as important as any other (sofas and coffee bars are as important as books)- I liked this one!</li>
</ol>
<p>I felt this to be a very rich and balanced article with many considerations of lasting relevance as we move into the digital age. I also really liked the exploration of the underlying goals of information services, a topic that needs to be foregrounded more often.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.menarik.co.uk/2009/07/social-epistemology-and-lis/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>ICTD2009: Selected Cuts</title>
		<link>http://www.menarik.co.uk/2009/04/ictd2009/</link>
		<comments>http://www.menarik.co.uk/2009/04/ictd2009/#comments</comments>
		<pubDate>Tue, 21 Apr 2009 20:12:41 +0000</pubDate>
		<dc:creator>Paul Matthews</dc:creator>
				<category><![CDATA[Meetings]]></category>
		<category><![CDATA[ict4d]]></category>
		<category><![CDATA[ictd2009]]></category>

		<guid isPermaLink="false">http://www.menarik.co.uk/?p=176</guid>
		<description><![CDATA[Some personal highlights and observations from the Information and Communication Technologies for Development conference in Doha, Qatar, 17-19th April]]></description>
			<content:encoded><![CDATA[<p>The ICTD2009 conference was held at Carnegie Mellon&#8217;s University in Doha, Qatar &#8211; A recently completed, futuristic building rising from the desert. The conference started with some thematic workshops on ICT4D research, mobiles and ICT4D definitions, followed by 2 days of peer-reviewed paper presentations, posters and demos, and a couple of big name keynotes. Papers were grouped into those looking at heathcare, e-government, social enterprise, community content, education and games. Major recurring themes were mobile platforms, qualitative v quantitative method, evaluation and generalisability/scalability.</p>
<p>Bill Gates&#8217; keynote began by focusing on the big problems in development that his Foundation is targeting &mdash; childhood mortality, immunisation and education. Perhaps to deliberately deemphasise ICT, he introduced it only later in his talk, where he talked about scalability and learning to recognise those projects with a genuine impact while recognising failures (citing <a href="http://www.digitalgreen.org/">Digital Green</a> and <a href="http://www.safaricom.co.ke/index.php?id=745">M-Pesa</a> as successful examples). In doing so, there was some feeling that he was not giving due regard to the importance of localised, context-dependent solutions (eg <a href="http://ict4dblog.wordpress.com/2009/04/18/bill-gates-and-ict4d/">Richard Heeks&#8217; response</a>)</p>
<div class="wp-caption aligncenter" style="width: 610px"><img alt="Ricardo Legorretas impressive building in Education City" src="http://www.menarik.co.uk/wp-content/themes/tma/images/latest/carnegie_doha.jpg" title="Carnegie Building" width="500" /><p class="wp-caption-text">Ricardo Legorreta&#39;s impressive Carnegie Mellon building in Doha</p></div>
<p>Earlier in the day, a very interesting study by <a href="http://melissaho.com/">Melissa Ho</a> and colleagues on ICT for health in Ghana had compared a top-down, government-sponsored solution with a more bottom-up practitioner-led initiative. Both solutions dealt with communication and referrals between health care professionals, the former being based on a web platform and the latter a mobile solution. The study found more uptake with the latter, though interestingly this also seemed to have some unintended consequences, such as doctors&#8217; phones going off at inopportune moments. This seemed to typify the delicate balance between over-structured &#8220;official&#8221; systems which struggle for uptake and the unstructured, occasionally chaotic outcomes with a user-led approach. The interesting question for me is whether such community-led work can develop sufficient structure and conformity for the solution to work efficiently into the long term.</p>
<p>Among the other (mostly very good) presentations that stood out for me was that by Aishwarya Ratan et al from <a href="http://research.microsoft.com/apps/pubs/default.aspx?id=80319">Microsoft Research on Kelsa+</a>, which did an ethnographic study of off-duty workers using a PC, capturing some compelling footage of workers teaching themselves basic computing through peer observation and experimentation.</p>
<p>The poster presentations and system demos were great, and  although I didn&#8217;t get round to all of them I had a very good live demo of <a href="http://www.frontlinesms.com/">Frontline SMS</a> linked to <a href="http://www.ushahidi.com/">Ushahidi</a> for participatory crisis reporting and a look at the <a href="http://www.mocamobile.org/">Moca system</a> for remote diagnosis in healthcare.</p>
<div class="wp-caption aligncenter" style="width: 510px"><img alt="Carlos Braga giving his closing keynote" src="http://www.menarik.co.uk/wp-content/themes/tma/images/latest/braga.jpg" title="Carlos Braga" width="500" height="332" /><p class="wp-caption-text">Carlos Braga giving his closing keynote</p></div>
<p>In his closing keynote, Carlos Braga of the World Bank talked about the global financial crisis and how it will continue to hit developing countries hard. While acknowledging the role of ICT in the credit crisis (through enabling complex but opaque derivative calculations), it is increasingly shown to play a strong role in economic growth. Braga compared the development trajectories of South Korea and Ghana, attributing a large amount of the divergence between the two on the better harnessing of knowledge in Korea. Braga also made use of the excellent animated bubble chart for GDP growth from <a href="http://graphs.gapminder.org/world/#$majorMode=chart$is;shi=t;ly=2003;lb=f;il=t;fs=11;al=30;stl=t;st=t;nsl=t;se=t$wst;tts=C$ts;sp=6;ti=2005$zpv;v=0$inc_x;mmid=XCOORDS;iid=phAwcNAVuyj1jiMAkmq1iMg;by=ind$inc_y;mmid=YCOORDS;iid=phAwcNAVuyj2tPLxKvvnNPA;by=ind$inc_s;uniValue=8.21;iid=phAwcNAVuyj0XOoBL_n5tAQ;by=ind$inc_c;uniValue=255;gid=CATID0;by=grp$map_x;scale=log;dataMin=194;dataMax=96846$map_y;scale=lin;dataMin=23;dataMax=86$map_s;sma=49;smi=2.65$cd;bd=0$inds=">Gapminder World</a></p>
<p>In sum, it was a privilege to get to sample the range of quality research and innovation going on in the ICT4D space. While the theoretical navel-gazing was occasionally wearing, there were also real signs of an emerging, more balanced synergy between the power of ICT and the headline issues in global development.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.menarik.co.uk/2009/04/ictd2009/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
