<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Online Journalism Blog &#187; Charles Arthur</title>
	<atom:link href="http://onlinejournalismblog.com/tag/charles-arthur/feed/" rel="self" type="application/rss+xml" />
	<link>http://onlinejournalismblog.com</link>
	<description>A conversation.</description>
	<lastBuildDate>Sat, 11 Feb 2012 12:06:28 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
<cloud domain='onlinejournalismblog.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
		<item>
		<title>2 guest posts: 2012 predictions and &#8220;Social media and the evolution of the fourth estate&#8221;</title>
		<link>http://onlinejournalismblog.com/2011/12/29/2-guest-posts-2012-predictions-and-social-media-and-the-evolution-of-the-fourth-estate/</link>
		<comments>http://onlinejournalismblog.com/2011/12/29/2-guest-posts-2012-predictions-and-social-media-and-the-evolution-of-the-fourth-estate/#comments</comments>
		<pubDate>Thu, 29 Dec 2011 08:27:17 +0000</pubDate>
		<dc:creator>Paul Bradshaw</dc:creator>
				<category><![CDATA[data journalism]]></category>
		<category><![CDATA[2012]]></category>
		<category><![CDATA[axel bruns]]></category>
		<category><![CDATA[Charles Arthur]]></category>
		<category><![CDATA[eric s raymond]]></category>
		<category><![CDATA[fourth estate]]></category>
		<category><![CDATA[freedom of information]]></category>
		<category><![CDATA[futurology]]></category>
		<category><![CDATA[gatekeeping]]></category>
		<category><![CDATA[gatewatching]]></category>
		<category><![CDATA[heather brooke]]></category>
		<category><![CDATA[memeburn]]></category>
		<category><![CDATA[nieman journalism lab]]></category>

		<guid isPermaLink="false">http://onlinejournalismblog.com/?p=15628</guid>
		<description><![CDATA[I&#8217;ve written a couple of guest posts for Nieman Journalism Lab and the tech news site Memeburn. The Nieman post is part of a series looking forward to 2012. I&#8217;m never a fan of futurology so I&#8217;ve cheated a little and talked about developments already in progress: new interface conventions in news websites; the rise of collaboration; and the skilling<br /><span class="read_more"><a href="http://onlinejournalismblog.com/2011/12/29/2-guest-posts-2012-predictions-and-social-media-and-the-evolution-of-the-fourth-estate/">Read more...</a></span>]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fonlinejournalismblog.com%2F2011%2F12%2F29%2F2-guest-posts-2012-predictions-and-social-media-and-the-evolution-of-the-fourth-estate%2F" onclick="urchinTracker('/outgoing/api.tweetmeme.com/share?url=http_3A_2F_2Fonlinejournalismblog.com_2F2011_2F12_2F29_2F2-guest-posts-2012-predictions-and-social-media-and-the-evolution-of-the-fourth-estate_2F&amp;referer=');"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fonlinejournalismblog.com%2F2011%2F12%2F29%2F2-guest-posts-2012-predictions-and-social-media-and-the-evolution-of-the-fourth-estate%2F&amp;style=normal&amp;b=2" height="61" width="50" /><br />
			</a>
		</div>
<p><img class="alignleft" style="margin: 20px;" title="Memeburn logo" src="http://memeburn.com/img/memeburn_260.png" alt="Memeburn logo" width="260" height="57" /></p>
<p>I&#8217;ve written a couple of guest posts for Nieman Journalism Lab and the tech news site Memeburn. <a href="http://www.niemanlab.org/2011/12/paul-bradshaw-collaboration-data-2012-will-see-news-outlets-turning-talk-into-action/" onclick="urchinTracker('/outgoing/www.niemanlab.org/2011/12/paul-bradshaw-collaboration-data-2012-will-see-news-outlets-turning-talk-into-action/?referer=');">The Nieman post</a> is part of a series looking forward to 2012. I&#8217;m never a fan of futurology so I&#8217;ve cheated a little and talked about developments already in progress: new interface conventions in news websites; the rise of collaboration; and the skilling up of journalists in data.</p>
<p>Memeburn asked me a few months ago to write about <a href="http://memeburn.com/2011/12/social-media-and-the-evolution-of-the-fourth-estate/" onclick="urchinTracker('/outgoing/memeburn.com/2011/12/social-media-and-the-evolution-of-the-fourth-estate/?referer=');">social media&#8217;s impact on journalism&#8217;s role as the Fourth Estate</a>, and it took me until this month to find the time to do so. Here&#8217;s the salient passage:</p>
<blockquote><p>&#8220;But the power of the former audience is a power that needs to be held to account too, and the rise of liveblogging is teaching reporters how to do that: reacting not just to events on the ground, but the reporting of those events by the people taking part: demonstrators and police, parents and politicians all publishing their own version of events — leaving journalists to go beyond documenting what is happening, and instead confirming or debunking the rumours surrounding that.</p>
<p>&#8220;So the role of journalist is moving away from that of gatekeeper and — <a href="http://books.google.co.uk/books%20hl=en&amp;lr=&amp;id=ybSFU9aDzsoC&amp;oi=fnd&amp;pg=PR9&amp;dq=axel+bruns+gatewatching&amp;ots=93qN2wGWVS&amp;sig=2jQXjrIHFKlMdHlJRL_I75ZT338" onclick="urchinTracker('/outgoing/books.google.co.uk/books_20hl=en_amp_lr=_amp_id=ybSFU9aDzsoC_amp_oi=fnd_amp_pg=PR9_amp_dq=axel+bruns+gatewatching_amp_ots=93qN2wGWVS_amp_sig=2jQXjrIHFKlMdHlJRL_I75ZT338?referer=');">as Axel Bruns argues</a> — towards that of gatewatcher: amplifying the voices that need to be heard, factchecking the <a title="nadine dorries" href="http://www.bbc.co.uk/news/uk-politics-11597664" onclick="urchinTracker('/outgoing/www.bbc.co.uk/news/uk-politics-11597664?referer=');">MPs whose blogs are 70% fiction</a> or the <a title="SEO as a public service: Facebook, paedophiles and cartoon avatars" href="http://onlinejournalismblog.com/2010/12/05/facebook-cartoon-avatars-paedophiles-and-seo-as-a-public-service/">Facebook users scaremongering about paedophiles</a>.</p>
<p>&#8220;But while we are still adapting to this power shift, we should also recognise that that power is still being fiercely fought-over. <a href="http://www.readwriteweb.com/archives/youtube_censors_megaupload_song_video.php?utm_source=feedburner&amp;utm_medium=feed&amp;utm_campaign=Feed:+readwriteweb+(ReadWriteWeb)" onclick="urchinTracker('/outgoing/www.readwriteweb.com/archives/youtube_censors_megaupload_song_video.php?utm_source=feedburner_amp_utm_medium=feed_amp_utm_campaign=Feed_+readwriteweb+_ReadWriteWeb&amp;referer=');">Old laws are being used in new ways</a>; <a href="http://www.out-law.com/page-11903" onclick="urchinTracker('/outgoing/www.out-law.com/page-11903?referer=');">new laws are being proposed</a> to reaffirm previous relationships. Some of these may benefit journalists — but ultimately not journalism, nor its fourth estate role. The journalists most keenly aware of this — <a href="http://heatherbrooke.org/" onclick="urchinTracker('/outgoing/heatherbrooke.org/?referer=');">Heather Brooke</a> in her pursuit of freedom of information; Charles Arthur in <a href="http://www.freeourdata.org.uk/blog/" onclick="urchinTracker('/outgoing/www.freeourdata.org.uk/blog/?referer=');">his campaign to ‘Free Our Data’</a> — recognise that journalists’ biggest role as part of the fourth estate may well be to ensure that everyone has access to information that is of public interest, that we are free to discuss it and what it means, and that — in the words of <a href="http://en.wikipedia.org/wiki/Eric_S._Raymond" target="_blank" onclick="urchinTracker('/outgoing/en.wikipedia.org/wiki/Eric_S._Raymond?referer=');">Eric S. Raymond</a> — “Given enough eyeballs, all bugs are shallow“.&#8221;</p></blockquote>
<p>Comments, as always, very welcome.</p>
<iframe src="http://www.facebook.com/plugins/like.php?href=http%3A%2F%2Fonlinejournalismblog.com%2F2011%2F12%2F29%2F2-guest-posts-2012-predictions-and-social-media-and-the-evolution-of-the-fourth-estate%2F&amp;layout=standard&amp;show_faces=true&amp;width=450&amp;action=like&amp;colorscheme=light&amp;height=80" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:450px; height:80px;" allowTransparency="true"></iframe>]]></content:encoded>
			<wfw:commentRss>http://onlinejournalismblog.com/2011/12/29/2-guest-posts-2012-predictions-and-social-media-and-the-evolution-of-the-fourth-estate/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Is Ice Cream Strawberry? Part 4: Human Capital</title>
		<link>http://onlinejournalismblog.com/2011/03/03/is-ice-cream-strawberry-inaugural-lecture-part-4-human-capital/</link>
		<comments>http://onlinejournalismblog.com/2011/03/03/is-ice-cream-strawberry-inaugural-lecture-part-4-human-capital/#comments</comments>
		<pubDate>Thu, 03 Mar 2011 17:55:22 +0000</pubDate>
		<dc:creator>Paul Bradshaw</dc:creator>
				<category><![CDATA[online journalism]]></category>
		<category><![CDATA[Charles Arthur]]></category>
		<category><![CDATA[chris taggart]]></category>
		<category><![CDATA[deliberative democracy]]></category>
		<category><![CDATA[ellen miller]]></category>
		<category><![CDATA[free our data]]></category>
		<category><![CDATA[freedom of information]]></category>
		<category><![CDATA[gary becker]]></category>
		<category><![CDATA[global voices]]></category>
		<category><![CDATA[heather brooke]]></category>
		<category><![CDATA[human capital]]></category>
		<category><![CDATA[inaugural lecture]]></category>
		<category><![CDATA[is ice cream strawberry]]></category>
		<category><![CDATA[liberal democracy]]></category>
		<category><![CDATA[linked data]]></category>
		<category><![CDATA[lokman tsui]]></category>
		<category><![CDATA[MySociety]]></category>
		<category><![CDATA[objectivity]]></category>
		<category><![CDATA[open data]]></category>
		<category><![CDATA[participatory democracy]]></category>
		<category><![CDATA[richard taylor]]></category>
		<category><![CDATA[simon perry]]></category>
		<category><![CDATA[tim berners-lee]]></category>
		<category><![CDATA[Tom Loosemore]]></category>
		<category><![CDATA[tom steinberg]]></category>
		<category><![CDATA[walsall council]]></category>

		<guid isPermaLink="false">http://onlinejournalismblog.com/?p=13371</guid>
		<description><![CDATA[This is the fourth part of my inaugural lecture at City University London, &#8216;Is Ice Cream Strawberry?&#8217;. You can find part one here, part two here, and part three here. Human capital So here’s person number 4: Gary Becker, a Nobel prize-winning economist. Fifty years ago he used the phrase &#8216;human capital&#8217; to refer to the economic value that companies<br /><span class="read_more"><a href="http://onlinejournalismblog.com/2011/03/03/is-ice-cream-strawberry-inaugural-lecture-part-4-human-capital/">Read more...</a></span>]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fonlinejournalismblog.com%2F2011%2F03%2F03%2Fis-ice-cream-strawberry-inaugural-lecture-part-4-human-capital%2F" onclick="urchinTracker('/outgoing/api.tweetmeme.com/share?url=http_3A_2F_2Fonlinejournalismblog.com_2F2011_2F03_2F03_2Fis-ice-cream-strawberry-inaugural-lecture-part-4-human-capital_2F&amp;referer=');"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fonlinejournalismblog.com%2F2011%2F03%2F03%2Fis-ice-cream-strawberry-inaugural-lecture-part-4-human-capital%2F&amp;style=normal&amp;b=2" height="61" width="50" /><br />
			</a>
		</div>
<p><em>This is the fourth part of my inaugural lecture at City University London, &#8216;Is Ice Cream Strawberry?&#8217;. You can find <a href="http://onlinejournalismblog.com/2011/03/03/is-ice-cream-strawberry-inaugural-lecture-part-1-the-telegraph-myth/">part one here</a>, <a href="http://onlinejournalismblog.com/2011/03/03/is-ice-cream-strawberry-inaugural-lecture-part-2-cars-roads-and-picnics/">part two here</a>, and <a href="http://onlinejournalismblog.com/2011/03/03/is-ice-cream-strawberry-inaugural-lecture-part-3-the-production-line-has-been-replaced-by-a-network/">part three here</a>.</em></p>
<h2>Human capital</h2>
<p>So here’s person number 4: Gary Becker, a Nobel prize-winning economist.</p>
<p>Fifty years ago he used the phrase &#8216;human capital&#8217; to refer to the economic value that companies should ascribe to their employees.</p>
<p>These days, of course, it is common sense to invest time in recruiting, training and retaining good employees. But at the time employees were seen as a cost.</p>
<p>We need a similar change in the way we see our readers &#8211; not as a cost on our time but as a valuable part of our operations that we should invest in recruiting, developing and retaining.<span id="more-13371"></span></p>
<p>Any online operation that does not incorporate its users in production is <strong>not only democratically deficient, it is commercially inefficient.</strong></p>
<p>Of course some are inclined to see user generated content as a mass of ignorance, abuse and waffle. Those people should ask how much work has been put into attracting good contributors? Into developing a healthy commenting culture? And how much has been invested into giving the good users a reason to keep coming back?</p>
<h2>Journalism’s conflicted future</h2>
<p>I have spoken about journalism&#8217;s ego problem. This is worsened by the fact that journalism is going through an identity crisis, which will become increasingly problematic as it tries to reinvent itself for an uncertain future.</p>
<p>And as always, this is nothing new. In the 1920s journalism faced a similar conflict: between the journalism of information and the journalism of stories. Should we, as journalists, perform a role of providing citizens with the information that they need to make informed decisions? Or are we just in the business of great stories?</p>
<p>The source of that conflict then was the rise of the scientific method, as I explained at the start of this lecture. The source of today’s conflict could be traced to institutional change in news organisations becoming part of larger entertainment empires &#8211; and the melting pot of online publication.</p>
<p>Where you stand on the role of journalists will likely depend on whether you think you’re in the business of building cars, constructing roads or organising picnics, and what role you think journalism should perform in a democracy.</p>
<p>Is journalism part of a deliberative democracy, in which the media provides a public forum for debate and consensus?</p>
<p>Is journalism&#8217;s role is to provide citizens with information &#8211; as part of a liberal democracy?</p>
<p>Or should the media encourage participation and engagement as part of a participatory democracy?</p>
<p>The institutional history of journalism kept those views somewhat separated &#8211; as Lokman Tsui explores in<a href="http://www.lokman.org/2010/11/02/my-dissertation-lives/" onclick="urchinTracker('/outgoing/www.lokman.org/2010/11/02/my-dissertation-lives/?referer=');"> his ethnography of Global Voices</a>.</p>
<p>But as these cultures of journalism clash in the online space it is more important than ever that we reflect on our own views of where we stand.</p>
<p>And as educators we should be teaching our students to be aware of their positions and how that affects what they report on, how they report it, and who gets a voice in its coverage.</p>
<p>If objectivity is to remain a journalistic value, then it should be modern objectivity, not this 19th century construct that passes for objectivity in most newswriting: the setting up of an arbitrary fence, and the selection of a source from each side of it as an indication of ‘balance’.</p>
<h2>Culture shift</h2>
<p>In many ways culture is the way that people and institutions communicate with each other. And just as institutional culture shapes the journalism that we create, for the last couple of decades there has been a growing movement outside of news institutions that sees democracy as both participatory and information driven.</p>
<p>The campaign for Freedom of Information, the work of MySociety in opening up voting records and debate transcripts so the public could see what their representatives were doing and saying in their name. The Free Our Data campaign &#8211; which sought to give the public the right to use information that was paid for with public money. And the Linked Data and Open Data movements which have campaigned to make public bodies publish data in a form that makes it easier to interrogate.</p>
<p>What these people &#8211; and I want to name some of them here:</p>
<ul>
<li>Tom Steinberg and Tom Loosemore</li>
<li>Heather Brooke</li>
<li>Charles Arthur</li>
<li>Tim Berners Lee and</li>
<li>Chris Taggart</li>
</ul>
<p>What these people have done &#8211; and are still doing &#8211; is making power accountable, promoting a cultural expectation that we should have access to information about how our money is spent, and that most publicly funded information should be available to the people who paid for it.</p>
<p>This of course is one of the first steps to &#8216;holding power to account&#8217;, the great argument that publishers make for their existence. That is what the Telegraph did with the MPs&#8217; expenses; what The Guardian have done with Wikileaks data.</p>
<p>But too much of this groundwork is lying ignored and unsupported by the mainstream press.</p>
<p>When Walsall Council released their spending data last year the webpage received more visits than the rest of the council website. They received several enquiries from people like Chris Taggart asking for information about why particular items had been redacted &#8211; but they received only one enquiry from the local newspaper.</p>
<p>And that was to ask: &#8216;Why have you released the data early?&#8217;</p>
<p>For comparison I want to show you a video of Ellen Miller of the Sunlight Foundation in the US talking about their government’s transparency initiative.</p>
<p><iframe width="600" height="338" src="http://www.youtube.com/embed/UNQteT9Bu2w?start=128&#038;fs=1&#038;feature=oembed" frameborder="0" allowfullscreen></iframe></p>
<p>Where is the news organisation in the UK that is lobbying like this?</p>
<p>A similar cultural shift is happening around public meetings and hearings, with hyperlocal blogs who want to make processes of law and democracy transparent.</p>
<p>Simon Perry of the Ventnor Blog was <a href="http://onlinejournalismblog.com/2010/02/24/why-was-simon-perry-ejected-from-newport-coroners-court/">ejected from a coroner&#8217;s court</a> last year on the grounds that he was neither a member of the press nor a member of the public. Richard Taylor was <a href="http://www.rtaylor.co.uk/cambridge-city-council-complaints-investigator-reports-on-filming-protocol.html" onclick="urchinTracker('/outgoing/www.rtaylor.co.uk/cambridge-city-council-complaints-investigator-reports-on-filming-protocol.html?referer=');">investigated by Cambridge City Council</a> for recording public meetings &#8211; he was not told what the grounds of the investigation were. In Brighton a councillor was <a href="http://jim.killock.org.uk/blog/brighton-tries-to-use-copyright-to-censor-councillor.html" onclick="urchinTracker('/outgoing/jim.killock.org.uk/blog/brighton-tries-to-use-copyright-to-censor-councillor.html?referer=');">disciplined</a> for posting clips of council meetings to YouTube. And Heather Brooke was <a href="http://heatherbrooke.org/2010/article-court-secrecy/" onclick="urchinTracker('/outgoing/heatherbrooke.org/2010/article-court-secrecy/?referer=');">told that she could not make an audio recording of a hearing</a> because the tribunal could not “maintain the necessary degree of control over the transcript.” When Brooke asked for a copy of the ruling she was told that there was to be no written record of it.</p>
<p>This is one area where journalists and news organisations can be investing in their users. It should not just be bloggers pushing for these changes.</p>
<p><em><a href="http://onlinejournalismblog.com/2011/03/03/is-ice-cream-strawberry-inaugural-lecture-part-4-corporatisation-of-the-public-sphere/">Part five can be found here</a>.</em></p>
<iframe src="http://www.facebook.com/plugins/like.php?href=http%3A%2F%2Fonlinejournalismblog.com%2F2011%2F03%2F03%2Fis-ice-cream-strawberry-inaugural-lecture-part-4-human-capital%2F&amp;layout=standard&amp;show_faces=true&amp;width=450&amp;action=like&amp;colorscheme=light&amp;height=80" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:450px; height:80px;" allowTransparency="true"></iframe>]]></content:encoded>
			<wfw:commentRss>http://onlinejournalismblog.com/2011/03/03/is-ice-cream-strawberry-inaugural-lecture-part-4-human-capital/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>Why journalists should be lobbying over police.uk&#8217;s crime data</title>
		<link>http://onlinejournalismblog.com/2011/02/01/why-journalists-should-be-lobbying-over-police-uks-crime-data/</link>
		<comments>http://onlinejournalismblog.com/2011/02/01/why-journalists-should-be-lobbying-over-police-uks-crime-data/#comments</comments>
		<pubDate>Tue, 01 Feb 2011 10:55:20 +0000</pubDate>
		<dc:creator>Paul Bradshaw</dc:creator>
				<category><![CDATA[data journalism]]></category>
		<category><![CDATA[adrian short]]></category>
		<category><![CDATA[Charles Arthur]]></category>
		<category><![CDATA[conrad quilty-harper]]></category>
		<category><![CDATA[crime data]]></category>
		<category><![CDATA[crimesearch.co.uk]]></category>
		<category><![CDATA[foi]]></category>
		<category><![CDATA[jonathan raper]]></category>
		<category><![CDATA[louise kidney]]></category>
		<category><![CDATA[open data]]></category>
		<category><![CDATA[police.uk]]></category>
		<category><![CDATA[public data corporation]]></category>
		<category><![CDATA[Will Perrin]]></category>

		<guid isPermaLink="false">http://onlinejournalismblog.com/?p=12789</guid>
		<description><![CDATA[Conrad Quilty-Harper writes about the new crime data from the UK police force &#8211; and in the process adds another straw to the groaning camel&#8217;s back of the government&#8217;s so-called transparency agenda: &#8220;It’s useless to residents wanting to find out what was going on at the house around the corner at 3am last night, and it’s useless to individuals who<br /><span class="read_more"><a href="http://onlinejournalismblog.com/2011/02/01/why-journalists-should-be-lobbying-over-police-uks-crime-data/">Read more...</a></span>]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fonlinejournalismblog.com%2F2011%2F02%2F01%2Fwhy-journalists-should-be-lobbying-over-police-uks-crime-data%2F" onclick="urchinTracker('/outgoing/api.tweetmeme.com/share?url=http_3A_2F_2Fonlinejournalismblog.com_2F2011_2F02_2F01_2Fwhy-journalists-should-be-lobbying-over-police-uks-crime-data_2F&amp;referer=');"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fonlinejournalismblog.com%2F2011%2F02%2F01%2Fwhy-journalists-should-be-lobbying-over-police-uks-crime-data%2F&amp;style=normal&amp;b=2" height="61" width="50" /><br />
			</a>
		</div>
<p><img src="http://i.telegraph.co.uk/multimedia/archive/01815/police-map_1815700b.jpg" alt="UK police crime maps" /></p>
<p>Conrad Quilty-Harper <a href="http://blogs.telegraph.co.uk/news/conradquiltyharper/100074290/why-www-police-uk-is-useless-aka-the-oh-look-pretty-maps-effect/" onclick="urchinTracker('/outgoing/blogs.telegraph.co.uk/news/conradquiltyharper/100074290/why-www-police-uk-is-useless-aka-the-oh-look-pretty-maps-effect/?referer=');">writes about the new crime data</a> from the UK police force &#8211; and in the process adds another straw to the groaning camel&#8217;s back of the government&#8217;s so-called transparency agenda:</p>
<blockquote><p>&#8220;It’s useless to residents wanting to find out what was going on at the house around the corner at 3am last night, and it’s useless to individuals who want to build mobile phone applications on top of the data (perhaps to get a chunk of that £6 billion industry open data is supposed to create).</p>
<p>&#8220;The site’s limitations are as follows:</p>
<ul>
<li>No IDs for crimes: what if I want to check whether real life crimes have made it onto the map? Sorry.</li>
<li>Six crime categories: including “other crimes”, everything from drug dealing to bank robberies in one handy, impossible to understand category.</li>
<li>No live data: you mean I have to wait until the end of the next month to see this month’s criminality?!</li>
<li>No dates or times: funny how without dates and times I can’t tell which police manager was in charge.</li>
<li>Case status: the police know how many crimes go solved or unsolved, why not tell us this?&#8221;</li>
</ul>
</blockquote>
<p>This is why people are so <a href="http://countculture.wordpress.com/2011/01/14/whats-that-coming-over-the-hill-is-it-the-public-data-corporation/" onclick="urchinTracker('/outgoing/countculture.wordpress.com/2011/01/14/whats-that-coming-over-the-hill-is-it-the-public-data-corporation/?referer=');">concerned about the Public Data Corporation</a>. This is why we need to be <a href="http://openlylocal.com/councils/spending" onclick="urchinTracker('/outgoing/openlylocal.com/councils/spending?referer=');">monitoring exactly what spending data councils release</a>, and in what format. And this is why we need to continue to <a href="http://www.guardian.co.uk/politics/2011/jan/06/nick-clegg-freedom-information-laws" onclick="urchinTracker('/outgoing/www.guardian.co.uk/politics/2011/jan/06/nick-clegg-freedom-information-laws?referer=');">press for the expansion of FOI laws</a>. This is what we should be doing. Are we?</p>
<p>UPDATE: Will Perrin has <a href="http://www.whatdotheyknow.com/request/formation_of_advice_on_crime_map" onclick="urchinTracker('/outgoing/www.whatdotheyknow.com/request/formation_of_advice_on_crime_map?referer=');">FOI&#8217;d all correspondence </a>relating to ICO advice on the crime maps. Jonathan Raper has a<a href="http://placr.co.uk/blog/2011/02/five-reasons-to-be-cautious-about-street-level-crime-data/" onclick="urchinTracker('/outgoing/placr.co.uk/blog/2011/02/five-reasons-to-be-cautious-about-street-level-crime-data/?referer=');"> list of further flaws</a> including:</p>
<ul>
<li>Some data such as sexual offences and murder is removed – even though it would be easy to discover and locate from other police reports.</li>
<li>Data covers reported crimes rather than convictions, so some of it may turn out not to be crime.</li>
<li>The levels of policing are not provided, so that two areas with the &#8220;same&#8221; crime levels may in fact have &#8220;radically different&#8221; experiences of crime and policing.</li>
</ul>
<p>Charles Arthur <a href="http://www.guardian.co.uk/technology/2011/feb/02/uk-crime-maps-developers-unhappy" onclick="urchinTracker('/outgoing/www.guardian.co.uk/technology/2011/feb/02/uk-crime-maps-developers-unhappy?referer=');">notes </a>that: &#8220;Police forces have indicated that whenever a new set of data is uploaded – probably each month – the previous set will be removed from public view, making comparisons impossible unless outside developers actively store it.&#8221;</p>
<p>Louise Kidney <a href="http://ashinyworld.blogspot.com/2011/02/events-dear-boy-events.html" onclick="urchinTracker('/outgoing/ashinyworld.blogspot.com/2011/02/events-dear-boy-events.html?referer=');">says</a>:</p>
<blockquote><p>&#8220;What we&#8217;ve actually got with www.police.uk is neither one nor the other. Ruth looks like a crime overlord cos of all the crimes happening in her garden and we haven&#8217;t got exact point data, but we haven&#8217;t got first part of postcode data either e.g. BB5 crimes or NW1 crimes. Instead, we&#8217;ve got this weird halfway house thing where it&#8217;s not accurate, but its inaccuracy almost renders it useless because we don&#8217;t have any idea if every force uses the same parameters when picking these points, we don&#8217;t know how they pick their points, we don&#8217;t know what we don&#8217;t know in terms of whether one house in particular is causing a considerable issue with anti-social behaviour for example, allowing me to go to my local Council and demand they do something about it.&#8221;</p></blockquote>
<p>Adrian Short <a href="http://adrianshort.co.uk/2011/02/01/police-uk-official-crime-maps-there-should-be-a-law-against-it/" onclick="urchinTracker('/outgoing/adrianshort.co.uk/2011/02/01/police-uk-official-crime-maps-there-should-be-a-law-against-it/?referer=');">argues </a>that &#8220;What we’re looking at here isn’t a value-neutral scientific exercise in helping people to live their daily lives a little more easily, it’s an explicitly political attempt to shape the terms of a debate around the most fundamental changes in British policing in our lifetimes.&#8221;</p>
<p>He adds:</p>
<blockquote><p>&#8220;It’s derived data that’s already been classified, rounded and lumped together in various ways, with a bit of location anonymising thrown in for good measure. I haven’t had a detailed look at it yet but I would caution against trying to use it for anything serious. A whole set of decisions have already transformed the raw source data (individual crime reports) into this derived dataset and you can’t undo them. You’ll just have to work within those decisions and stay extremely conscious that everything you produce with it will be prefixed, “as far as we can tell”.</p>
<p>&#8220;£300K for this? There ought to be a law against it.&#8221;</p></blockquote>
<p>UPDATE 2: One frustrated developer has launched <a href="http://crimesearch.co.uk/" onclick="urchinTracker('/outgoing/crimesearch.co.uk/?referer=');">CrimeSearch.co.uk</a> to provide &#8220;helpful information about crime and policing in your area, without costing 300k of tax payers&#8217; money&#8221;</p>
<iframe src="http://www.facebook.com/plugins/like.php?href=http%3A%2F%2Fonlinejournalismblog.com%2F2011%2F02%2F01%2Fwhy-journalists-should-be-lobbying-over-police-uks-crime-data%2F&amp;layout=standard&amp;show_faces=true&amp;width=450&amp;action=like&amp;colorscheme=light&amp;height=80" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:450px; height:80px;" allowTransparency="true"></iframe>]]></content:encoded>
			<wfw:commentRss>http://onlinejournalismblog.com/2011/02/01/why-journalists-should-be-lobbying-over-police-uks-crime-data/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Where should an aspiring data journalist start?</title>
		<link>http://onlinejournalismblog.com/2010/10/04/where-should-an-aspiring-data-journalist-start/</link>
		<comments>http://onlinejournalismblog.com/2010/10/04/where-should-an-aspiring-data-journalist-start/#comments</comments>
		<pubDate>Mon, 04 Oct 2010 07:41:30 +0000</pubDate>
		<dc:creator>Paul Bradshaw</dc:creator>
				<category><![CDATA[data journalism]]></category>
		<category><![CDATA[aron pilhofer]]></category>
		<category><![CDATA[car]]></category>
		<category><![CDATA[Charles Arthur]]></category>
		<category><![CDATA[computer assisted reporting]]></category>
		<category><![CDATA[conrad quilty-harper]]></category>
		<category><![CDATA[visualisation]]></category>

		<guid isPermaLink="false">http://onlinejournalismblog.com/?p=9619</guid>
		<description><![CDATA[In writing last week&#8217;s Guardian Data Blog piece on How to be a data journalist I asked various people involved in data journalism where they would recommend starting. The answers are so useful that I thought I&#8217;d publish them in full here. The Telegraph&#8217;s Conrad Quilty-Harper: Start reading: http://www.google.com/reader/bundle/user%2F06076274130681848419%2Fbundle%2Fdatavizfeeds Keep adding to your knowledge and follow other data journalists/people who<br /><span class="read_more"><a href="http://onlinejournalismblog.com/2010/10/04/where-should-an-aspiring-data-journalist-start/">Read more...</a></span>]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fonlinejournalismblog.com%2F2010%2F10%2F04%2Fwhere-should-an-aspiring-data-journalist-start%2F" onclick="urchinTracker('/outgoing/api.tweetmeme.com/share?url=http_3A_2F_2Fonlinejournalismblog.com_2F2010_2F10_2F04_2Fwhere-should-an-aspiring-data-journalist-start_2F&amp;referer=');"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fonlinejournalismblog.com%2F2010%2F10%2F04%2Fwhere-should-an-aspiring-data-journalist-start%2F&amp;style=normal&amp;b=2" height="61" width="50" /><br />
			</a>
		</div>
<p><em>In writing last week&#8217;s Guardian Data Blog piece on <a href="http://www.guardian.co.uk/news/datablog/2010/oct/01/data-journalism-how-to-guide" onclick="urchinTracker('/outgoing/www.guardian.co.uk/news/datablog/2010/oct/01/data-journalism-how-to-guide?referer=');">How to be a data journalist</a></em><em> I asked various people involved in data journalism where they would recommend starting. The answers are so useful that I thought I&#8217;d publish them in full here. </em></p>
<h2>The Telegraph&#8217;s <strong>Conrad Quilty-Harper:</strong></h2>
<h3>Start reading:</h3>
<p><a href="http://www.google.com/reader/bundle/user%2F06076274130681848419%2Fbundle%2Fdatavizfeeds" onclick="urchinTracker('/outgoing/www.google.com/reader/bundle/user_2F06076274130681848419_2Fbundle_2Fdatavizfeeds?referer=');">http://www.google.com/reader/bundle/user%2F06076274130681848419%2Fbundle%2Fdatavizfeeds</a></p>
<p>Keep adding to your knowledge and follow other data journalists/people who work with data on Twitter.</p>
<h3>Look for sources of data:</h3>
<p>ONS stats release calendar is a good start <a href="http://www.statistics.gov.uk/hub/release-calendar/index.html" onclick="urchinTracker('/outgoing/www.statistics.gov.uk/hub/release-calendar/index.html?referer=');">http://www.statistics.gov.uk/hub/release-calendar/index.html</a> Look at the Government data stores (<a href="http://Data.gov" onclick="urchinTracker('/outgoing/Data.gov?referer=');">Data.gov</a>, <a href="http://Data.gov.uk" onclick="urchinTracker('/outgoing/Data.gov.uk?referer=');">Data.gov.uk</a>, <a href="http://Data.london.gov.uk" onclick="urchinTracker('/outgoing/Data.london.gov.uk?referer=');">Data.london.gov.uk</a> etc).</p>
<p>Check out What do they know, Freebase, Wikileaks, Manyeyes, Google Fusion charts.<span id="more-9619"></span></p>
<p>Find out where hidden data is and try and get hold of it: private companies looking for publicity, under appreciated research departments, public bodies that release data but not in a granular form (e.g. Met Office).</p>
<h3>Test out cleaning/visualisation tools:</h3>
<p>You want to be able to collect data, clean it, visualise it and map it.</p>
<p>Obviously you need to know basic Excel skills (pivot tables are how journalists efficiently get headline numbers from big spreadsheets).</p>
<p>For publishing just use Google Spreadsheets graphs, or ManyEyes or Timetric. Google MyMaps coupled with <a href="http://batchgeo.com" onclick="urchinTracker('/outgoing/batchgeo.com?referer=');">http://batchgeo.com</a> is a great beginner mapping combo.</p>
<p>Further on from that you want to try out Google Spreadsheets importURL service, Yahoo Pipes for cleaning data, Freebase Gridworks and Dabble DB.</p>
<p>More advanced stuff you want to figure out query language and be able to work with relational databases, Google BigQuery, Google Visualisation API (<a href="http://code.google.com/apis/charttools/" onclick="urchinTracker('/outgoing/code.google.com/apis/charttools/?referer=');">http://code.google.com/apis/charttools/</a>), Google code playgrounds (<a href="http://code.google.com/apis/ajax/playground/?type=visualization#org_chart" onclick="urchinTracker('/outgoing/code.google.com/apis/ajax/playground/?type=visualization_org_chart&amp;referer=');">http://code.google.com/apis/ajax/playground/?type=visualization#org_chart</a>) and other Javascript tools. The advanced mapping equivalents are ArcGIS or GeoConcept, allowing you to query geographical data and find stories.</p>
<p>You could also learn some Ruby for building your own scrapers, or Python for ScraperWiki.</p>
<h3>Get inspired:</h3>
<p>Get the data behind some big data stories you admire, try and find a story, visualise it and blog about it. You&#8217;ll find that the whole process starts with the data, and your interpretation of it. That needs to be newsworthy/valuable.</p>
<p>Look to the past!</p>
<p>Edward Tufte&#8217;s work is very inspiring: <a href="http://www.edwardtufte.com/tufte/" onclick="urchinTracker('/outgoing/www.edwardtufte.com/tufte/?referer=');">http://www.edwardtufte.com/tufte/</a> His favourite data visualisation is from 1869! Or what about John Snow&#8217;s Cholera map? <a href="http://www.york.ac.uk/depts/maths/histstat/snow_map.htm" onclick="urchinTracker('/outgoing/www.york.ac.uk/depts/maths/histstat/snow_map.htm?referer=');">http://www.york.ac.uk/depts/maths/histstat/snow_map.htm</a></p>
<p>And for good luck here&#8217;s an assorted list of visualisation tutorials.</p>
<ul>
<li><a href="http://www.google.com/url?q=http%3A%2F%2Fwww.visual-literacy.org%2Fperiodic_table%2Fperiodic_table.html&amp;sa=D&amp;sntz=1&amp;usg=AFrqEzf4s1BmyUCnmR5CaNNU4MJZ6aEj4Q" target="_blank" onclick="urchinTracker('/outgoing/www.google.com/url?q=http_3A_2F_2Fwww.visual-literacy.org_2Fperiodic_table_2Fperiodic_table.html_amp_sa=D_amp_sntz=1_amp_usg=AFrqEzf4s1BmyUCnmR5CaNNU4MJZ6aEj4Q&amp;referer=');">Visualisation of types of visualisations</a></li>
<li><a href="http://www.google.com/url?q=http%3A%2F%2Fflowingdata.com%2F2010%2F01%2F21%2Fhow-to-make-a-heatmap-a-quick-and-easy-solution%2F&amp;sa=D&amp;sntz=1&amp;usg=AFrqEzdgIuCtPiFSfJwyawHS3TzMw6KIGw" target="_blank" onclick="urchinTracker('/outgoing/www.google.com/url?q=http_3A_2F_2Fflowingdata.com_2F2010_2F01_2F21_2Fhow-to-make-a-heatmap-a-quick-and-easy-solution_2F_amp_sa=D_amp_sntz=1_amp_usg=AFrqEzdgIuCtPiFSfJwyawHS3TzMw6KIGw&amp;referer=');">How to make a heatmap</a></li>
<li><a href="http://www.google.com/url?q=http%3A%2F%2Fvisitmix.com%2FArticles%2Fseven-and-a-half-steps-to-successful-infographics&amp;sa=D&amp;sntz=1&amp;usg=AFrqEzevka3DW-V5WKLZqYrex32I3r7Y7w" target="_blank" onclick="urchinTracker('/outgoing/www.google.com/url?q=http_3A_2F_2Fvisitmix.com_2FArticles_2Fseven-and-a-half-steps-to-successful-infographics_amp_sa=D_amp_sntz=1_amp_usg=AFrqEzevka3DW-V5WKLZqYrex32I3r7Y7w&amp;referer=');">How to create successful infographics</a></li>
<li><a href="http://www.google.com/url?q=http%3A%2F%2Flifehacker.com%2Fsoftware%2Fmemory%2Fgeek-to-live--roll-your-own-timeline-207426.php&amp;sa=D&amp;sntz=1&amp;usg=AFrqEzdqPE-fQzR6ZjSW8447TFacXgH0YA" target="_blank" onclick="urchinTracker('/outgoing/www.google.com/url?q=http_3A_2F_2Flifehacker.com_2Fsoftware_2Fmemory_2Fgeek-to-live--roll-your-own-timeline-207426.php_amp_sa=D_amp_sntz=1_amp_usg=AFrqEzdqPE-fQzR6ZjSW8447TFacXgH0YA&amp;referer=');">How to create a timeline using Simile</a></li>
<li><a href="http://www.google.com/url?q=http%3A%2F%2Fwoorkup.com%2F2010%2F02%2F19%2F10-useful-google-spreadsheet-formulas-you-must-know%2F&amp;sa=D&amp;sntz=1&amp;usg=AFrqEzdFlxZA1l9MtkHXbKTwECLjmUz-qQ" target="_blank" onclick="urchinTracker('/outgoing/www.google.com/url?q=http_3A_2F_2Fwoorkup.com_2F2010_2F02_2F19_2F10-useful-google-spreadsheet-formulas-you-must-know_2F_amp_sa=D_amp_sntz=1_amp_usg=AFrqEzdFlxZA1l9MtkHXbKTwECLjmUz-qQ&amp;referer=');">10 useful Google Spreadsheet tricks</a></li>
<li><a href="http://www.google.com/url?q=http%3A%2F%2Fdigitalnewsgathering.wordpress.com%2F2010%2F04%2F24%2F10-tips-for-designing-infographics%2F&amp;sa=D&amp;sntz=1&amp;usg=AFrqEzdnUDiDWYYkCXQiMPcDa7j1dNBIJA" target="_blank" onclick="urchinTracker('/outgoing/www.google.com/url?q=http_3A_2F_2Fdigitalnewsgathering.wordpress.com_2F2010_2F04_2F24_2F10-tips-for-designing-infographics_2F_amp_sa=D_amp_sntz=1_amp_usg=AFrqEzdnUDiDWYYkCXQiMPcDa7j1dNBIJA&amp;referer=');">10 tips for designing infographics</a></li>
<li><a href="http://www.google.com/url?q=http%3A%2F%2Fwww.smashingmagazine.com%2F2007%2F10%2F18%2Fcharts-and-graphs-modern-solutions%2F&amp;sa=D&amp;sntz=1&amp;usg=AFrqEzfgq-5rXTQ9jMx_DwLRNdzVEPTsvQ" target="_blank" onclick="urchinTracker('/outgoing/www.google.com/url?q=http_3A_2F_2Fwww.smashingmagazine.com_2F2007_2F10_2F18_2Fcharts-and-graphs-modern-solutions_2F_amp_sa=D_amp_sntz=1_amp_usg=AFrqEzfgq-5rXTQ9jMx_DwLRNdzVEPTsvQ&amp;referer=');">Graphing and charting solutions in javascript and flash</a></li>
<li><a href="http://www.google.com/url?q=http%3A%2F%2Fvector.tutsplus.com%2Ftutorials%2Fdesigning%2Fhow-to-create-outstanding-modern-infographics%2F&amp;sa=D&amp;sntz=1&amp;usg=AFrqEzcWpzR5pH5sqPeb3iZuYGF_mj4o5w" target="_blank" onclick="urchinTracker('/outgoing/www.google.com/url?q=http_3A_2F_2Fvector.tutsplus.com_2Ftutorials_2Fdesigning_2Fhow-to-create-outstanding-modern-infographics_2F_amp_sa=D_amp_sntz=1_amp_usg=AFrqEzcWpzR5pH5sqPeb3iZuYGF_mj4o5w&amp;referer=');">How to create outstanding modern infographics in Illustrator</a></li>
</ul>
<h2>The Times&#8217; Jonathan Richards</h2>
<p>I&#8217;d say a couple of blogs.</p>
<ul>
<li>Hacks/Hackers: <a href="http://hackshackers.com/" onclick="urchinTracker('/outgoing/hackshackers.com/?referer=');">http://hackshackers.com/</a></li>
<li>and more importantly their Q&amp;A site: <a href="http://help.hackshackers.com/" onclick="urchinTracker('/outgoing/help.hackshackers.com/?referer=');">http://help.hackshackers.com/</a> - which answers a ton of questions about which computer language/book/manual/online tutorial to get started with.</li>
</ul>
<p>Others that spring to mind are:</p>
<ul>
<li>Michelle Minkoff: <a href="http://michelleminkoff.com/" onclick="urchinTracker('/outgoing/michelleminkoff.com/?referer=');">http://michelleminkoff.com/</a> (Hack/hacker who&#8217;s now at the LA Times. Provides a nice &#8216;not too geeky&#8217; take on the area) and</li>
<li>10,000 words: <a href="http://10000words.net/" onclick="urchinTracker('/outgoing/10000words.net/?referer=');">http://10000words.net/</a></li>
</ul>
<p>If people want more specific advice, tell them to come to the next London Hack/Hackers and track me down!</p>
<h2>The Guardian&#8217;s Charles Arthur:</h2>
<p>Obvious thing: find a story that will be best told through numbers. (I&#8217;m thinking about quizzing my local council about the effects of stopping free swimming for children. Obvious way forward: get numbers for number of children swimming before, during and after free swimming offer.)</p>
<p>If someone already has the skills for data journalism (which I&#8217;d put at (1) understanding statistics and relevance (2) understanding how to manipulate data (3) understanding how to make the data visual) the key, I&#8217;d say, is always being able to spot a story that can be told through data &#8211; and only makes sense that way, and where being able to manipulate the data is key to extracting the story. It&#8217;s like interviewing the data. Good interviewers know how to get what they want out from the conversation. Ditto good data journalists and their data.</p>
<h2>The New York Times&#8217; Aron Pilhofer:</h2>
<p>I would start small, and start with something you already know and already do. And always, always, always remember that the goal here is journalism. There is a tendency to focus too much on the skills for the sake of skills, and not enough on how those skills help enable you to do better journalism. Be pragmatic about it, and resist the tendency to think you need to know everything about the techy stuff before you do anything &#8212; nothing could be further from the truth.</p>
<p>Less abstractly, I would start out learning some basic computer-assisted reporting skills and then moving from there as your interests/needs dictate. A lot of people see the programmer/journalism thing as distinct from computer-assisted reporting, but I don&#8217;t. I see it as a continuum. I see CAR as a &#8220;gateway drug&#8221; of sorts: Once you start working with small data sets using tools like Excel, Access, MySQL, etc., you&#8217;ll eventually hit limits of what you can do with macros and SQL.</p>
<p>Soon enough, you&#8217;ll want to be able to script certain things. You&#8217;ll want to get data from the web. You&#8217;ll want to do things you can only do using some kind of scripting language, and so it begins.</p>
<p>But again, the place to start isn&#8217;t thinking about all these technologies. The place to start is thinking about how these technologies can enable you to tell stories you otherwise would never be able to tell otherwise. And you should start small. Look for little things to start, and go from there.</p>
<iframe src="http://www.facebook.com/plugins/like.php?href=http%3A%2F%2Fonlinejournalismblog.com%2F2010%2F10%2F04%2Fwhere-should-an-aspiring-data-journalist-start%2F&amp;layout=standard&amp;show_faces=true&amp;width=450&amp;action=like&amp;colorscheme=light&amp;height=80" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:450px; height:80px;" allowTransparency="true"></iframe>]]></content:encoded>
			<wfw:commentRss>http://onlinejournalismblog.com/2010/10/04/where-should-an-aspiring-data-journalist-start/feed/</wfw:commentRss>
		<slash:comments>13</slash:comments>
		</item>
		<item>
		<title>Why did you get into data journalism?</title>
		<link>http://onlinejournalismblog.com/2010/09/22/why-did-you-get-into-data-journalism/</link>
		<comments>http://onlinejournalismblog.com/2010/09/22/why-did-you-get-into-data-journalism/#comments</comments>
		<pubDate>Wed, 22 Sep 2010 10:40:13 +0000</pubDate>
		<dc:creator>Paul Bradshaw</dc:creator>
				<category><![CDATA[data journalism]]></category>
		<category><![CDATA[Charles Arthur]]></category>
		<category><![CDATA[jonathan richards]]></category>
		<category><![CDATA[mary hamilton]]></category>
		<category><![CDATA[online journalism book]]></category>

		<guid isPermaLink="false">http://onlinejournalismblog.com/?p=8423</guid>
		<description><![CDATA[In researching my book chapter (UPDATE: now published) I asked a group of journalists who worked with data what led them to do so. Here are their answers: Jonathon Richards, The Times: The flood of information online presents an amazing opportunity for journalists, but also a challenge: how on earth does one keep up with; make sense of it? You could<br /><span class="read_more"><a href="http://onlinejournalismblog.com/2010/09/22/why-did-you-get-into-data-journalism/">Read more...</a></span>]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fonlinejournalismblog.com%2F2010%2F09%2F22%2Fwhy-did-you-get-into-data-journalism%2F" onclick="urchinTracker('/outgoing/api.tweetmeme.com/share?url=http_3A_2F_2Fonlinejournalismblog.com_2F2010_2F09_2F22_2Fwhy-did-you-get-into-data-journalism_2F&amp;referer=');"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fonlinejournalismblog.com%2F2010%2F09%2F22%2Fwhy-did-you-get-into-data-journalism%2F&amp;style=normal&amp;b=2" height="61" width="50" /><br />
			</a>
		</div>
<p>In researching my book chapter (<strong>UPDATE: <a href="http://www.amazon.co.uk/Online-Journalism-Handbook-Survive-Digital/dp/140587340X/ref=as_li_ss_mfw?&amp;camp=2486&amp;linkCode=wey&amp;tag=onlijourblog-21&amp;creative=8882" onclick="urchinTracker('/outgoing/www.amazon.co.uk/Online-Journalism-Handbook-Survive-Digital/dp/140587340X/ref=as_li_ss_mfw?_amp_camp=2486_amp_linkCode=wey_amp_tag=onlijourblog-21_amp_creative=8882&amp;referer=');">now published</a></strong>) I asked a group of journalists who worked with data what led them to do so. Here are their answers:</p>
<h2>Jonathon Richards, The Times:</h2>
<blockquote><p>The flood of information online presents an amazing opportunity for journalists, but also a challenge: how on earth does one keep up with; make sense of it? You could go about it in the traditional way, fossicking in individual sites, but much of the journalistic value in this outpouring, it seems, comes in aggregation: in processing large amounts of data, distilling them, and exploring them for patterns. To do that &#8211; unless you&#8217;re superhuman, or have a small army of volunteers &#8211; you need the help of a computer.</p>
<p>I &#8216;got into&#8217; data journalism because I find this mix exciting. It appeals to the traditional journalistic instinct, but also calls for a new skill which, once harnessed, dramatically expands the realm of &#8216;stories I could possibly investigate&#8230;&#8217;<span id="more-8423"></span></p></blockquote>
<h2>Mary Hamilton, Eastern Daily Press:</h2>
<blockquote><p>I started coding out of necessity, not out of desire. In my day-to-day work for local newspapers I came across stories that couldn&#8217;t be told any other way. Excel spreadsheets full of data that I knew was relevant to readers if I could break it down or aggregate it up. Lists of locations that meant nothing on the page without a map. Timelines of events and stacks of documents. The logical response for me was to try to develop the skills to parse data to get to the stories it can tell, and to present it in interactive, interesting and &#8211; crucially &#8211; relevant ways. I see data journalism as an important skill in my storytelling toolkit &#8211; not the only option, but an increasingly important way to open up information to readers and users.</p></blockquote>
<h2>Charles Arthur, The Guardian:</h2>
<blockquote><p>When I was really young, I read a book about computers which made the point &#8211; rather effectively &#8211; that if you found yourself doing the same process again and again, you should hand it over to a computer. That became a rule for me: never do some task more than once if you can possibly get a computer to do it.</p>
<p>Obviously, to implement that you have to do a bit of programming. It turns out all programming languages are much the same &#8211; they vary in their grammar, but they&#8217;re all about making the computer do stuff. And it&#8217;s often the same stuff (at least in my ambit) &#8211; fetch a web page, mash up two sets of data, filter out some rubbish and find the information you want.</p>
<p>I got into data journalism because I also did statistics &#8211; and that taught me that people are notoriously bad at understanding data. Visualisation and simplification and exposition are key to helping people understand.</p>
<p>So data journalism is a compound of all those things: determination to make the computer do the slog, confidence that I can program it to, and the desire to tell the story that the data is holding and hiding.</p>
<p>I don&#8217;t think there was any particular point where I suddenly said &#8220;ooh, this is data journalism&#8221; &#8211; it&#8217;s more that the process of thinking &#8220;oh, big dataset, stuff it into an ad-hoc MySQL database, left join against that other database I&#8217;ve got, see what comes out&#8221; goes from being a huge experiment to your natural reaction.</p>
<p>It&#8217;s not just data though &#8211; I use programming to slough off the repetitive tasks of the day, such as collecting links, or resizing pictures, or getting the picture URL and photographer and licence from a Flickr page and stuffing it into a blogpost.</p>
<p>Data journalism is actually only half the story. The other half is that journalists should be **actively unwilling** to do repetitive tasks if it&#8217;s machine-like (say, removing line breaks from a piece of copy, or changing a link format).</p>
<p>Time spent doing those sorts of tasks is time lost to journalism and given up to being a machine. Let the damn machines do it. Humans have better things to do.</p></blockquote>
<h2><a href="http://stdout.be/en/" onclick="urchinTracker('/outgoing/stdout.be/en/?referer=');">Stijn Debrouwere</a>, Belgian information designer:</h2>
<blockquote><p>I used to love reading the daily newspaper, but lately I can&#8217;t seem to be bothered anymore. I&#8217;m part of that generation of people news execs fear so much: those that simply don&#8217;t care about what newspapers and news magazines have to offer. I enjoy being an information designer because it gives me a chance to help reinvent the way we engage and inform communities through news and analysis, both offline and online. Technology doesn&#8217;t solve everything, but it sure can help. My professional goal is simply this: make myself love news and newspapers again, and thereby hopefully getting others to love it too.</p></blockquote>
<iframe src="http://www.facebook.com/plugins/like.php?href=http%3A%2F%2Fonlinejournalismblog.com%2F2010%2F09%2F22%2Fwhy-did-you-get-into-data-journalism%2F&amp;layout=standard&amp;show_faces=true&amp;width=450&amp;action=like&amp;colorscheme=light&amp;height=80" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:450px; height:80px;" allowTransparency="true"></iframe>]]></content:encoded>
			<wfw:commentRss>http://onlinejournalismblog.com/2010/09/22/why-did-you-get-into-data-journalism/feed/</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Data journalism pt2: Interrogating data</title>
		<link>http://onlinejournalismblog.com/2010/04/26/data-journalism-pt2-interrogating-data/</link>
		<comments>http://onlinejournalismblog.com/2010/04/26/data-journalism-pt2-interrogating-data/#comments</comments>
		<pubDate>Mon, 26 Apr 2010 09:04:32 +0000</pubDate>
		<dc:creator>Paul Bradshaw</dc:creator>
				<category><![CDATA[data journalism]]></category>
		<category><![CDATA[adrian short]]></category>
		<category><![CDATA[Charles Arthur]]></category>
		<category><![CDATA[cleaning data]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[google refine]]></category>
		<category><![CDATA[kaiser fung]]></category>
		<category><![CDATA[online journalism book]]></category>
		<category><![CDATA[scraping]]></category>
		<category><![CDATA[tony hirst]]></category>
		<category><![CDATA[trim]]></category>
		<category><![CDATA[Yahoo! Pipes]]></category>

		<guid isPermaLink="false">http://onlinejournalismblog.com/?p=8398</guid>
		<description><![CDATA[This is a draft from a book chapter on data journalism (the first, on gathering data, is here). I’d really appreciate any additions or comments you can make &#8211; particularly around ways of spotting stories in data, and mistakes to avoid. UPDATE: It has now been published in The Online Journalism Handbook. &#8220;One of the most important (and least technical)<br /><span class="read_more"><a href="http://onlinejournalismblog.com/2010/04/26/data-journalism-pt2-interrogating-data/">Read more...</a></span>]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fonlinejournalismblog.com%2F2010%2F04%2F26%2Fdata-journalism-pt2-interrogating-data%2F" onclick="urchinTracker('/outgoing/api.tweetmeme.com/share?url=http_3A_2F_2Fonlinejournalismblog.com_2F2010_2F04_2F26_2Fdata-journalism-pt2-interrogating-data_2F&amp;referer=');"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fonlinejournalismblog.com%2F2010%2F04%2F26%2Fdata-journalism-pt2-interrogating-data%2F&amp;style=normal&amp;b=2" height="61" width="50" /><br />
			</a>
		</div>
<p><em>This is a draft from a book chapter on data journalism (<a href="http://onlinejournalismblog.com/2010/04/21/data-journalism-pt1-finding-data-draft-comments-invited/">the first, on gathering data, is here</a>). I’d really appreciate any additions or comments you can make &#8211; particularly around ways of spotting stories in data, and mistakes to avoid.</em></p>
<p><strong>UPDATE: <a href="http://www.amazon.co.uk/Online-Journalism-Handbook-Survive-Digital/dp/140587340X/ref=as_li_ss_mfw?&amp;camp=2486&amp;linkCode=wey&amp;tag=onlijourblog-21&amp;creative=8882" onclick="urchinTracker('/outgoing/www.amazon.co.uk/Online-Journalism-Handbook-Survive-Digital/dp/140587340X/ref=as_li_ss_mfw?_amp_camp=2486_amp_linkCode=wey_amp_tag=onlijourblog-21_amp_creative=8882&amp;referer=');">It has now been published in The Online Journalism Handbook</a>.</strong></p>
<blockquote><p>&#8220;One of the most important (and least technical) skills in understanding data is asking good questions. An appropriate question shares an interest you have in the data, tries to convey it to others, and is curiosity-oriented rather than math-oriented. Visualizing data is just like any other type of communication: success is defined by your audience&#8217;s ability to pick up on, and be excited about, your insight.&#8221; (<a href="http://astore.amazon.co.uk/onlijourblog-21/detail/0596514557" onclick="urchinTracker('/outgoing/astore.amazon.co.uk/onlijourblog-21/detail/0596514557?referer=');">Fry, 2008, p4</a>)</p></blockquote>
<p>Once you have the data you need to see if there is a story buried within it. The great advantage of computer processing is that it makes it easier to sort, filter, compare and search information in different ways to get to the heart of what &#8211; if anything &#8211; it reveals.<span id="more-8398"></span></p>
<p>The first stage in this process, then, is making sure the data is in the right format to be interrogated. Quite often this will be a spreadsheet or CSV (comma-separated values) file. If your information is in a PDF you will not be able to do a great deal with it other than re-type the values into a new spreadsheet (making sure to check you have not made any errors). A Word or Powerpoint document is likely to require the same work.</p>
<p>If the information is already online you can sometimes &#8216;scrape&#8217; it &#8211; that is, automatically copy the relevant information into a separate document. How easy this is to do depends on how structured the information is. A table in a Wikipedia entry, for example, can be &#8216;scraped&#8217; into a Google spreadsheet relatively easily (Tony Hirst gives instructions on how to do this at <a href="http://ouseful.wordpress.com/2008/10/14/data-scraping-wikipedia-with-google-spreadsheets/" onclick="urchinTracker('/outgoing/ouseful.wordpress.com/2008/10/14/data-scraping-wikipedia-with-google-spreadsheets/?referer=');">http://ouseful.wordpress.com/2008/10/14/data-scraping-wikipedia-with-google-spreadsheets/</a>) and an online CSV file and certain other structured data <a href="http://www.daybarr.com/blog/2007/12/11/yahoo-pipes-tutorial-an-example-using-the-fetch-page-module-to-make-a-web-scraper" onclick="urchinTracker('/outgoing/www.daybarr.com/blog/2007/12/11/yahoo-pipes-tutorial-an-example-using-the-fetch-page-module-to-make-a-web-scraper?referer=');">can be scraped with Yahoo! Pipes</a> (see below for more on using Yahoo! Pipes) but most scraping will involve programming (the online tool <a href="http://scraperwiki.com/" onclick="urchinTracker('/outgoing/scraperwiki.com/?referer=');">ScraperWiki </a>provides one environment to help you do this).</p>
<p><strong>Insert: Cleaning up data</strong></p>
<p>Whether you have been given data, had to scrape it, or copied it manually, you will probably need to clean it up. All sorts of things can &#8216;dirty&#8217; your data, from misspellings and variations in spelling, to odd punctuation, mixtures of numbers and letters, unnecessary columns or rows, and more. Computers, for example, will see &#8216;New Town&#8217;, &#8216;Newtown&#8217; and &#8216;newtown&#8217; as three separate towns when they may be one.</p>
<p>This can cause problems later on when analysing your data &#8211; for example, calculations not working or results not being accurate.</p>
<p>Some tips for cleaning your data include:</p>
<ul>
<li>Use a spellchecker to check for misspellings. You will probably have to add some words to the computer&#8217;s dictionary.</li>
<li>Use &#8216;find and replace&#8217; (normally in the Edit menu) to remove double-spaces and other common punctuation errors. Alternatively, if you are in Excel you can create a new column and use the =TRIM() function, which will copy the contents of the cell in the brackets and remove any spaces.</li>
<li>Remove duplicate entries &#8211; if you are using Excel there are a few ways to do this under the Data tab &#8211; search for duplicates in Help.</li>
</ul>
<p>For more tips on Excel specifically see this guide: <a href="http://office.microsoft.com/en-us/excel/HA102218401033.aspx" onclick="urchinTracker('/outgoing/office.microsoft.com/en-us/excel/HA102218401033.aspx?referer=');">http://office.microsoft.com/en-us/excel/HA102218401033.aspx</a></p>
<p>For cleaning up very large sets of data you might want to <a href="http://onlinejournalismblog.com/2010/11/11/data-cleaning-tool-relaunches-freebase-gridworks-becomes-google-refine/">use a data cleaning tool like Google Refine</a>.</p>
<h2>Spotting the story</h2>
<p>Once your data is cleaned you can start to look for anything newsworthy in it. There are some obvious places to start: if you are dealing with numbers, for example, you can work out what the &#8216;average&#8217; is (the average bonus paid to council employees, for example). Similarly, you might look for the term which appears most often (e.g. the most common reason given for arresting terrorist suspects).</p>
<p>However, Kaiser Fung, a statistician whose blog <a href="http://junkcharts.typepad.com/" onclick="urchinTracker('/outgoing/junkcharts.typepad.com/?referer=');">Junk Charts</a> is essential reading on the field, notes the dangers in lazily reaching for the average when you want to make an editorial point:</p>
<blockquote><p>&#8220;Averaging stamps out diversity, reducing anything to its simplest terms. In so doing, we run the risk of oversimplifying, of forgetting the variations around the average. Hitching one&#8217;s attention to these variations rather than the average is a sure sign of maturity in statistical thinking. One can, in fact, define statistics as the study of the nature of variability. How much do things change? How large are these variations? What causes them?&#8221; (<a href="http://astore.amazon.co.uk/onlijourblog-21/detail/0071626530" onclick="urchinTracker('/outgoing/astore.amazon.co.uk/onlijourblog-21/detail/0071626530?referer=');">Fung, 2010, p4</a>)</p></blockquote>
<p>So while averages and modes (a mode is the number or term which appears most often) can be interesting discoveries, they should most often be used as a starting point for more illuminating investigation &#8211; normally involving leaving your computer to make phonecalls and speak to sources.</p>
<p>If you are looking at data over time, you can look to see what has increased over that period, or decreased &#8211; or disappeared.</p>
<p>But you will need to gather further data to provide context to your figures. If, for example, more council staff are receiving bonuses, is that simply because more staff have been employed? How much is spent on wages, and how do your figures compare? If you are comparing one city with another, understand how their populations differ &#8211; not just in aggregate, but in relevant details such as age, ethnicity, life expectancy, etc. You will need to know where to access basic statistics like these &#8211; the National Statistics website is often a good place to start.</p>
<p>Sometimes a change in the way data is gathered or categorised can produce a dramatic change in the data itself. In one example, designer Adrian Short obtained information (via an FOI request) on parking tickets from Transport for London that showed the numbers of tickets issued against a particular offence plummeted from around 8,000 to 8 in the space of one month (<a href="http://www.guardian.co.uk/news/datablog/2009/apr/29/transport-london-parking-tickets" onclick="urchinTracker('/outgoing/www.guardian.co.uk/news/datablog/2009/apr/29/transport-london-parking-tickets?referer=');">Arthur, 2009</a>). Had people suddenly stopped committing that parking offence, or was there another explanation? A quick phonecall to Transport for London revealed that traffic wardens were issued with new handsets around the same time. Guardian journalist Charles Arthur hypothesised:</p>
<blockquote><p>&#8220;Could it be that s46 [another offence which had a steep rise at the same time] is the default on the screen to issue a new ticket, and that wardens don&#8217;t bother to change it? Whatever it is, there&#8217;s a serious problem for TfL if those aren&#8217;t all s46 offences which have been ticketed since August 2006. Because if the ticket isn&#8217;t written out to the correct offence, then the fine isn&#8217;t payable. Theoretically, TfL might have to pay back millions in traffic fines for people who have been ticketed for s46 offences when they were actually committing s25 or s30 offences.&#8221;</p></blockquote>
<p>This particular story came about at least in part because that information was easy to visualise.</p>
<p><em>The <a href="http://onlinejournalismblog.com/2010/04/28/data-journalism-pt3-visualising-data-comments-wanted/">next section covers visualisation</a>. In the meantime, once again I’d really appreciate any additions or comments – particularly around ways of spotting stories in data, and mistakes to avoid.</em></p>
<iframe src="http://www.facebook.com/plugins/like.php?href=http%3A%2F%2Fonlinejournalismblog.com%2F2010%2F04%2F26%2Fdata-journalism-pt2-interrogating-data%2F&amp;layout=standard&amp;show_faces=true&amp;width=450&amp;action=like&amp;colorscheme=light&amp;height=80" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:450px; height:80px;" allowTransparency="true"></iframe>]]></content:encoded>
			<wfw:commentRss>http://onlinejournalismblog.com/2010/04/26/data-journalism-pt2-interrogating-data/feed/</wfw:commentRss>
		<slash:comments>18</slash:comments>
		</item>
		<item>
		<title>Every news organisation should have a Datastore</title>
		<link>http://onlinejournalismblog.com/2009/06/15/every-news-organisation-should-have-a-datastore/</link>
		<comments>http://onlinejournalismblog.com/2009/06/15/every-news-organisation-should-have-a-datastore/#comments</comments>
		<pubDate>Mon, 15 Jun 2009 09:50:58 +0000</pubDate>
		<dc:creator>Paul Bradshaw</dc:creator>
				<category><![CDATA[online journalism]]></category>
		<category><![CDATA[Charles Arthur]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[datastore]]></category>
		<category><![CDATA[Guardian]]></category>

		<guid isPermaLink="false">http://onlinejournalismblog.com/?p=2818</guid>
		<description><![CDATA[You may know about The Guardian&#8217;s Datastore: a compilation of &#8220;publicly-available data for you to use free&#8221; that&#8217;s been around for a few months now. You know the sort of thing: university tables; MPs&#8217; expenses; tax paid by the FTSE 100. It has already produced some great work from what I once described as the &#8220;Technician&#8221; variant of distributed journalism.  But a column by<br /><span class="read_more"><a href="http://onlinejournalismblog.com/2009/06/15/every-news-organisation-should-have-a-datastore/">Read more...</a></span>]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fonlinejournalismblog.com%2F2009%2F06%2F15%2Fevery-news-organisation-should-have-a-datastore%2F" onclick="urchinTracker('/outgoing/api.tweetmeme.com/share?url=http_3A_2F_2Fonlinejournalismblog.com_2F2009_2F06_2F15_2Fevery-news-organisation-should-have-a-datastore_2F&amp;referer=');"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fonlinejournalismblog.com%2F2009%2F06%2F15%2Fevery-news-organisation-should-have-a-datastore%2F&amp;style=normal&amp;b=2" height="61" width="50" /><br />
			</a>
		</div>
<p><img src="http://static.guim.co.uk/sys-images/Guardian/Pix/pictures/2009/6/9/1244556652985/games-music-dvds.png" alt="" /></p>
<p>You may know about <a href="http://www.guardian.co.uk/data-store" onclick="urchinTracker('/outgoing/www.guardian.co.uk/data-store?referer=');">The Guardian&#8217;s Datastore</a>: a compilation of &#8220;publicly-available data for you to use free&#8221; that&#8217;s been around for a few months now. You know the sort of thing: <a href="http://www.guardian.co.uk/news/datablog/2009/jun/02/universityguide-choosingadegree" onclick="urchinTracker('/outgoing/www.guardian.co.uk/news/datablog/2009/jun/02/universityguide-choosingadegree?referer=');">university tables;</a> <a href="http://www.guardian.co.uk/news/datablog/2009/may/08/mps-expenses-houseofcommons" onclick="urchinTracker('/outgoing/www.guardian.co.uk/news/datablog/2009/may/08/mps-expenses-houseofcommons?referer=');">MPs&#8217; expenses</a>; <a href="http://www.guardian.co.uk/business/tax-gap-blog/2009/jan/30/1" onclick="urchinTracker('/outgoing/www.guardian.co.uk/business/tax-gap-blog/2009/jan/30/1?referer=');">tax paid by the FTSE 100</a>.</p>
<p>It has already produced <a href="http://groups.google.com/group/guardian-api-talk/browse_thread/thread/b1afd5500ad47eb2?pli=1" onclick="urchinTracker('/outgoing/groups.google.com/group/guardian-api-talk/browse_thread/thread/b1afd5500ad47eb2?pli=1&amp;referer=');">some</a> <a href="http://ouseful.wordpress.com/2009/04/02/visualising-mps-expenses-using-scatter-plots-charts-and-maps/" onclick="urchinTracker('/outgoing/ouseful.wordpress.com/2009/04/02/visualising-mps-expenses-using-scatter-plots-charts-and-maps/?referer=');">great</a> <a href="http://onlinejournalismblog.com/2009/05/19/using-google-spreadsheets-as-a-database-no-it-really-is-very-interesting-honest/">work</a> from what I <a href="http://onlinejournalismblog.com/2007/10/02/a-model-for-the-21st-century-newsroom-pt2-distributed-journalism/">once described as the &#8220;Technician&#8221; variant of distributed journalism</a>. </p>
<p>But <a href="http://www.guardian.co.uk/technology/2009/jun/11/charles-arthur-filesharing-piracy" onclick="urchinTracker('/outgoing/www.guardian.co.uk/technology/2009/jun/11/charles-arthur-filesharing-piracy?referer=');">a column by Charles Arthur</a> recently was the first example I&#8217;ve seen of Datastore being used for, well, more ordinary data &#8211; the sort of information journalists deal with every week. Here&#8217;s how it appeared in print:</p>
<blockquote><p>&#8220;I dug up the figures from the UK music industry: the British record industry&#8217;s trade association (the BPI), and the UK games industry (via its trade body, Elspa) as well as the DVD industry (through the UK Film Council and the British Video Association). The results are over on the Guardian Data Store (<a href="http://bit.ly/data01" onclick="urchinTracker('/outgoing/bit.ly/data01?referer=');">http://bit.ly/data01</a>), because they are the sort of numbers that should be available to everyone to chew over.</p>
<p>&#8220;What did I find? Total spending has grown – but music spending is being squeezed. The games industry – hardware and <a href="http://www.guardian.co.uk/technology/software" onclick="urchinTracker('/outgoing/www.guardian.co.uk/technology/software?referer=');">software</a> – has grown from £1.4bn in 1999 (the year Napster started, and the music business stood rabbit-transfixed) to £4.04bn in 2008. That&#8217;s 12% annual compound growth. You&#8217;d kill for an endowment like that. Even DVD sales and rental take a £2.5bn bite out of consumers&#8217; available funds, double that of 1999.</p>
<p>&#8220;So the music industry&#8217;s deadliest enemy isn&#8217;t filesharing – it&#8217;s the likes of Nintendo, Microsoft and Sony, and a zillion games publishers.&#8221;</p></blockquote>
<p>That link (which frustratingly isn&#8217;t active in the online article) takes you to a Datablog post by Arthur which in turn links to a rather simple spreadsheet. </p>
<p>And it&#8217;s the simplicity that I think is important.</p>
<p>It&#8217;s one thing to link to huge datasets that benefit from <a href="http://en.wikipedia.org/wiki/Linus's_Law" onclick="urchinTracker('/outgoing/en.wikipedia.org/wiki/Linus_s_Law?referer=');">lots of eyeballs</a> looking for stories, or perming the data in different ways.</p>
<p>But it&#8217;s something else to link to the more everyday figures journalists deal with; to <strong>show your sums</strong>, in short.</p>
<p>Is this a natural extension of the blogging culture of linking to your sources? I think it is. And the more journalists get used to publishing their work on the likes of Google Spreadsheets, the better journalism we will get.</p>
<p>So why aren&#8217;t more journalists doing it? And why aren&#8217;t more news organisations providing a place for them to do it? Or are they? I&#8217;d love to know of any other individual or organisational examples.</p>
<iframe src="http://www.facebook.com/plugins/like.php?href=http%3A%2F%2Fonlinejournalismblog.com%2F2009%2F06%2F15%2Fevery-news-organisation-should-have-a-datastore%2F&amp;layout=standard&amp;show_faces=true&amp;width=450&amp;action=like&amp;colorscheme=light&amp;height=80" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:450px; height:80px;" allowTransparency="true"></iframe>]]></content:encoded>
			<wfw:commentRss>http://onlinejournalismblog.com/2009/06/15/every-news-organisation-should-have-a-datastore/feed/</wfw:commentRss>
		<slash:comments>7</slash:comments>
		</item>
		<item>
		<title>&#8216;Journalists: learn to code&#8217; says Guardian&#8217;s Arthur</title>
		<link>http://onlinejournalismblog.com/2009/01/20/journalists-learn-to-code-says-guardians-arthur/</link>
		<comments>http://onlinejournalismblog.com/2009/01/20/journalists-learn-to-code-says-guardians-arthur/#comments</comments>
		<pubDate>Tue, 20 Jan 2009 09:22:19 +0000</pubDate>
		<dc:creator>Paul Bradshaw</dc:creator>
				<category><![CDATA[online journalism]]></category>
		<category><![CDATA[Charles Arthur]]></category>
		<category><![CDATA[coding]]></category>
		<category><![CDATA[Guardian]]></category>
		<category><![CDATA[journalism education]]></category>

		<guid isPermaLink="false">http://onlinejournalismblog.com/?p=1997</guid>
		<description><![CDATA[Charles Arthur of The Guardian makes his point pretty plain: &#8220;If I had one piece of advice to a journalist starting out now, it would be: learn to code&#8221; &#8220;Let’s be clear that I’m not saying “code” as in “get deep into C++ or Java&#8221; &#8230; I mean it in the sense of having a nodding acquaintance with methods of<br /><span class="read_more"><a href="http://onlinejournalismblog.com/2009/01/20/journalists-learn-to-code-says-guardians-arthur/">Read more...</a></span>]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fonlinejournalismblog.com%2F2009%2F01%2F20%2Fjournalists-learn-to-code-says-guardians-arthur%2F" onclick="urchinTracker('/outgoing/api.tweetmeme.com/share?url=http_3A_2F_2Fonlinejournalismblog.com_2F2009_2F01_2F20_2Fjournalists-learn-to-code-says-guardians-arthur_2F&amp;referer=');"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fonlinejournalismblog.com%2F2009%2F01%2F20%2Fjournalists-learn-to-code-says-guardians-arthur%2F&amp;style=normal&amp;b=2" height="61" width="50" /><br />
			</a>
		</div>
<p>Charles Arthur of The Guardian makes his point pretty plain: &#8220;<a title="Permanent Link: If I had one piece of advice to a journalist starting out now, it would be: learn to code" rel="bookmark" href="http://www.charlesarthur.com/blog/?p=1098" onclick="urchinTracker('/outgoing/www.charlesarthur.com/blog/?p=1098&amp;referer=');">If I had one piece of advice to a journalist starting out now, it would be: learn to code&#8221;</a></p>
<blockquote><p>&#8220;Let’s be clear that I’m not saying “code” as in “get deep into C++ or Java&#8221; &#8230; I mean it in the sense of having a nodding acquaintance with methods of programming, and perhaps a few languages, so that when something comes along where you’ll need, say, to transform data from one form to another, you can. Or where you need to make your own life easier by automating some process or other.</p>
<p>&#8221; &#8230; None of which is saying you shouldn’t be talking to your sources, and questioning what you’re told, and trying to find other means of finding stuff out from people. But nowadays, computers are a sort of primary source too. You’ve got to learn to interrogate them effectively &#8211; and quote them meaningfully &#8211; too.&#8221;</p></blockquote>
<p><a href="http://onlinejournalismblog.com/2008/12/04/model-for-the-21st-century-newsroom-pt6-new-journalists-for-new-information-flows/">Amen to that</a>.</p>
<iframe src="http://www.facebook.com/plugins/like.php?href=http%3A%2F%2Fonlinejournalismblog.com%2F2009%2F01%2F20%2Fjournalists-learn-to-code-says-guardians-arthur%2F&amp;layout=standard&amp;show_faces=true&amp;width=450&amp;action=like&amp;colorscheme=light&amp;height=80" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:450px; height:80px;" allowTransparency="true"></iframe>]]></content:encoded>
			<wfw:commentRss>http://onlinejournalismblog.com/2009/01/20/journalists-learn-to-code-says-guardians-arthur/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

