Archive for January, 2012

Twitter’s ‘censorship’ is nothing new – but it is different

Over the weekend thousands of Twitter users boycotted the service in protest at the announcement that the service will begin withholding tweets based on the demands of local governments and law enforcement. Protesting against censorship is laudable, but it is worth pointing out that most online services already do the same, whether it’s Google’s Orkut; Apple removing apps from its
Read more…

A lesson in UGC, copyright, and the law (again)

Terence Eden filmed the above video demonstrating O2’s phone security flaw. He put it on YouTube with the standard copyright licence. And someone at Sky News ignored that when they used it without permission. But what’s interesting about Terence’s blog post about the experience is the legal position that Sky then negotiated from – an experience that journalism students, journalists
Read more…

The £10,000 question: who benefits most from a tax threshold change?

UPDATE [Feb 14 2012]: Full Fact picked up the challenge and dug into the data: “The crucial difference is in methodology – while the TPA used individuals as its basis, the IFS used households as provided by the Government data. “This led to substantially different conclusions. The IFS note that using household income as a measure demonstrates increased gains for
Read more…

A new Scottish datablog (and a treemap in Liverpool)

The Scotsman has a newish data blog, set up (I’m rather proud to say) by one of my former PA/Telegraph trainees: Jennifer O’Mahony. This is particularly important as so much data covered in the ‘national’ press tends to be English-only due to devolution. The Department for Education, for example, only publishes English education data. If you want Scottish education data you need
Read more…

Word cloud or bar chart?

One of the easiest ways to get someone started on data visualisation is to introduce them to word clouds (it also demonstrates neatly how not all data is numerical). Using tools like Wordle and Tagxedo, you can paste in a major speech and see it visualised within a minute or so. But is a word cloud the best way of
Read more…

Report: Social Media and News

Last year I was commissioned to write a report on ‘Social Media and News’ for the Open Society Media Program, as part of the ‘Mapping Digital Media’ series. The report is now available here (PDF). As I say in the introduction, I focused on “the areas that are most strongly contested and hold the most importance for the development of news
Read more…

Data journalism awards

Yesterday saw the launch of the first (surprisingly) international data journalism awards, backed by the European Journalism Centre*, Google, and the Global Editors Network. There are 6 awards – 3 categories, each split into national/international and local/regional subcategories: investigative journalism; visualisation; and apps. Each comes with prize money of 7,500 euros. The closing date for entries is April 10. It’s
Read more…

Comment call: Objectivity and impartiality – a newsroom policy for student projects

I’ve been updating a newsroom policy guide for a project some of my students will be working on, with a particular section on objectivity and impartiality. As this has coincided with the debate on fact-checking stirred by the New York Times public editor Arthur Brisbane, I thought I would reproduce the guidelines here, and invite comments on whether you think it hits
Read more…

Sockpuppetry and Wikipedia – a PR transparency project

Last month you may have read the story of lobbyists editing Wikipedia entries to remove criticism of their clients and smear critics. The story was a follow-up to an undercover report by the Bureau of Investigative Journalism and The Independent on claims of political access by Bell Pottinger, written as a result of investigations by SEO expert Tim Ireland. Ireland was particularly interested
Read more…

SFTW: Scraping data with Google Refine

For the first Something For The Weekend of 2012 I want to tackle a common problem when you’re trying to scrape a collection of webpages: they have some sort of structure in their URL like this, where part of the URL refers to the name or code of an entity:

http://www.ltscotland.org.uk/scottishschoolsonline/schools/freemealentitlement.asp?iSchoolID=5237521
http://www.ltscotland.org.uk/scottishschoolsonline/schools/freemealentitlement.asp?iSchoolID=5237629
http://www.ltscotland.org.uk/scottishschoolsonline/schools/freemealentitlement.asp?iSchoolID=5237823

In this instance, you can see that
Read more…
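
The URL pattern in the scraping excerpt above can also be illustrated outside Google Refine. What follows is a minimal Python sketch, not the method the post itself goes on to describe: it only builds and parses the URLs and does not fetch any pages. The base address, the `iSchoolID` parameter and the three IDs come from the excerpt; the function names `build_school_url` and `extract_school_id` are hypothetical, made up for this illustration.

```python
from urllib.parse import urlencode, urlparse, parse_qs

# Base address taken from the example URLs in the excerpt.
BASE_URL = (
    "http://www.ltscotland.org.uk/scottishschoolsonline/"
    "schools/freemealentitlement.asp"
)


def build_school_url(school_id):
    """Return the page URL for one school code (hypothetical helper)."""
    return BASE_URL + "?" + urlencode({"iSchoolID": school_id})


def extract_school_id(url):
    """Pull the school code back out of a URL (hypothetical helper)."""
    return parse_qs(urlparse(url).query)["iSchoolID"][0]


# The three codes that appear in the excerpt's example URLs.
school_ids = ["5237521", "5237629", "5237823"]

# One URL per code: only the part after "iSchoolID=" changes.
urls = [build_school_url(school_id) for school_id in school_ids]

for url in urls:
    print(extract_school_id(url), url)
```

Given a fuller list of school codes, the same list comprehension would produce one URL per school, which is the list a scraper then works through.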