Category Archives: data journalism

2 how-tos: researching people and mapping planning applications

Mapping planning applications

Sid Ryan’s planning applications map

Sid Ryan wanted to see if planning applications near planning committee members were more or less likely to be accepted. In two guest posts on Help Me Investigate he shows how to research people online (in this case the councillors), and how to map planning applications to identify potential relationships.

The posts take in a range of techniques including:

  • Scraping using Scraperwiki and the Google Drive spreadsheet function importXML
  • Mapping in Google Fusion Tables
  • Registers of interests
  • Using advanced search techniques
  • Using Land Registry enquiries
  • Using Companies House and Duedil
  • Other ways to find information on individuals, such as Hansard, LinkedIn, 192.com, Lexis Nexis, whois and FriendsReunited

If you find it useful, please let me know – and if you can add anything… please do.

Motion graphic video workflow – a video tutorial

Motion graphics has become an increasingly popular way to present data in a compelling visual form. In a series of videos guest contributor Sihlangu Tshuma outlines his workflow process for managing a motion graphics video project, the results of which are shown at the end. All 13 videos are also available in this playlist.

1: Motion graphics introduction

2: Researching the project

3: Motion graphics treatments Continue reading

Notes on setting up a regional newspaper datablog

Behind the Numbers - Birmingham's regional datablog

I’ve been working recently with the Birmingham Mail to launch Behind The Numbersa new datablog project with Birmingham City University supported by Help Me Investigate. I’m told that it is probably the UK’s first regional newspaper datablog, although whether that’s a meaningful claim is debatable*.

The first story generated by the project – what is the worst time to be seen at A&E – was published in the newspaper a week ago. But it’s what happens next that’s going to be interesting. Continue reading

Is this an Excel killer? QueryTree app lowers the bar on data journalism

QueryTree

Sometimes the most impressive tools solve a problem you never knew you had. In the case of QueryTree, a new data analysis tool, that problem is something most people never question: spreadsheets.

For all the shiny-shiny copy-and-paste-click-and-drag-ness in new journalism tools, most data digging comes back to at least some simple spreadsheet work, and that represents a significant hurdle for many journalists used to working with simpler tools.

While interface design has undergone generations of improvement on the web, spreadsheet software interfaces have remained largely unchanged for decades.

So why did no one think to do this before?

QueryTree - how the drag and drop interface works

You only need 10 choices

Continue reading

FAQ: a review of 2012 with Data Driven Journalism.net

The Data Driven Journalism website asked me a few questions as part of their end-of-2012 roundup. You can find the article there, but for the sake of archiving, my responses are copied below (without the helpful pictures they added):

What do you do?

I’m a data journalism trainer and Iecturer. I run the MA in Online Journalism at Birmingham City University and am a visiting professor in online journalism at City University London. I’m also the author of Scraping for Journalists.

What was your biggest data driven achievement this year?

An investigation into the allocation of Olympic torchbearer places. The investigation came about as a result of scraping details on torchbearers from the official website. But it was also a great example of collaboration between non-journalists and journalists, as well as a number of techniques outside of core data journalism.

The investigation led to questions in Parliament and international media coverage. In the final week of the Olympic torch relay we published a short ebook about the affair, with all proceeds going to the Brittle Bone Society.

What was your favourite data journalism project this year and why?

I really liked Landportal.info, which is attempting to map land ownership – it’s highlighting a global trend of companies buying up land in Africa which would be easy to overlook by journalists. The New York Times’s multimedia treatment of performance data in three Olympic events across over a century was really well done. And I’m always looking at how data journalism can be used in softer news, where Anna Powell-Smith’s What Size Am I? is a great example of fashion/consumer data journalism.

For sheer significance I can’t avoid mentioning Nate Silver’s work on the US election – that was a watershed for data journalism and an embarrassment for many political pundits.

More broadly – what excites you in this field at the moment? Any interesting developments that you’d like to mention?

There’s a lot of consolidation at the moment, so less of the spectacular developments – but I am excited at how data journalism is being taken on by a wider range of companies. This year I’ve spent a lot more time training staff at consumer magazine publishers, for example.

I’m also excited about some of the new journalism startups based on public data like Rafat Ali’s Skift. In terms of tools, it’s great to see network analysis added to Fusion Tables, and the Knight Digital Media Center’s freeDive makes it very easy indeed to create a public database from a Google Doc.

What about disappointments?

I am constantly disappointed by publishers who say they don’t have the resources to do data journalism. That shows a real lack of imagination and understanding of what data journalism really is. It doesn’t have to be a spectacular interactive data visualisation – it can simply be about getting to better stories more quickly, accurately and more deeply through a few basic techniques.

Any predictions about what the future holds for data journalism in 2013?

I’ve just been training someone from Chile so I’m hoping to see more data journalism there!

Anything else you’d like to share with everyone?

Happy Christmas!

News:rewired – Interview with Nicolas Kayser-Bril

French data journalist Nicolas Kayser-Bril (and former OJB contributor) gave the keynote speech at news:rewired. He used to work for OWNI, but since 2011 has been the CEO of Journalism++, a start-up that ‘accompanies newsrooms in their transitions towards the web of data’.

During his presentation he tried to explain the first steps that anyone interested in this area should follow to start producing stories, like building a datastore.

After the speech we had a quick chat with him about the importance of introducing data in newsrooms, the situation in France (where he feels data journalism is very dynamic – “a lot of people are doing stuff, like in Liberátion or Le Monde”) and the skills that a journalist should have to get started. “There are some stories nowadays that require the use of data intensely,” he says. “Especially when it comes into public policies.”

“As a data journalist you need curiosity and the ability to teach yourself: the basic skills of any journalist.”