Tag Archives: data journalism

The 10 most-read posts (and one page) on the Online Journalism Blog in 2014

The last 2 months of 2014 saw a return to regular blogging after some quiet periods earlier in the year

2014 was the 10th anniversary of the Online Journalism Blog, so I thought I’d better begin keeping track of what each year’s most-read posts were.

In 2014 the overriding themes for this blog were programming for journalists, web security, and social media optimisation. Here are the most-read posts of the year, plus one surprisingly popular new page, with some background and updates.

“I haven’t got time” is not acceptable when it comes to basic data techniques

Picking apart the time you spend on things can identify false economies. Image by Vittorio Pandolfi

Yesterday I spoke at the BBC Data Day: an event bringing together people at the BBC interested in data-related issues, techniques and tools. During the question and answer session following my talk one person mentioned a common reason why he wasn’t using data journalism techniques:

“I haven’t got the time.”

For some reason, this time the phrase made me bristle. And later I realised why.

A journalist wouldn’t get away with saying they “hadn’t got the time” to get a response quote.

A journalist wouldn’t get away with saying they “hadn’t got the time” to get the background to a story.

A journalist wouldn’t get away with saying they “hadn’t got the time” to check a key fact.

FAQ: Do you need new ethics for computational journalism?

This latest post in the FAQ series answers questions posed by a student in Belgium regarding ethics and data journalism.

Q: Do ethical issues in the practice of computational journalism differ from those of “traditional” journalism?

No, I don’t think they do particularly – any more than ethics in journalism differ from ethics in life in general. However, as in journalism versus life, there are areas which attract more attention because they are the places we find the most conflict between different ethical demands.

For example, the tension between public interest and an individual’s right to privacy is a general ethical issue in journalism, but one which has particular salience in data journalism, where you’re dealing with data which names individuals.

I wrote about this in a book chapter which I’ve published in parts on the blog.

How to: combine multiple rows in a dataset where text is split across them (Open Refine)

When you’ve converted data from a PDF to a spreadsheet it’s not uncommon for text to end up being split across multiple rows.

In this post I’ll explain how you can use Open Refine to quickly clean the data up so that the text is put back together and you have a single row for each entry.
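
To give a rough idea of what that clean-up involves, here is a minimal sketch in Python rather than Open Refine. It is not the method from the full post: it assumes (hypothetically) that each genuine entry has a value in its first column, and that the overflow rows created by the PDF conversion leave that column blank. The function name, column positions and sample data are all invented for illustration.

```python
def merge_split_rows(rows, key_col=0, text_col=1):
    """Join continuation rows (blank key cell) onto the preceding full row.

    Assumption: a row whose key cell is empty is overflow text that
    belongs to the most recent row with a non-empty key cell.
    """
    merged = []
    for row in rows:
        is_continuation = not row[key_col].strip()
        if is_continuation and merged:
            fragment = row[text_col].strip()
            if fragment:
                # Append the overflow text to the previous entry's text cell.
                merged[-1][text_col] = merged[-1][text_col].rstrip() + " " + fragment
        else:
            # A new entry (or a stray first row): copy it so the input is untouched.
            merged.append(list(row))
    return merged


# Invented sample data: two entries whose descriptions spill onto extra rows.
rows = [
    ["1", "Payment for consultancy"],
    ["", "services, March"],
    ["2", "Office supplies"],
    ["", "and stationery"],
    ["", "(annual order)"],
]

for row in merge_split_rows(rows):
    print(row)
```

If your data marks overflow rows differently (for example, with a blank date column instead), the `key_col` argument is the part you would need to change.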

This simple piece of visualisation will have you rethinking what you know about impact and mobile

It’s not often I encounter a piece of data journalism which solves a common problem in the field – and it’s even rarer to find a piece of work which tackles two.

But that’s just what lean data journalism site Ampp3d did last week when it published a piece of visualisation on the deaths of construction workers in Qatar.

The two problems? Creating impact on mobile – and making big numbers meaningful.

Finding Stories in Spreadsheets – ebook now live!

Cover design by Matt Buck/Drawnalism

My latest ebook – Finding Stories in Spreadsheets – is now live on Leanpub.

As with Scraping for Journalists, I’m publishing the book week-by-week so that it can be updated based on reader feedback, user suggestions and topical developments.

Each week you can download a new chapter covering a different technique for finding stories, from calculating proportions and changes, to combining data, cleaning it up, testing it, and extracting specific details.

There’s also a downloadable spreadsheet at the end of each chapter with a series of exercises to practise that chapter’s technique and find particular stories.

Along the way I tackle some other considerations in telling the story, such as context and background, and the importance of being specific in the language that you use.

If there’s anything you’d like covered in the book let me know. You can also buy the book in a ‘bundle’ with its sister title Data Journalism Heist, which covers quick-turnaround techniques for finding stories in spreadsheets using pivot tables and advanced filters.

Unicorns, racehorses – and a mule cameo: data journalism in 2014

Recently it has felt like data journalism might finally be taking a step forward after years spent treading water. I’ve long said that the term ‘data journalism’ was too generic for work that includes practices as diverse as scraping, data visualisation, web interactives, and FOI. But now, in 2014, it feels like different practitioners are starting to find their own identity.

It starts with the unicorn.