Peruvian news organisation Convoca has launched an interactive tool to enable citizens to access environmental information related to the behaviour of Peruvian mining companies.
2014 was the 10th anniversary of the Online Journalism Blog, so I thought I’d better begin keeping track of what each year’s most-read posts were.
In 2014 the overriding themes for this blog were programming for journalists, web security, and social media optimisation. Here are the most-read posts of the year, plus one surprisingly popular new page with some background and updates.
Yesterday I spoke at the BBC Data Day: an event bringing together people at the BBC interested in data-related issues, techniques and tools. During the question and answer session following my talk one person mentioned a common reason why he wasn’t using data journalism techniques:
“I haven’t got the time.”
For some reason, this time the phrase made me bristle. And later I realised why.
A journalist wouldn’t get away with saying they “hadn’t got the time” to get a response quote.
A journalist wouldn’t get away with saying they “hadn’t got the time” to get the background to a story.
A journalist wouldn’t get away with saying they “hadn’t got the time” to check a key fact.
This latest post in the FAQ series answers questions posed by a student in Belgium regarding ethics and data journalism.
Q: Do ethical issues in the practice of computational journalism differ from those of “traditional” journalism?
No, I don’t think they do particularly – any more than ethics in journalism differ from ethics in life in general. However, as in journalism versus life, there are areas which attract more attention because they are the places we find the most conflict between different ethical demands.
For example, the tension between public interest and an individual’s right to privacy is a general ethical issue in journalism, but one which has particular salience in data journalism, where you’re often dealing with data which names individuals.
When you’ve converted data from a PDF to a spreadsheet it’s not uncommon for text to end up being split across multiple rows. In this post I’ll explain how you can use Open Refine to quickly clean the data up so that the text is put back together and you have a single row for each entry.
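The post itself uses Open Refine. As a rough illustration of the same clean-up idea in code, here is a minimal Python sketch. It rests on assumptions that are not from the post: that a row with an empty first cell is a continuation of the row above it, and that every row has the same number of columns. The function name `merge_split_rows` and the sample rows are invented for the example.

```python
def merge_split_rows(rows):
    """Merge continuation rows into the full row that precedes them.

    Assumption: a row whose first cell is empty continues the row above,
    and all rows have the same number of columns.
    """
    merged = []
    for row in rows:
        is_continuation = bool(merged) and not row[0].strip()
        if is_continuation:
            previous = merged[-1]
            for i, cell in enumerate(row):
                if cell.strip():
                    # Append the overflow text to the same column above.
                    previous[i] = (previous[i] + " " + cell.strip()).strip()
        else:
            merged.append([cell.strip() for cell in row])
    return merged


# Invented sample data: the second row is overflow text from the first entry.
rows = [
    ["1", "Entry A", "Failed to file its annual"],
    ["", "", "environmental report"],
    ["2", "Entry B", "No breaches recorded"],
]
cleaned = merge_split_rows(rows)
for row in cleaned:
    print(row)
```

If your data marks continuation rows differently (for example, with an empty last column rather than an empty first one), the `is_continuation` test is the only part that should need changing.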
It’s not often I encounter a piece of data journalism which solves a common problem in the field – and it’s even rarer to find a piece of work which tackles two.
But that’s just what lean data journalism site Ampp3d did last week when it published a visualisation on the deaths of construction workers in Qatar.
The two problems? Creating impact on mobile – and making big numbers meaningful.
My latest ebook – Finding Stories in Spreadsheets – is now live on Leanpub.
As with Scraping for Journalists, I’m publishing the book week-by-week so that it can be updated based on reader feedback, user suggestions and topical developments.
Each week you can download a new chapter covering a different technique for finding stories, from calculating proportions and changes, to combining data, cleaning it up, testing it, and extracting specific details.
There’s also a downloadable spreadsheet at the end of each chapter with a series of exercises to practise that chapter’s technique and find particular stories.
Along the way I tackle some other considerations in telling the story, such as context and background, and the importance of being specific in the language that you use.
If there’s anything you’d like covered in the book let me know. You can also buy the book in a ‘bundle’ with its sister title Data Journalism Heist, which covers quick-turnaround techniques for finding stories in spreadsheets using pivot tables and advanced filters.