Category Archives: online journalism

Investigative journalism’s AI challenges: accuracy and bias, explainability and resources

screenshots of guidelines on AI

Having outlined the range of ways in which artificial intelligence has been applied to journalistic investigations in a previous post, some clear challenges emerge. In this second part of a forthcoming book chapter, I look at those challenges and other themes: from accuracy and bias to resources and explainability.

Continue reading

AI in investigative journalism: mapping the field

screenshots of various examples of AI being used in journalism, including Serenata de Amor, Leprosy of the Land and The Implant Files

Investigative journalists have been among the earliest adopters of artificial intelligence in the newsroom, and pioneered some of its most compelling — and award-winning — applications. In this first part of a draft book chapter, I look at the different branches of AI and how they’ve been used in a range of investigations.

Continue reading

VIDEO: How can virtual reality, 360 degree video and augmented reality be used for journalism — and what are the challenges?

Virtual reality and augmented reality have opened up a range of new opportunities for journalists and publishers — as well as new challenges.

In this video, made for students on the MA in Data Journalism and the MA in Media Production at Birmingham City University, I explain what types of stories and projects suit these technologies, what to consider when using them, and some useful techniques from those who have worked in the field.

The MA courses at BCU have an open day in June: you can register here.

You can find links to all of the examples used in the video on the YouTube page.

This video is shared as part of a series of video posts.

Explainers — explained (unpublished extract from the Online Journalism Handbook)

Explainers are one of the most widely used forms of ‘evergreen’ content. In this unpublished extract from the latest edition of the Online Journalism Handbook, removed due to word limit, I explore why they are so popular, what types of subject are suitable, and how explainers are structured.

Continue reading

VIDEO: Using ChatGPT and generative AI tools in journalism

A few months ago I delivered a webinar for the European Data Journalism Network and DataNinja about the range of ways that journalists can use ChatGPT and other generative AI tools — from idea generation and mapping systems to help with spelling and coding — and what issues they need to be aware of.

The video is now available online and you can watch it below.

VIDEO: How to plan an investigation or large editorial project

Planning an investigation, or any larger editorial project, raises its own particular challenges — but if you know where to look, you can find resources that are especially useful in anticipating and tackling those.

This video, made for students on the MA in Data Journalism at Birmingham City University, introduces and explores two such resources: Mark Lee Hunter‘s story-based inquiry method; and breaking down an investigation into five roles; . It also touches on issues to consider in undercover reporting or the use of subterfuge.

Further video clips of Mark Lee Hunter and Luuk Sengers are embedded below:

Continue reading

Olaya Argüeso: “Data journalism stories, when they are done, are really easy to implement on the local level”

Olaya Argueso Perez: fotocredit: Ivo Mayr, CORRECTIV2
Photo: Ivo Mayr, CORRECTIV.Europe

The CORRECTIV.Europe project, founded by German investigative media outlet Correctiv, aims to help local journalists publish data stories who wouldn’t otherwise have the time or money to do it. Cristina Puerta speaks to its editor-in-chief Olaya Argüeso.

“[CORRECTIV.Europe] is about giving the European citizens a feeling that they are on the same boat together”, editor-in-chief Olaya Argüeso explains.

Local journalism, she says, has been “neglected”, and it is now, when people suffer the consequences of global phenomena — for example, climate change because of flooding and droughts where they live — that they realise how important local journalism is.

News avoidance is at an all-time high, and while Argüeso feels breaking global problems down to a local level cannot be the solution, it can, she says, show citizens what they can do about those problems.

Continue reading

I’ve updated the Inverted Pyramid of Data Journalism — and brought together resources for every stage

Inverted pyramid of data journalism: conceive, compile, clean, context, combine (with 'question' throughout). Communicate: vis, narrate, humanise, personalise, audiolise/materialise, utilise

It’s over a decade since I published the Inverted Pyramid of Data Journalism. The model has been translated into multiple languages, taught all over the world, and included in a number of books and research papers. But in that time the model has also developed and changed through discussion and teaching, so here’s a round up of everything I’ve written or recommended on the different stages — along with a revised model in English (shown above; versions have been published in German, Spanish, Finnish, Russian and Ukrainian).

The most basic change to the Inverted Pyramid of Data Journalism is the recognition of a stage that precedes all others — idea generation — labelled ‘Conceive’ in the diagram above.

This is often a major stumbling block to people starting out with data journalism, and I’ve written a lot about it in recent years (see below for a full list).

The second major change is to make questioning more explicit as a process that (should) take place through all stages — not just in data analysis but in the way we question our sources, our ideas, and the reliability of the data itself.

A third change is to remove the ‘socialise‘ option from the communication pyramid: in conversation with Alexandra Stark I realised that this is covered sufficiently by the ‘utilise’ stage (i.e. making something useful socially).

Replacing that is a new communication option — in fact, two: audiolise and physicalise. This recognises the emergence of sonification as a method of communicating data, and physical methods of representing data from crochet to art installations.

Alongside the updated pyramid I’ve been using for the past few years I also wanted to round up links to a number of resources that relate to each stage. Here they are…

Continue reading

How to combine two datasets to put a story into context (book extract)

One of the most common challenges in a data-driven story is combining two sets of data — such as events and populations — to put a story into context. In an extract from the ebook Finding Stories in Spreadsheets, I explain how to use lookup functions to combine two tables. The longer ebook version of this tutorial includes a dataset and exercise to employ these techniques.

Combining data is often a great way of telling new stories about spreadsheets. For example: you may have one table showing pass rates for each school in an area, and another table showing their addresses. Combining these would allow you to identify geographical patterns, or to place them on a map.

You could also combine the addresses with poverty rates for different locations, or unemployment to see if there’s a possible relationship (remembering that correlation does not equal causation), or to identify the schools performing particularly well despite local conditions. In the video below, for example, I walk through an example of combining data on different sports teams’ attendances with data on their rankings, allowing you to see who’s attracting large crowds despite their poor performance.

The VLOOKUP function is one of the most widely-used tools in combining data in this way. It stands for Vertical lookup, and means that the spreadsheet will look up and down a column (i.e. vertically) for whatever you ask it. In more recent versions of Excel the XLOOKUP function has been introduced to make the process easier — but the process is similar for both.

Continue reading