Category Archives: data journalism

9 takeaways from the Data Journalism UK conference

Attendees in a lecture theatre with 'data and investigative journalism conference 2025 BBC Shared Data Unit' on the screen.

Last month the BBC’s Shared Data Unit held its annual Data and Investigative Journalism UK conference at the home of my MA in Data Journalism, Birmingham City University. Here are some of the highlights…

Continue reading →

FAQ: La Nacion interview in English

Leave a reply

Paul Bradshaw: “La falta de datos puede ser la historia en sí misma”
Periodista británico y docente, propone mirar donde otros no miran: los silencios estadísticos, los vacíos oficiales y las voces excluidas pueden ser claves para revelar problemas sistémicos

Earlier this month I was interviewed for a feature about data journalism in the Argentina newspaper La Nacion. Here are the full questions and answers, in English, published as part of the FAQ series.

Continue reading →

How do I get data if my country doesn’t publish any?

1 Reply

Spotlight photo by Paul Green on Unsplash

In many countries public data is limited, and access to data is either restricted, or information provided by the authorities is not credible. So how do you obtain data for a story? Here are some techniques used by reporters around the world.

Continue reading →

De vanligste vinklene journalister bruker når de forteller historier med data

Leave a reply

7 vanlige vinkler for datahistorier_
Omfang, Veksling, Rangering, Variasjon, Utforske, Relasjoner, Dårlig/åpne, + saker

Datadrevet historiefortelling kan deles i syv hovedkategorier ifølge en analyse av 200 artikler. I den første av to poster vil jeg demonstrere de fire mest brukte vinklene i nyhetshistorier, hvordan de kan gi deg flere muligheter som reporter, og hvordan de kan hjelpe deg med å arbeide mer effektivt med data.

De fleste datasett kan fortelle mange historier — så mange at det for noen kan virke overveldende eller forstyrrende. Å identifisere hvilke historier som er mulige, og å velge den beste historien innenfor den tiden og de ferdighetene du har tilgjengelig, er en viktig redaksjonell ferdighet.

Mange nybegynnere innen datajournalistikk søker ofte først etter historier om sammenhenger (årsak og virkning) — men disse historiene er vanskelig og tidkrevende. Du kan ønske å fortelle en historie om ting som blir verre eller bedre — men mangle dataene for å fortelle den. Hvis du har svært liten tid og vil komme i gang med datajournalistikk, er de raskeste og enkleste historiene du kan fortelle med data, historier om omfang.

Continue reading →

Niamh McIntyre: tips from a career in data journalism

1 Reply

The Bureau of Investigative Journalism’s Big Tech Reporter Niamh McIntyre has been working with data for eight years — but it all stemmed from an “arbitrary choice” at university. She spoke to MA Data Journalism student Leyla Reynolds about how she got started in the field, why you don’t need to be a maths whizz to excel, and navigating the choppy waters of the newsroom.

Starting out on any new path can be daunting, but in the minutes before my phone call with Niamh McIntyre, I’m acutely aware that upping sticks to Birmingham and training in data journalism at the grand old age of 29 is nothing less than a tremendous luxury.

A younger me might have — would have — quaked at such a scenario, so I’m keen to know more about Niamh’s work, which ranges from investigating the gig work industry to private children’s homes.

Continue reading →

VIDEO: Developing ideas for factual storytelling

Leave a reply

Strong factual storytelling relies on good idea development. In this video, part of a series of video posts made for students on the MA in Data Journalism at Birmingham City University, I explain how to generate good ideas by avoiding common mistakes, applying professional techniques and considering your audience.

The links mentioned in the video include:

Identifying bias in your writing — with generative AI

Leave a reply

Applications of genAI in the journalism process
Pyramid with the third 'Production' tier highlighted:

Identify jargon and bias; improve spelling, grammar, structure and brevity

In the latest in a series of posts on using generative AI, I look at how tools such as ChatGPT and Claude.ai can help help identify potential bias and check story drafts against relevant guidelines.

We are all biased — it’s human nature. It’s the reason stories are edited; it’s the reason that guidelines require journalists to stick to the facts, to be objective, and to seek a right of reply. But as the Columbia Journalism Review noted two decades ago: “Ask ten journalists what objectivity means and you’ll get ten different answers.”

Generative AI is notoriously biased itself — but it has also been trained on more material on bias than any human likely has. So, unlike a biased human, when you explicitly ask it to identify bias in your own reporting, it can perform surprisingly well.

It can also be very effective in helping us consider how relevant guidelines might be applied to our reporting — a checkpoint in our reporting that should be just as baked-in as the right of reply.

In this post I’ll go through some template prompts and tips on each. First, a recap of the rules of thumb I introduced in the previous post.

Continue reading →

Using generative AI as a scraping assistant

Leave a reply

Applications of genAI in the journalism process
Research
Pyramid with the second 'research' level highlighted: Scope diverse sources, explore documents, form advanced searches, and write/fix code for scraping and analysis — Scraping is part of the research phase of a project

In the fourth of a series of posts from a workshop at the Centre for Investigative Journalism Summer School (the first part covered idea generation; the second research; the third spreadsheets), I look at using generative AI tools such as ChatGPT and Google Gemini to help with scraping.

One of the most common reasons a journalist might need to learn to code is scraping: compiling information from across multiple webpages, or from one page across a period of time.

But scraping is tricky: it requires time learning some coding basics, and then further time learning how to tackle the particular problems that a specific scraping task involves. If the scraping challenge is anything but simple, you will need help to overcome trickier obstacles.

Large language models (LLMs) like ChatGPT are especially good at providing this help because writing code is a language challenge, and material about coding makes up a significant amount of the material that these models have been trained on.

This can make a big difference in learning to code: in the first year that I incorporated ChatGPT into my data journalism Masters at Birmingham City University I noticed that students were able to write more advanced scrapers earlier than previously — and also that students were less likely to abandon their attempts at coding.

You can also start scraping pretty quickly with the right prompts (Google Colab allows you to run Python code within Google Drive). Here are some tips on how to do so…

Continue reading →

Using generative AI as a spreadsheet and data cleaning assistant

1 Reply

Generative AI tools like ChatGPT and Gemini can be a big help when dealing with data in spreadsheets. In this third of a series of posts from a workshop at the Centre for Investigative Journalism Summer School (the first part covered idea generation; the second research), I outline tips and techniques for using those tools to help with spreadsheet formulae and reshaping data.

Whether you come across data as part of story research, or compile data yourself, chances are that at some point you will need to write a formula to ask questions of that data, or make it possible to ask questions (such as creating a column which extracts data from another).

If you find yourself coming up against the limits of your spreadsheet knowledge, then genAI tools can be useful both in breaking through those — while expanding your knowledge of functions and formula writing.

Writing spreadsheet formulae with ChatGPT or other genAI tools

Continue reading →

Investigative journalism and ChatGPT: using generative AI for sourcing and story research

Leave a reply

In the second of a series of posts from a workshop at the Centre for Investigative Journalism Summer School (read the first part on idea generation here), I look at using generative AI tools such as ChatGPT and Google Gemini to improve sourcing and story research.

Research is arguably the second-highest risk area (after content generation) for using generative AI within journalism. The most obvious reason for this is AI’s ability to make things up (“hallucinate“) — but there are other reasons too.

Continue reading →

Online Journalism Blog

Comment, analysis and links covering online journalism and online news, citizen journalism, blogging, vlogging, photoblogging, podcasts, vodcasts, interactive storytelling, publishing, Computer Assisted Reporting, User Generated Content, searching and all things internet.

Category Archives: data journalism

9 takeaways from the Data Journalism UK conference

FAQ: La Nacion interview in English

How do I get data if my country doesn’t publish any?

De vanligste vinklene journalister bruker når de forteller historier med data

Niamh McIntyre: tips from a career in data journalism

VIDEO: Developing ideas for factual storytelling

Identifying bias in your writing — with generative AI

Using generative AI as a scraping assistant

Using generative AI as a spreadsheet and data cleaning assistant

Writing spreadsheet formulae with ChatGPT or other genAI tools

Investigative journalism and ChatGPT: using generative AI for sourcing and story research