Category Archives: online journalism

Words as data: how data journalists tell stories about documents and text

Documents and other collections of text can be goldmines for data journalism — if you know how to approach them as data. Here are some techniques and inspiration for your next data project.

From stories about political speech and song lyrics, to street names and social media chatter, data journalists now have a wide range of examples of text-as-data to draw inspiration and guidance from, while tools such as Pinpoint and NotebookLM are making text analysis easier than ever.

I compiled a list of over 200 pieces of data journalism where text or documents were used as sources. Quantification techniques ranged from counting the frequency of a single word and using Google’s ngram viewer, to machine learning and topic modelling.
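The simplest of those techniques, counting how often a word appears, can be sketched in a few lines of Python. This is a minimal illustration, not the method used in any of the stories listed: the sample text and the `word_counts` helper are hypothetical, and real projects would also need to deal with things like stop words and spelling variants.

```python
import re
from collections import Counter


def word_counts(text: str) -> Counter:
    """Count how often each word appears, ignoring case and punctuation."""
    # Lowercase the text, then pull out runs of letters and apostrophes.
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(words)


# Hypothetical sample text, standing in for a speech or document.
sample = "Jobs, jobs, jobs: the minister promised jobs and growth. Growth came later."
counts = word_counts(sample)

print(counts["jobs"])         # frequency of a single word (a 'scale' angle)
print(counts.most_common(2))  # the most frequent words (a 'ranking' angle)
```

The same counts can feed more than one angle: a single word's frequency speaks to scale, while `most_common` gives a ranking.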

Looking at those articles, it’s clear that once text is quantified, journalists tell the same stories about it as about any other data: using the seven most common angles.

But how those angles are used — and how often — is where it gets interesting…

7 common angles for data stories: text and documents
Scale: how often words/phrases are used
Change: how language has changed
Ranking: the most/least common words/phrases
Variation: e.g. in relation to gender, ethnicity, ideology etc.
Exploration: journeys through multiple angles; interactives
Relationships: correlations, similarities and connections
Meta: ‘how we quantified text’
Leads: clusters, patterns or themes for further digging

PEER: a technique for brainstorming interviewees and story sources

One way to ensure you generate a wide range of potential sources for a story — or for potential story leads — is to use a checklist. The PEER framework is just that: four categories to help journalists generate more names on any given story — and think more creatively about whose voices might add something to that story.

4 icons: Power, expertise, experience, representative

PEER is a mnemonic (based on a previous post) for remembering the following four types of source:

💪 Power
🧠 Expertise
👁️‍🗨️ Experience
🗣️ Representative

Each type of source brings something different to the story: voices of power primarily (but not solely) answer questions about action: what was or is being done, what should or would be done about a particular issue. These are easily the most commonly quoted sources in news reporting.

People with expertise can answer the “why” and “how” questions — and are often more likely to speak to journalists — while those with experience can verify or validate (put a human face to) events. Representatives can speak to the wider impact or significance of an issue, or represent community sentiment about it.

Making each type of source explicit allows us to think about what those roles really mean — and identify less obvious ideas for sources with power, expertise, experience or representative qualities.

How to use FOI to develop good journalism habits

Freedom of Information (FOI) requests are not only one of the best ways to get original and exclusive stories that set your reporting apart — they’re also a good way to develop core journalism habits like curiosity, scepticism, and creativity. Here are some tips on how to get started with FOI while developing those qualities.

Being curious: how often is this happening? How much has it increased?

Headlines:
Rising numbers of hospital patients so fed up they discharge themselves
Figures reveal how many lives firefighters have saved
Welsh parents owe thousands in school dinner debts
All these stories involve asking the question “how much” or “how many” about an issue or event

Headlines:
How the cost of paying up is sending bailiffs' diaries wild
Council use of bailiffs to chase debts jumps 16% in two years
Acid attack hospital admissions have almost doubled
Student Loans Company overcharges 78,000 graduates
Schools converting to academies cost councils £30m
All these stories involve asking the question “how much” or “how many” about an issue or event

Curiosity is the first quality I identified in my series on the 7 habits of successful journalists — and FOI is a great way to hone that.

One good way to get started with FOI is to identify an event or problem that you’ve read about, and get curious about it: how many times is that event happening? How much is that problem costing? These are perfect questions for FOI.

6 ways to communicate data journalism (the Inverted Pyramid of Data Journalism, part 2)

Diagram: the communication stages (visualise, narrate, humanise, personalise, sonify/physicalise, utilise).

The inverted pyramid of data journalism maps the process of using data in reporting, from generating ideas, through cleaning, contextualising and combining, to communicating. The final stage, communication, presents a range of options: from visualisation and sonification to personalisation and tools. But what are the best practices for each?

(Also available in English, German, Spanish, Russian and Ukrainian.)

1. Visualisation

Visualisation is usually the quickest way to communicate the results of data journalism: free tools such as Datawrapper and Flourish often only require you to upload your data and choose from a number of visualisation options.

The inverted pyramid of data journalism: from dataset to story

Diagram showing the inverted pyramid of data journalism as two connected pyramids: a black one with the production stages (conceive, compile, clean, contextualise, combine) linked by "question" to a green one with the communication stages (visualise, narrate, humanise, personalise, sonify/physicalise, utilise).

Data journalism projects involve several stages, each presenting its own challenges. To help make sense of them, I created what I called the ‘Inverted Pyramid of Data Journalism’. It outlines the stages that need to be considered as a story progresses from initial conception to communicating the results, and how they relate to each other. Below, I explain each stage, identify questions to consider as the project progresses, and offer advice and tips on how to tackle them.

(Also available in English, German, Spanish, Finnish, Russian and Ukrainian.)

Stage 1: Conceive

The first challenge a journalist faces is conceiving a viable idea for a data-driven story.

Parallel prompting: another way to avoid deskilling with AI

Too often discussion around using AI is “either/or” — an assumption that you either use AI for a task, or do it yourself. But there’s another option: do both.

“Parallel prompting”* is the term I use for this: while you perform a task manually, you also get the AI to perform the same task algorithmically.

For example, you might brainstorm ideas for a story while asking ChatGPT to do the same. Or you might look for potential leads in a company report — and upload it to NotebookLM to perform the same task. You might draft an FOI request but get Claude to draft one too, or get Copilot to rewrite the intro to a story while you attempt the same thing.

Then you compare the results.

When to report on a meme (and when not to): Bösch’s MATTER checklist

Marcus Bösch, the editor of the Understanding TikTok newsletter, has put together a checklist for “when a meme is everywhere and you’re unsure whether to cover it, contextualise it, or leave it alone.” (PDF version here).

The checklist, M.A.T.T.E.R., covers six things to consider: Meaning, ‘Affect’ (emotion), Type of format, Temporality, Ethics and Relevance.

M — Meaning (Lore & Context)
🎭 A — Affect (Vibe)
📱 T — Type (Format & Platform)
⏳ T — Temporality (Lifecycle & Speed)
⚖️ E — Ethics
📈 R — Relevance
Image from Understanding TikTok

“I don’t want it to be easy” and other objections to using AI

In September I took part in a panel at the African Journalism Education Network conference. The most interesting moment came when members of the audience were asked if they didn’t use AI — and why.

Thanks to @ajenda_edu for inviting me to their panel on AI in journalism education at #AJEN2025. Especially interesting was when attendees shared their reasons for *not* using AI…
(yes, it's time for a thread)
— Paul Bradshaw (@paulbradshaw) September 4, 2025

How to stop AI making you stupid: hybrid destination-journey prompting

A local map-style illustration where a pinned "answer" destination is visible, but the route is overlaid with checkpoints labelled “confidence”, “sources”, “counter-arguments”, “verify”, “edit” (image generated by ChatGPT).

Last month I wrote about destination and journey prompts, and the strategy of designing AI prompts to avoid deskilling. In some situations a third, hybrid approach can also be useful. In this post I explain how such hybrid destination-journey prompting works in practice, and where it might be most appropriate.

FAQ: How has journalism been transformed?

In the latest FAQ, I’m publishing here answers to some questions from a Turkish PR company (published on LinkedIn here)…

Q: In your view, what has been the most significant transformation in digital journalism in recent years?

There have been so many major transformations in the last 15 years. Mobile phones in particular have radically transformed both production and consumption. But having been through all those changes, AI feels like a bigger transformation than everything we’ve already been through.

It’s not just playing a role in transforming the way we produce stories, it’s also involved in major changes around what happens with those stories in terms of how they are distributed, consumed, and even how they are perceived: the rise of AI slop and AI-facilitated misinformation is going to radically accelerate the lack of trust in information (not just the media specifically). I’m being careful to say ‘playing a role’ because of course the technology itself doesn’t do anything: it’s how that technology is designed by people and used by people.

Online Journalism Blog

Comment, analysis and links covering online journalism and online news, citizen journalism, blogging, vlogging, photoblogging, podcasts, vodcasts, interactive storytelling, publishing, Computer Assisted Reporting, User Generated Content, searching and all things internet.
