Category Archives: data journalism

How to use FOI to develop good journalism habits

Freedom of Information (FOI) requests are not only one of the best ways to get original and exclusive stories that set your reporting apart — they’re also a good way to develop core journalism habits like curiosity, scepticism, and creativity. Here are some tips on how to get started with FOI while developing those qualities.

Being curious: how often is this happening? How much has it increased?

Curiosity is the first quality I identified in my series on the 7 habits of successful journalists — and FOI is a great way to hone that.

One good way to get started with FOI is to identify an event or problem that you’ve read about, and get curious about it: how many times is that event happening? How much is that problem costing? These are perfect questions for FOI.

Continue reading

6 formas de comunicar jornalismo de dados (a Pirâmide Invertida do Jornalismo de Dados – parte 2)

etapas de comunicação (visualizar, narrar, humanizar, personalizar, sonorizar/materializar, utilizar).

A pirâmide invertida do jornalismo de dados mapeia o processo de utilização de dados na reportagem, desde a geração de ideias, passando pela limpeza, contextualização e combinação, até à comunicação. A fase final — a comunicação — apresenta uma série de opções: desde a visualização e sonificação até à personalização e ferramentas. Mas quais são as melhores práticas para cada uma?

(Também disponível em inglês, alemão e espanhol, russo e ucraniano).

1. Visualização

A visualização é normalmente a forma mais rápida de comunicar os resultados do jornalismo de dados: ferramentas gratuitas como Datawrapper e Flourish muitas vezes exigem apenas que você carregue os seus dados e escolha entre várias opções de visualização.

Continue reading

A pirâmide invertida do jornalismo de dados: Do conjunto de dados à história

Diagrama mostrando a pirâmide invertida do jornalismo de dados com duas pirâmides conectadas: uma preta com as etapas de produção (conceber, compilar, limpar, contextualizar, combinar) ligada por "questionar" a uma verde com as etapas de comunicação (visualizar, narrar, humanizar, personalizar, sonorizar/materializar, utilizar).

Os projetos de jornalismo de dados envolvem várias etapas, cada uma apresentando seus próprios desafios. Para ajudar a compreendê-las, criei o que chamei de ‘Pirâmide Invertida do Jornalismo de Dados’. Ela delineia as etapas que precisam ser consideradas à medida que a matéria avança desde a conceção inicial até a comunicação dos resultados, e como elas se relacionam entre si. Abaixo, explico cada etapa, identifico questões a considerar conforme o projeto avança e ofereço conselhos e dicas sobre como enfrentá-las.

(Também disponível em Inglês, Alemão, Espanhol, Finlandês, Russo e Ucraniano.)

Etapa 1: Conceber

O primeiro desafio que um jornalista enfrenta é conceber uma ideia viável para uma matéria baseada em dados.

Continue reading

FAQ: On data journalism and open data

In the second part of this FAQ (first part here), I respond to more answers to questions from a Turkish PR company (published on LinkedIn here)…

Q: What skills do you think a journalist must absolutely have when working with data?

There are three core skills I always begin with: sorting, filtering, and calculating percentages (proportion and change). You can do most data journalism stories with those alone.

Alongside those basic technical skills it’s important to have the basic editorial skills of checking a source against other sources (following up your data by getting quotes or interviews), and being able to communicate what you’ve found clearly for a particular audience.

Continue reading

FAQ: How has journalism been transformed?

In the latest FAQ, I’m publishing here answers to some questions from a Turkish PR company (published on LinkedIn here)…

Q: In your view, what has been the most significant transformation in digital journalism in recent years? 

There have been so many major transformations in the last 15 years. Mobile phones in particular have radically transformed both production and consumption — but having been through all those changes, AI feels like a biggest transformation than all the changes that we’ve already been through. 

It’s not just playing a role in transforming the way we produce stories, it’s also involved in major changes around what happens with those stories in terms of how they are distributed, consumed, and even how they are perceived: the rise of AI slop and AI-facilitated misinformation is going to radically accelerate the lack of trust in information (not just the media specifically). I’m being careful to say ‘playing a role’ because of course the technology itself doesn’t do anything: it’s how that technology is designed by people and used by people. 

Continue reading

Visualisation as an editorial process

In the second part of this extract from a book chapter in the new Routledge Companion to Visual Journalism, I look at the editorial processes involved in data visualisation, along with the ethical considerations and challenges encountered along the way.

Decisions around what data to visualise and how to visualise it involve a range of ethical considerations and challenges, and it is important to emphasise that data visualisation is an editorial process just as much as any other form of factual storytelling.

Journalists and designers employ a range of rhetorical devices to engage an audience and communicate their story, from the choice of the chart and its default views or comparisons, to the use of colour, text and font, and animations and search suggestions (Segel and Heer 2011; Hullman & Diakopoulos 2011).

Chart types are story genres

The chart that a journalist chooses to visualise data plays a key role in suggesting the type of story that is being told, and what the user might do with the data being displayed.

If a pie chart is chosen then this implies that the story is about composition (parts of a whole). In contrast, if a bar chart is used then the story is likely to be about comparison.

Line charts imply that the reader is being invited to see something changing over time, while histograms (where bars are plotted along a continuum, rather than ranked in order of size) invite us to see how something is distributed across a given scale.

Scatterplots — which plot points against two values (such as the cancer rate in each city against the same city’s air pollution) — invite us to see relationships.

Continue reading

Data, data visualization and interactives within news

In this extract from a book chapter in the new Routledge Companion to Visual Journalism, I look at how the explosion of data as a source for journalists, and the separation of content from interface in online publishing, have combined to lay the foundations for a range of new storytelling forms, from interactive infographics and timelines to charticles and scrollytelling.

Although the term ‘data journalism’ is a relatively recent one, popularised around 2010, data has been part of journalism throughout its history, from early newsletters covering stock prices and shipping schedules in the 17th century, to The Guardian’s 1821 first edition front page table of school spending, US investigations of politicians’ travel expenses in the 1840s and campaigning factchecking of lynching in the 1890s.

The introduction of computers into the newsroom in the 20th century added a new dimension to the practice. After some early experimentation by CBS News in predicting the outcome of the 1952 presidential election by applying computer power to data, a major breakthrough came in the 1960s with Philip Meyer’s use of databases and social science methods to investigate the causes of riots in Detroit.

Continue reading

Double counting: how to spot it and how to avoid it

Double counting — counting something more than once in data — can present particular risks for journalists, leading to an incorrect total or proportion. Here’s how to spot it — and what to do about it.

Look at the following chart showing the gender of teachers in UK schools, based on data on teacher headcounts. Notice anything wrong? (There are at least two problems)

Pie chart: Sex of teachers in UK schools
There are three visible slices: male, female and 'total', which takes up more than half of the pie.

The most obvious problem is that the chart appears to be ‘comparing apples with oranges’ (things that aren’t comparable). Specifically: “male”, “female”, and “unknown” are similar categories which can fairly be compared with each other, but “total” is a wider category that contains the other three.

I’ve used a pie chart here to make it easier to spot: we expect a pie chart to show parts of a whole, not the whole as well as its parts.

But the same problem should be obvious from the same data in a table before visualising it:

Table showing headcount of teachers in the categories: female, male, unknown, total, plus a grand total of all four at the bottom

The table shows us that we have both a “Total” and a “Grand Total”. This is a red flag. There can only be one total, so if there’s more than one that’s a strong sign of double counting.

Why is this happening? We need to take a look at the data.

Continue reading

7 técnicas de design de prompts para IA generativa que todo jornalista deveria conhecer

Ferramentas como o ChatGPT podem parecer falar a sua língua, mas, na verdade, falam uma linguagem de probabilidade e suposições fundamentadas. Você pode fazer-se entender melhor — e obter resultados mais profissionais — com algumas técnicas simples de prompting. Aqui estão as principais para adicionar ao seu kit de ferramentas (Este post foi traduzido do inglês original usando o Claude Sonnet 4.5 como parte de uma experiência. Por favor, avise-me se encontrar algum erro ou traduções incorretas).

Técnicas de design de prompts para IA generativa

Prompting de função

Prompting de exemplo único

Prompting recursivo

Geração aumentada por recuperação

Cadeia de pensamento

Meta prompting

Prompting negativo

Prompting de função

O prompting de função envolve atribuir um função específico à sua IA. Por exemplo, você pode dizer “Você é um correspondente experiente de educação” ou “Você é o editor de um jornal nacional britânico” antes de delinear o que está a pedir que façam. Quanto mais detalhes, melhor.

pesquisas contraditórias sobre a eficácia do prompting de função, mas no nível mais básico, fornecer um papel é uma boa maneira de garantir que você fornece contexto, o que faz uma grande diferença na relevância das respostas.

Continue reading

Telling stories with data: more on the difference between ‘variation’ stories and ‘ranking’ angles

7 common angles for data storie: scale, change, ranking, variation, explore, relationships, bad data, leads
The 7 angles. Also available in Norwegian, Portuguese, Uzbek and Finnish.

One of the most common challenges I encounter when teaching people the 7 most common story angles in data journalism is confusion between variation and ranking stories. It all comes down to the difference between process and product.

That’s because both types of story involve ranking as a piece of data analysis.

We might rank the number of specialist teachers in the country’s schools, for example, in order to tell either of the following stories:

  • “There are more specialist science teachers than those in any other subject, new data reveals”
  • “New data reveals stark differences in the number of specialists teaching each subject in secondary schools

The first story reveals which subject has the most teachers — it is a ranking angle because it ranks teachers by subject.

The second story reveals the simple fact that variation exists, without focusing on any particular subject.

Continue reading