Here are the angles journalists use most often to tell the stories in data

In my data journalism teaching and training I often talk about common types of stories that can be found in datasets — so I thought I would take 100 pieces of data journalism and analyse them to see if it was possible to identify how often each of those story angles is used.

I found that there are actually broadly seven core data story angles. Many incorporate other angles as secondary dimensions in the storytelling (a change story might go on to talk about the scale of something, for example), but all the data journalism stories I looked at took one of these as its lead.

In the first of a two-part series (also in Norwegian, Uzbek, Portuguese and Finnish.) I walk through how the four most common angles can help you identify story ideas, the variety of their execution, and the considerations to bear in mind.

Data angle 1: Scale — “This is how big a problem is”

Screenshots of headlines:
Cost of rural crime hits £50m
Three quarters of driving test centres at maximum wait time — Stories leading on the scale of something might focus on total amounts, proportions or clusters on maps

Perhaps the most common type of story found in data is the scale story: these are stories that identify a big problem, the size of an issue which has become topical.

At their most simple scale stories provide an update on new numbers being released: it could be the latest unemployment figures, the amount of crime, air pollution, money spent on some area, births, deaths or marriages.

During the first months of the pandemic, for example, we had daily scale stories on the numbers of cases, deaths, and tests, among other things.

Examples of scale stories include ‘Death toll in UK care homes from coronavirus may be 6,000, study estimates‘, but also stories like ‘Unduly lenient sentences review scheme ‘inadequate‘ where the lead is based on reaction to the scale of an issue you have identified.

Sometimes scale is provided as backgroud to a single-event story, as in ‘Drone causes Gatwick Airport disruption‘ (how many near misses are there?) or to a policy proposal, such as in ‘New drivers could be banned from driving at night, ministers say‘ (how many new drivers are under 19).

Scale stories are one of the easier genres to write: in many cases no calculation is needed.

Indeed, the main work involved is likely to be in setting context to that scale — at its worst a scale story simply becomes a ‘big number’ story (“A lot of money was spent on stuff” or “Something happens to a lot of people”), and the reader is left unclear whether this is actually newsworthy or just normal.

For that reason it’s important to put scale into context by using percentages or proportions (e.g. “one in five”) or comparisons and analogies (“The money spent on the scheme is the equivalent of the wages of 500 teachers”).

You might also bring in change and/or variation as a secondary angle: establishing historical context to the scale you’ve just outlined, or how that scale varies.

In the New York Times piece above, for example, the “true toll” (scale) of the coronavirus outbreak is immediately contextualised by charts which show how that has changed since the start of the year, in different parts of the country.

Data angle 2: Change and stasis — things are going up, things are going down, things aren’t happening

Change-led stories can reveal how much something has changed (or not) — or focus on why

Change stories are almost as common as scale stories — and probably more straightforward to pitch.

After all, change is inherently newsworthy and gives you the verb (“rises”, “plummets”, “[goes] up”) that you need in a headline.

Once you’ve identified some sort of change in your data it’s likely you will need further reporting to answer the “why?” question. Why are those numbers going up or down?

You might also add a secondary angle to your story which explores ranking in that trend – the areas or categories where those numbers have gone up or dropped the most and least.

This can help you direct your reporting on ‘Why?’ because chances are that the areas affected most will be those most aware of the issue, and able to comment on it.

When reporting on change it’s important to be aware of two considerations: seasonality and margins of error.

Seasonality is the role that (typically predictable and normal, and therefore non-newsworthy) seasonal factors can play in numbers, such as the end of a financial year or school term, the release of new cars or simply changing temperatures. Year-on-year comparisons (this August compared to last August, for example) or seasonal adjustment is often used to prevent this effect.

The margin of error, meanwhile, is the range within which the real numbers actually lie. Because many datasets are based on samples which are then generalised to the rest of the population being looked at, a margin of error (or confidence intervals) is used to indicate how accurate that generalisation actually is. If any change is within that margin of error then we can’t really report that anything has changed.

If your analysis finds a lack of change, that can also be newsworthy. In this story on company insolvencies, for example, we fully expected to find an increase during the pandemic but the absence of any change in companies going bust was surprising — and led us to interview experts about why this might be the case. In other cases, a lack of change can also be significant because it means things aren’t improving — especially where resources have been invested, or promises made.

Data angle 3: Ranking and outliers — who’s best and who’s worst? Who’s unusual and why?

Headlines:
Saudi Arabia are third biggest spenders
Which area has the most school absences? — Ranking stories can lead on who is top or bottom – or where something of interest ranks

Ranking stories are all about who or what comes out worst or best in a dataset, or where a particular entity of interest (the local police force, schools or teams, or an industry if it’s the specialist press) sits in comparison to others.

Typical stories in this category might include “Local area one of worst areas for crime” or “Local schoolchildren get third best results in the country”.

You might focus on the places ‘worst-hit’, as in “The parts of Birmingham in top 10 UK areas worst-hit by Universal Credit advances“, or you might look at where your sector compares to others, as in “Construction is third most dangerous UK industry.”

But ranking stories can also be about the best or worst times, places or categories that a dataset ‘reveals’.

The Economist article above, for example, is about the top-ranked month for listening to gloomy songs. A Birmingham Live story, on the other hand, leads on The most common crimes in Sandwell – and where you’re most likely to be a victim.

The Economist, by the way, dedicated part of one data journalism newsletter to ‘How to compile an index‘:

“How useful are such indices? Any ranking that isn’t built on objective criteria is open to criticism. Qualitative rankings are built on subjective measures. Perhaps “tolerable” means almost the same to someone as “uncomfortable”—whereas “intolerable” might feel twice as bad as “undesirable”? On ordinal scales the distance between these measures is subjective—and yet they have to be assigned a numerical score for the ranking to work.

“The Economist has been publishing its Big Mac index, a measure of currency valuations, since 1986. In 2011 we published the Shoe-Thrower’s index, which assessed the potential for unrest across the Arab world. And this year, we’ve created a global normalcy index, which is tracking countries’ recovery from covid-19. An imperfect measure is better than having no means of comparison at all.”

Ranking stories need to be careful about context: an area may have the most crime, disease, or pollution simply because it also has the most people. Reporting dates can skew data, too: COVID case rates tended to peak on Tuesdays because the figures “include many deaths not reported over the weekend”, as FullFact pointed out.

Data angle 4: Variation — “postcode lotteries”, maps and distributions

Headlines: Scottish data shows big variations in school exclusions
US courts show wide regional disparities in granting qualified immunity

Variation stories are similar to ranking stories — but with a key difference: the story here isn’t who or what is top or bottom. The story leads instead on the very fact that there are wide differences at all.

For that reason, this story angle works best when we expect there to be equal treatment or consistency.

A classic example of a variation story uses a choropleth map or heatmap to show how some parts of a country have less access to something, or more demand for something, than other parts.

The phrase “postcode lottery”, for example, reflects the sense that a person’s access to something which is supposed to be equally distributed is actually a game of chance.

The BBC data unit story “IVF: NHS couples ‘face social rationing’“, for example, maps out how where you live in England can mean the difference between being able to access fertility treatment or not.

A variation story may be revealing that the unfairness exists — or, if people are aware of it, precisely how and where it plays out (particularly in their area).

Algorithmic accountability stories such as ProPublica’s Machine Bias series are often about variation and the unfairness that is revealed when an algorithm is unpicked: it may be people being sentenced differently, or given different insurance quotes, despite no meaningful difference between them on the dimensions that matter.

A variation story can equally be used to highlight areas of underserved demand, or lack of supply: one story that I worked on for the BBC Shared Data Unit about electric car charging points involved identifying how much infrastructure existed in the country, and where. The picture that the data painted provided a foundation for case studies and reaction.

Telling stories with data: more on the difference between ‘variation’ stories and ‘ranking’ angles

In the second part of this series I look at the other three angles: exploratory stories, those that focus on data itself, and angles about relationships.

This post is also available in Norwegian, and this article explains the principles in Uzbek. A version of the diagram is also available in Finnish.

Online Journalism Blog

Comment, analysis and links covering online journalism and online news, citizen journalism, blogging, vlogging, photoblogging, podcasts, vodcasts, interactive storytelling, publishing, Computer Assisted Reporting, User Generated Content, searching and all things internet.

Here are the angles journalists use most often to tell the stories in data

Data angle 1: Scale — “This is how big a problem is”

Data angle 2: Change and stasis — things are going up, things are going down, things aren’t happening

Data angle 3: Ranking and outliers — who’s best and who’s worst? Who’s unusual and why?

Data angle 4: Variation — “postcode lotteries”, maps and distributions

10 thoughts on “Here are the angles journalists use most often to tell the stories in data”

Leave a comment Cancel reply

Data angle 1: Scale — “This is how big a problem is”

Data angle 2: Change and stasis — things are going up, things are going down, things aren’t happening

Data angle 3: Ranking and outliers — who’s best and who’s worst? Who’s unusual and why?

Data angle 4: Variation — “postcode lotteries”, maps and distributions

Share this:

Related

10 thoughts on “Here are the angles journalists use most often to tell the stories in data”

Leave a comment Cancel reply