Monthly Archives: May 2019

If we are using AI in journalism we need better guidelines on reporting uncertainty

Chart: women speak 27% of the time in Game of Thrones

The BBC’s chart mentions a margin of error

There’s a story out this week on the BBC website about dialogue and gender in Game of Thrones. It uses data generated by artificial intelligence (AI) — specifically, machine learning —  and it’s a good example of some of the challenges that journalists are increasingly going to face as they come to deal with more and more algorithmically-generated data.

Information and decisions generated by AI are qualitatively different from the sort of data you might find in an official report, but journalists may fall back on treating data as inherently factual.

Here, then, are some of the ways the article dealt with that — and what else we can do as journalists to adapt.

Margins of error: journalism doesn’t like vagueness

The story draws on data from an external organisation, Ceretai, which “uses machine learning to analyse diversity in popular culture.” The organisation claims to have created an algorithm which “has learned to identify the difference between male and female voices in video and provides the speaking time lengths in seconds and percentages per gender.”

Crucially, the piece notes that:

“Like most automatic systems, it doesn’t make the right decision every time. The accuracy of this algorithm is about 85%, so figures could be slightly higher or lower than reported.”

And this is the first problem. Continue reading

Advertisements

I’m holding a taster day for anyone interested in studying data or multiplatform journalism (part time or full time)

MA journalism taster day

I’ll be holding a special ‘Taster Day’ on June 11 for anyone interested in studying journalism at postgraduate level — specifically data journalism (which includes a part time PGCert option for those already working in the industry) and multiplatform journalism (full time only).

  • In the morning (10am-12.30pm), I will be running a taster of the MA and part time PGCert courses in Data journalism. I’ll be covering introductory data journalism techniques and also discussing some tips, tricks and trends to watch out for
  • In the afternoon (2pm-4.30pm), I’ll be hosting a taster session of the MA in Multiplatform and Mobile Journalism. This will cover reporting news for multiplatform audiences, and how to use mobile journalism to report stories

I’ll be making time in both sessions for questions and discussion about postgraduate study and developments in journalism.

If you are interested in attending either event, please sign up on the Eventbrite page here. Drop me a line on Twitter @paulbradshaw or email paul.bradshaw@bcu.ac.uk if you have any questions!

From making data physical to giving journalists confidence (and a few other things too): Data Journalism UK 2019

marie segger at data journalsm uk 19

Last week saw the third Data Journalism UK conference, an opportunity for the country’s data journalists to gather, take stock of the state of the industry and look at what’s ahead.

The BBC Shared Data Unit’s Pete Sherlock kicked off the event, looking back at the first 18 months of the unit’s existence. In that period the unit has trained 15 secondees and helped generate over 600 stories across more than 250 titles in the regional press.

Sherlock highlighted two stories in particular to demonstrate how the data unit had helped equip regional reporters in holding power to account: the Eastern Daily Press’s Dominic Gilbert‘s story on legal aid deserts, and JPI Media’s Aimee Stanton‘s report on electric car charging points.

Both stories resulted in strong pushback – from the Ministry of Justice and the electric car industry respectively – but their new data journalism skills gave them the confidence to persist with the story. Continue reading