Tag Archives: Quartz

Words as data: how data journalists tell stories about documents and text

Documents and other collections of text can be goldmines for data journalism — if you know how to approach them as data. Here are some techniques and inspiration for your next data project.

From stories about political speech and song lyrics, to street names and social media chatter, data journalists now have a wide range of examples of text-as-data to draw inspiration and guidance from, while tools such as Pinpoint and NotebookLM are making text analysis easier than ever.

I compiled a list of over 200 pieces of data journalism where text or documents were used as sources. Quantification techniques ranged from counting the frequency of a single word and using Google’s ngram viewer, to machine learning and topic modelling.

Looking at those articles it’s clear that, once quantified, journalists tell the same stories about text as any other piece of data: using the seven most common angles.

But how those angles are used — and how often — is where it gets interesting…

7 common angles for data stories: text and documents 
Scale: how often words/phrases are used
Change: how language has changed
Ranking: the most/least common words/phrases
Variation: e.g. in relation to gender, ethnicity, ideology etc.
Exploration: journeys through multiple angles; interactives
Relationships: correlations, similarities and connections
Meta: ‘how we quantified text’
Leads: clusters, patterns or themes for further digging
Continue reading

On International Women’s Day here are 7 data journalism projects about women’s issues

Photo: Pixabay

Women represent 49.5% of the world’s population, but they do not have a corresponding public, political and social influence. In recent years, more and more women have raised their voices, making society aware of their challenges — data journalists included. To commemorate International Women’s Day, Carla Pedret presents a list of data journalism projects that detail the sacrifices, injustices and prejudices that women still have to face in the 21st century.

Continue reading

17 takeaways on mobile publishing from Monetising Media 2015

mobile desktop timeHow do you reach an audience which is consuming more content on mobile than desktop – and at different schedules to the traditional print model? At Monetising Media last week industry leaders shared their concerns and strategies for succeeding on mobile without losing quality and content. Maria Crosas Batista sums up some of the key takeaways:

Product rather than platform: Lee Wilkinson, Hearst Magazines International

Lee Wilkinson bets on product rather than platform:

“If your audience is the core of a product strategy you are more likely to engage and reach them when your content is out there.”

Continue reading