Tag Archives: Finding Stories In Spreadsheets

How to combine two datasets to put a story into context (book extract)

excel for journalists book — This is an extract from Finding Stories in Spreadsheets. Get the book here.

One of the most common challenges in a data-driven story is combining two sets of data — such as events and populations — to put a story into context. In an extract from the ebook Finding Stories in Spreadsheets, I explain how to use lookup functions to combine two tables. The longer ebook version of this tutorial includes a dataset and exercise to employ these techniques.

Combining data is often a great way of telling new stories about spreadsheets. For example: you may have one table showing pass rates for each school in an area, and another table showing their addresses. Combining these would allow you to identify geographical patterns, or to place them on a map.

You could also combine the addresses with poverty rates for different locations, or unemployment to see if there’s a possible relationship (remembering that correlation does not equal causation), or to identify the schools performing particularly well despite local conditions. In the video below, for example, I walk through an example of combining data on different sports teams’ attendances with data on their rankings, allowing you to see who’s attracting large crowds despite their poor performance.

The VLOOKUP function is one of the most widely-used tools in combining data in this way. It stands for Vertical lookup, and means that the spreadsheet will look up and down a column (i.e. vertically) for whatever you ask it. In more recent versions of Excel the XLOOKUP function has been introduced to make the process easier — but the process is similar for both.

Continue reading →

What are regular expressions — and how to use them in Google Sheets to get data from text

Data journalism tips in Albanian (Të gjesh histori në faqet e Excel-it)

1 Reply

Data journalism book Stories with Spreadsheets

I spent some time recently in Albania delivering some training in data journalism to journalists. While I was there, the translator Ermal Como translated two chapters of my data journalism book Finding Stories in Spreadsheets, and a 14-page cribsheet for Excel formulae, into Albanian.

In addition, there is an exercise on learning spreadsheet techniques by finding stories in European Investment Bank data.

With his permission I’ve made the three documents available for anyone who might find them useful. You can find them embedded below: Continue reading →

All my data journalism ebooks are $5 or less this Christmas

How to: fix spreadsheet dates that are in both US and UK formats

16 Replies

This map by Artem Karimov shows which countries use which data formats

It’s quite common when working with Google Sheets to have data set to US format (Month-Day-Year) without realising it. This is because Google will format your dates based on what ‘locale’ or language you have set – and the default is US English.

Instructions on how to change that are here – but what if it’s too late? What if you’ve already inputted or imported data which, when updated to a different format, will make it the wrong date? Continue reading →

How to: calculate or find rankings in spreadsheets using RANK, LARGE and SMALL

2 Replies

This is an extract from Finding Stories in Spreadsheets. Get the book here.

The ebook version of this tutorial includes a dataset and exercise to employ these techniques.

Right at the start of my book on Excel for journalists I talk about sorting data to find out which values come top or bottom. However, there is a family of functions which will give you a lot more control in finding out not just who is top or bottom, but the rank of any value in any series of values.

This is particularly useful if you want to compare ranks.

Many stories are based on finding out where your own country or region ranks in the latest data

Ranking isn’t just about statistics – it can be used in consumer stories too

For example, say you had a table showing school performance across the last two years.

Each table shows the percentage of pupils achieving the top grades in that year. You can use RANK to find out what rank each percentage would have placed the school in for each year. Continue reading →

Spanish version of data journalism book now live: “Excel para periodistas”

2 Replies

Cover image by Matt Buck at Drawnalism.

A Spanish translation of my ebook ‘Finding Stories in Spreadsheets’ is now live. The book is available for an initial special discount price of $4.99 for ‘early adopters’.

Meanwhile, I’m continuing to add chapters to the English version, which you can download from here. The latest chapters deal with using query-based URLs in Google Sheets, and generating random numbers for mapping.

Spanish version of ‘Excel for Journalists’ on the way

HOW TO: Find out the ages of people using Excel

3 Replies

This post is taken from the ebook Finding Stories With Spreadsheets

“How do I calculate an age in Excel?” Marion Urban, a French journalist and student on the MA in Online Journalism in Birmingham, was preparing data for the forthcoming UK General Election.

In order to do this Marion had downloaded details on the candidates who had stood successfully in the previous election.

“It was a very young intake. But it wasn’t easy to calculate their ages.”

Indeed. You would think that calculating ages in Excel would be easy. But there is no off-the-shelf function to help you do so. Or at least, no easy-to-find function.

Instead there are a range of different approaches: some of them particularly, and unnecessarily complicated.

In this extract from Finding Stories in Spreadsheets I will outline one approach to calculating ages, which also illustrates a useful technique in using spreadsheets in stories: the ability to break down a problem into different parts. Continue reading →

Finding Stories in Spreadsheets – ebook now live!

2 Replies

Finding stories in spreadsheets book cover

Cover design by Matt Buck/Drawnalism

My latest ebook – Finding Stories in Spreadsheets – is now live on Leanpub.

As with Scraping for Journalists, I’m publishing the book week-by-week so the book can be updated based on reader feedback, user suggestions and topical developments.

Each week you can download a new chapter covering a different technique for finding stories, from calculating proportions and changes, to combining data, cleaning it up, testing it, and extracting specific details.

There’s also a downloadable spreadsheet at the end of each chapter with a series of exercises to practise that chapter’s technique and find particular stories.

Along the way I tackle some other considerations in telling the story, such as context and background, and the importance of being specific in the language that you use.

If there’s anything you’d like covered in the book let me know. You can also buy the book in a ‘bundle’ with its sister title Data Journalism Heist, which covers quick-turnaround techniques for finding stories in spreadsheets using pivot tables and advanced filters.

Online Journalism Blog

Comment, analysis and links covering online journalism and online news, citizen journalism, blogging, vlogging, photoblogging, podcasts, vodcasts, interactive storytelling, publishing, Computer Assisted Reporting, User Generated Content, searching and all things internet.

Tag Archives: Finding Stories In Spreadsheets

How to combine two datasets to put a story into context (book extract)

What are regular expressions — and how to use them in Google Sheets to get data from text

Data journalism tips in Albanian (Të gjesh histori në faqet e Excel-it)

All my data journalism ebooks are $5 or less this Christmas

How to: fix spreadsheet dates that are in both US and UK formats

How to: calculate or find rankings in spreadsheets using RANK, LARGE and SMALL

Spanish version of data journalism book now live: “Excel para periodistas”

Spanish version of ‘Excel for Journalists’ on the way

HOW TO: Find out the ages of people using Excel

Finding Stories in Spreadsheets – ebook now live!