Using Google Spreadsheets as a database (no, it really is very interesting, honest)

This post by Tony Hirst should be recommended reading for every journalist interested in the potential of computers for reporting.

Why? Because it shows you how you can use Google spreadsheets to interrogate data as if it was a database; and because it demonstrates the importance of news organisations releasing data to their users.

Put aside any intimidation you might feel at the mention of APIs and query languages. What it boils down to is this: you can alter the web address of a Google spreadsheet to filter the data and find the story.

Simple as that. 

Hirst uses the example of the spreadsheet of MPs expenses recently released by The Guardian (they’ve also published Lords expenses). By altering the URLs this is what he generates (I’m quoting his bullet points):

OK, you need to know the words to use (and if you have a link to an easy reference for these let me know*), but this is still a lot easier than using programming languages and databases.

As I say, this also illustrates the importance of publishing raw data so users can interrogate it in their own ways, which is precisely what The Guardian’s Data Store has been doing, meaning that people like Tony can create interfaces like this.

Wonderful.

*Tony has very generously created this page which helps you formulate your search – and generates the URL. If you were working on a different spreadsheet you could just replace the spreadsheet URL and change any column references accordingly.

UPDATE: Tony also has a version which allows you to pick from Guardian datasets.

Bookmark the permalink. Post a comment or leave a trackback:

7 Comments

  1. m
    Posted May 21, 2009 at 10:04 pm | Permalink

    Curious problem with all of the links in the comments section.

  2. Posted May 22, 2009 at 8:32 am | Permalink

    Thanks – I recently updated my version of wordpress so I wonder if one of the plugins is not working probably: my guess being the dofollow one.

  3. Posted May 24, 2009 at 7:29 pm | Permalink

    The mention of Google Spreadsheets here is a little misleading – you could equally do the same sort of analysis using an offline tool like Microsoft Excel or Calc (the free OpenOffice equivalent).
    The point remains though that most journalists could do with help in getting to grips with this; I’ve had colleagues asking this week precisely about how they can use spreadsheets to cut through the swathes of data around expenses.

  4. Posted May 25, 2009 at 10:59 am | Permalink

    But spreadsheets are just databases. Its just another label?

    Whether you use google, microsoft, Zoho or any other out there.

    And of course you can sort the fields to generate the ‘useful’ data. This should not be ‘news’.

  5. Posted May 28, 2009 at 8:37 pm | Permalink

    James, Calli – the ‘news’ here it seems to me is that you can alter the URL in ways that filter information in more sophisticated ways than just using ’sort’. It’s that that makes this more like a database you can interrogate using combinations of factors than a simple spreadsheet

  6. James
    Posted August 2, 2009 at 1:47 am | Permalink

    This is interesting indeed.

    I tried with my own spreadsheet,
    “http://spreadsheets.google.com/tq?tqx=out%3Ahtml&tq=select+count%28A%29+where+A%3DPHP%3Fkey%3D0AipNn919hx-OdHNhTDl1RExJVlFMUXVwOGUtNjdzanc&hl=en”

    but I always got this message:
    “Oops, an error occured.
    Status: error
    Reason: Access denied
    Description: Access denied”

    Anyone can view and edit the spreadsheet via the key I used in this example, how come I still get access denied ?

  7. Nicolas
    Posted September 24, 2009 at 1:36 pm | Permalink

    The problem is that don’t work with private spreadsheet only public. Any idea to work with provate spreadsheet and make queries like this. Will be great !

7 Trackbacks

  1. [...] to see a couple of people picking up on my post about using Google Spreadsheets as a database (e.g. Using Google Spreadsheets as a database (no, it really is very interesting, honest) over at the Online Journalism blog), but it struck me that the URL hacking involved might still [...]

  2. [...] have through a laboriously detailed and publicly accessible Google spreadsheet. As Paul Bradshaw points out at the Online Journalism Blog, this structure actually allows reporters (or in fact anyone who is [...]

  3. By The golden age of data journalism? on May 21, 2009 at 4:17 am

    [...] have through a laboriously detailed and publicly accessible Google spreadsheet. As Paul Bradshaw points out at the Online Journalism Blog, this structure actually allows reporters (or in fact anyone who is [...]

  4. By links for 2009-05-21 « on May 21, 2009 at 9:30 pm

    [...] Using Google Spreadsheets as a database (no, it really is very interesting, honest) | Online Journal… [...]

  5. By links for 2009-05-22 « Sarah Hartley on May 22, 2009 at 8:02 pm

    [...] Using Google Spreadsheets as a database (no, it really is very interesting, honest) | Online Journal… Shows you how you can use Google spreadsheets to interrogate data as if it was a database; and because it demonstrates the importance of news organisations releasing data to their users. (tags: spreadsheets data google) [...]

  6. By links for 2009-05-25 « David Black on May 25, 2009 at 9:10 am

    [...] Using Google Spreadsheets as a database (no, it really is very interesting, honest) – Online J… "What it boils down to is this: you can alter the web address of a Google spreadsheet to filter the data and find the story." (tags: internet journalism data webdevelopment database google googledocs api) [...]

  7. [...] has already produced some great work from what I once described as the “Technician” variant of distributed [...]

Post a Comment

Your email is never published nor shared. Required fields are marked *

*
*

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>

Subscribe without commenting