Data journalism pt4: visualising data – tools and publishing (comments wanted)

This is a draft from a book chapter on data journalism (here are parts 1; two; and three, which looks the charts side of visualisation). I’d really appreciate any additions or comments you can make – particularly around tips and tools.

UPDATE: It has now been published in The Online Journalism Handbook.

Visualisation tools

So if you want to visualise some data or text, how do you do it? Thankfully there are now dozens of free and cheap pieces of software that you can use to quickly turn your tables into charts, graphs and clouds.

The best-known tool for creating word clouds is Wordle (wordle.net). Simply paste a block of text into the site, or the address of an RSS feed, and the site will generate a word cloud whose fonts and colours you can change to your preferences. Similar tools include Tagxedo (tagxedo.com) and Wordlings (http://wordlin.gs), both of which allow you to put your word cloud into a particular shape.

ManyEyes (manyeyes.alphaworks.ibm.com/manyeyes/) also allows you to create word clouds and tag clouds – as well as word trees and phrase nets that allow you to see common phrases. But it is perhaps most useful in allowing you to easily create scattergrams, bar charts, bubble charts and other forms. The site also contains a raft of existing data that you can play with to get a feel for the site. Similar tools that allow access to other data include Factual (factual.com), Swivel (swivel.com)[see comments], Socrata (socrata.com) and Verifiable.com (verifiable.com). And Google Fusion Tables (tables.googlelabs.com) is particularly useful if you want to collaborate on tables of data, as well as offering visualisation options.

More general visualisation tools include widgenie (widgenie.com), iCharts (icharts.net), ChartTool (onlinecharttool.com) and ChartGo (www.chartgo.com). FusionCharts is a piece of visualisation software with a Google Gadget service that publishers may find useful. You can find instructions on how to use it at www.fusioncharts.com/GG/Docs

If you want more control over your visualisation – or want it to update dynamically when the source information is updated, Google Chart Tools (code.google.com/apis/charttools) is worth exploring. This requires some technical knowledge, but there is a lot of guidance and help on the site to get you started quickly.

Tableau Public is a piece of free software you can download (tableausoftware.com/public) with some powerful visualisation options. You will also find visualisation options on spreadsheet applications such as Excel or the free Google Docs spreadsheet service. These are worth exploring as a way to quickly generate charts from your data on the fly.

Publishing your visualisation

There will come a point when you’ve visualised your data and need to publish it somehow. The simplest way to do this is to take an image (screengrab) of the chart or graph. This can be done with a web-based screencapture tool like Kwout (kwout.com), a free desktop application like Skitch (skitch.com) or Jing (jingproject.com), or by simply using the ‘Print Screen’ button on a PC keyboard (cmd+shift+3 on a Mac) and pasting the screengrab into a graphics package such as Photoshop.

The advantage of using a screengrab is that the image can be easily distributed on social networks, image sharing websites (such as Flickr), and blogs – driving traffic to the page on your site where it is explained.

If you are more technically minded, you can instead choose to embed your chart or graph. Many visualisation tools will give you a piece of code which you can copy and paste into the HTML of an article or blog post in the place you wish to display it (this will not work on most third party blog hosting services, such as WordPress.com). One particular advantage of this approach is that the visualisation can update itself if the source data is updated.

Alternatively, an understanding of Javascript can allow you to build ‘progressively enhanced’ charts which allow users to access the original data or see what happens when it is changed.

Showing your raw data

It is generally a good idea to give users access to your raw data alongside its visualisation. This not only allows them to check it against your visualisation but add insights you may not otherwise gain. It is relatively straightforward to publish a spreadsheet online using Google Docs (see the sidebar on publishing a spreadsheet)

SIDEBAR: How to: publish a spreadsheet online

Google Docs (docs.google.com) is a free website which allows you to create and share documents. You can share them via email, by publishing them as a webpage, or by embedding your document in another webpage, such as a blog post. This is how you share a spreadsheet:

  1. Open your spreadsheet in Google Docs. You can upload a spreadsheet into Google Docs if you’ve created it elsewhere – there is a size limit, however, so if you are told the file is too big try removing unnecessary sheets or columns.
  2. Look for the ‘Share’ button (currently in the top right corner) and click on it.
  3. A drop-down menu should appear. Click on ‘Publish as a web page’
  4. A new window should appear asking which sheets you want to publish. Select the sheet you want to publish and click ‘Start publishing’ (you should also make sure ‘Automatically republish when changes are made’ is ticked if you want the public version of the spreadsheet to update with any data you add.)
  5. Now the bottom half of that window – ‘Get a link to the published data’ – should become active. In the bottom box should be a web address where you can now see the public version of your spreadsheet. If you want to share that, copy the address and test that it works in a web browser. You can now link to it from any webpage.
  6. Alternatively, you can embed your spreadsheet – or part of it – in another webpage. To do this click on the first drop-down menu in this area – it will currently say ‘Web page’ – and change it to ‘HTML to embed in a page’. Now the bottom box on this window should show some HTML that begins with
  7. If you want to embed just part of a spreadsheet, in the box that currently says ‘All cells’ type the range of cells you wish to show. For example, typing A1:G10 will select all the cells in your spreadsheet from A1 (the first row of column A) to G10 (the 10th row of column G). Once again, the HTML below will change so that it only displays that section of your spreadsheet.

Once again, I’d welcome any comments on things I may have missed or tips you can add. Part 5, on mashups, is now available here.

About these ads

27 thoughts on “Data journalism pt4: visualising data – tools and publishing (comments wanted)

  1. Pingback: links for 2010-04-28 « Onlinejournalismtest's Blog

  2. Pingback: links for 2010-04-28 « A Web editor's tale

  3. Pingback: Online Journalism Blog: Visualising data – tools and publishing | Journalism.co.uk Editors' Blog

  4. Gavin Wray

    I really like the Many Eyes tool – straightforward to use and it produces clear results – though it’s worth exploring the terms of use carefully if you want to use it beyond experimentation. In particular, the terms around uploading data to the service can be off-putting. I guess the lesson is experiment with all the tools you can, but check out the details when you want to use a particular tool for public projects.

    Reply
      1. Gavin Wray

        Glad the comment was useful.

        A small group at the Observatory spent some time looking at different visualisation tools in Autumn 2009. The tools vary quite a bit in what you can with them, how you can share your results and how well you can maintain visualisations over time.

        If anyone wants to compare tools, I’ve shared a little comparison spreadsheet to help you decide which tools suit your needs.

      2. Gavin Wray

        Eek, I didn’t share it properly. Fixed now. Anyone should be able to view and edit the doc. I’ve put some anonymised content in there too.

  5. Pingback: links for 2010-05-21 « Onlinejournalismtest's Blog

  6. Pingback: 1001 Medios » Blog Archive » Periodismo de datos: “Everything is data”

  7. Pingback: One ambassador’s embarrassment is a tragedy, 15,000 civilian deaths is a statistic | Online Journalism Blog

  8. Pingback: Data journalism pt3: visualising data – charts and graphs (comments wanted) | Online Journalism Blog

  9. Pingback: One ambassador’s embarrassment is a tragedy, 15,000 civilian deaths is a statistic » Article » OWNI.eu, Digital Journalism

  10. Pingback: One ambassador’s embarrassment is a tragedy, 15,000 civilian deaths is a statistic | Colombo Herald

  11. Pingback: L’embarras d’un ambassadeur est une tragédie, 15.000 victimes civiles une statistique » Article » OWNI, Digital Journalism

  12. Pingback: Data journalism pt3: visualising data – charts and graphs (comments wanted) | Online Journalism Blog

  13. Pingback: How to be a data journalist « beatsnpeace

  14. Pingback: How to be a data journalist | Tech2Crave

  15. Pingback: [Tweets] Science, data, avenir : 1001 liens sur le journalisme | Quand les singes prennent le thé

  16. Pingback: Periodismo de datos | webperiodistas

  17. Pingback: Nenápadný půvab dat, aneb úvod do datové žurnalistiky (II.) – praxe – DATOVÁ ŽURNALISTIKA

  18. Pingback: Private Krankenversicherung kostenlos

  19. Pingback: One ambassador’s embarrassment is a tragedy, 15,000 civilian deaths is a statistic | Online Journalism Blog

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s