Local newspaper data journalism – school admissions in Birmingham

data journalism at the Birmingham Mail - school admissions data

The Birmingham Mail has been trying its hand at data journalism with school admissions data. It’s a good place to start – the topic attracts a lot of interest (and so justifies the investment of time) while people tend to be interested in more than just who finishes top and bottom of the tables (justifying the choice of medium).

The results are impressive. Applications data is plotted on a Google map on the main page, while an “interactive chart” page allows you to compare schools across various criteria, and also narrow the sample by selecting from two drop down menus (town and school).

The charts have been made in Tableau, which includes a download link at the bottom. However, you need Tableau itself (free, but PC only) to open it.

A further page features links to tables for each area. Sadly, the pages containing tables do not contain any link to the raw data. This presents an extra hurdle to users – although you can scrape the table into a Google spreadsheet using the =import formula. If you want to see how, here’s a spreadsheet I created from the data by doing just that. Click on the first cell to see the formula that generates it.

I asked David Higgerson, Trinity Mirror’s Head of Multimedia and the man whose name appears on the Tableau data, to explain the process behind the project. It seems the information was a combination of freely available data and that acquired via FOI.

“The Mail took the data available – number of places available, number of first choice applicants and number of total applicants – and worked out a ratio of first choice applicants per place. This is relevant to parents because councils try to allocate places to children based on preference once they’ve decided which schools a child is eligible for. Eligibility varies depending on type of school.

“The figures showed how popular faith schools were, and also how fierce competition was for places at grammar schools. That’s the story which generated most interest.

“As you’ve said on your blog, the hardest part was making the data uniform, and the making it relevant to readers.

“In print, it ran across three days. Day one was grammar schools, day two was all schools and day three revealed how catchment areas for oversubscribed schools which use distance from school to fill their last few places.

“Online, Google Fusion was used to create maps, Tableau for the interactive chart which lets people choose based on town or school, and Tableizer for the quick tables which appear in the section too. We also had a play with Scribble Maps, which we think has real potential for print/online newsrooms.”

It seems education reporter Kat Keogh deserves the credit for spotting the stories in the data, “with the usual support you’d expect in the newsroom – newsdesk etc.”

David and Anna Jeys experimented with the online presentation and others laid out the data for print.

BBC new linking guidelines issued – science journals mentioned

The BBC have just emailed new linking guidelines to their staff. They stipulate that linking is “essential” to online journalism and in one slide (it’s a PowerPoint document) titled ‘If you remember nothing else’ highlight how linking will change:

What we used to do…

  • Lists of archive news stories
  • Homepages only on external websites
  • No inline linking in news stories

What we do now – think adding value…

  • Avoid news stories and link to useful stuff – analysis, explainers, Q&As, pic galleries etc
  • On external websites look beyond homepage to pages of specific relevance
  • Inline linking in news stories is OK when it’s to a primary source

Other points of note in the document include the repeated emphasis on useful deep linking, and the importance of the newstracker module (which links to coverage on other news sites). Curiously, when referring to inline links it does say that “different rules can apply” to BBC blogs – “speak to blogs team if in doubt”.

Something I did look for – and find – was a reference to linking to scientific journals. And here it is: “In news stories inline links must go to primary sources only– eg scientific journal article or policy report (1 or 2 per story; avoid intro)”

This is significant given the previous campaigning on this issue.

On the whole it’s a good set of guidance – I’ll refrain from publishing it in hope that the BBC will…

UPDATE: It seems The Guardian followed up the story and embedded the document, so here it is:

BBC guidelines for linking – Sept 2010

‘Making it findable’ – the creed of the hyperlocal blogger

I’ve written a post over at Podnosh.com (full disclosure: where I do some training and consultancy) on ‘Making it findable’ – the creed of the hyperlocal blogger, reporting on a discussion berween hyperlocal bloggers and local government officials at Hyperlocal Govcamp West Midlands. The meat of what I’m saying is in the middle:

“I noticed a recurring theme from the bloggers’ perspective on their role – something unique to online journalism, and which I’ve commented on before: the duty to make things findable.

“Bloggers repeatedly referred to information about the local democratic process that was hidden away on council websites – and which they worked hard to make available and interesting to their community. Council meeting times; minutes; planning meetings.

“At one point someone said that the bloggers were there to “hold power to account”. Not always in the active sense of posing difficult questions – but also in making the invisible visible; the obscure findable.

“By doing so they are not only shedding a light on the workings of local government, but transferring power. “This is your responsibility”, it says – not “This is my story”.”

There’s a nice comment below saying it “is the closest anyone, including me – has ever got to stating what my blog is about.” Full post here.

Online journalism student RSS reader starter pack: 50 RSS feeds

Teaching has begun in the new academic year and once again I’m handing out a list of recommended RSS feeds. Last year this came in the form of an OPML file, but this year I’m using Google Reader bundles (instructions on how to create one of your own are here). There are 50 feeds in all – 5 feeds in each of 10 categories. Like any list, this is reliant on my own circles of knowledge and arbitrary in various respects. But it’s a start. I’d welcome other suggestions.

Here is the list with links to the bundles. Each list is in alphabetical order – there is no ranking:

5 of the best: Community

A link to the bundle allowing you to add it to your Google Reader is here.

  1. Blaise Grimes-Viort
  2. Community Building & Community Management
  3. FeverBee
  4. ManagingCommunities.com
  5. Online Community Strategist

5 of the best: Data

This was a particularly difficult list to draw up – I went for a mix of visualisation (FlowingData), statistics (The Numbers Guy), local and national data (CountCulture and Datablog) and practical help on mashups (OUseful). I cheated a little by moving computer assisted reporting blog Slewfootsnoop into the 5 UK feeds and 10,000 Words into Multimedia. Bundle link here. Continue reading

Interview: Ton Zijlstra on open data in the EU (audio)

A couple weeks ago I spoke at the PICNIC festival in Amsterdam. While I was there I grabbed an interview with Ton Zijlstra, who has been following open data developments across EU governments very closely. You can find the interview embedded below:

[audio:http://audioboo.fm/boos/186944-ton-zijlstra-on-open-data-in-the-eu.mp3%5D

Something I wrote for the Guardian Datablog (and caveats)

I’ve written a piece on ‘How to be a data journalist’ for The Guardian’s Datablog. It seems to have proven very popular, but I thought I should blog briefly about it if you haven’t seen one of those tweets.

The post is necessarily superficial – it was difficult enough to cover the subject area for a 12,000-word book chapter, so summarising further into a 1,000 word article was almost impossible.

In the process I had to leave a huge amount out, compensating slightly by linking to webpages which expanded further.

Visualising and mashing, as the more advanced parts of data journalism, suffered most, because it seemed to me that locating and understanding data necessarily took precedence.

Heather Billings, for example, blogged about my “very British footnote [which was the] only nod to visual presentation”. If you do want to know more about visualisation tips, I wrote 1,000 words on that alone here. There’s also this great post by Kaiser Fung – and the diagram below, of which Fung says: “All outstanding charts have all three elements in harmony. Typically, a problematic chart gets only two of the three pieces right.”:

Trifecta checkup

On Monday I blogged the advice on where aspiring data journalists should start in full. There’s also the selection of passages from the book chapter linked above. And my Delicious bookmarks on data journalism, visualisation and mashups. Each has an RSS feed.

I hope that helps. If you do some data journalism as a result, it would be great if you could let me know about it – and what else you picked up.

Hyperlocal voices: Adirondack Almanack / John Warren

hyperlocal voices - Adirondack Almanack, John Warren

Following a nomination via the Online Journalism Blog Facebook group, this Hyperlocal Voices looks at a US blog: the Adirondack Almanack, which covers the rural Adirondack region of upstate New York.

Launched in 2005 out of frustration with the lack of coverage from the mainstream media, the site now boasts 20 contributors, “mostly veteran local writers, journalists, and editors and includes media professionals from local radio, magazines, and newspapers,” says founder John Warren. Here’s the full interview with John:

What made you decide to set up the blog?

The Adirondacks is home to the largest park and the largest state-level protected area in the contiguous United States (it’s also the largest National Historic Landmark). The park is over 6 million acres in size (that makes it bigger than Vermont, or Yellowstone, Yosemite, Grand Canyon, Glacier, and Great Smoky Mountains National Parks combined).

However, about half the land is publicly owned and the rest privately owned, including several villages. That mix of public and private land makes the Park a unique area and fodder for some heated discussions over sustainable development, wilderness, environmental and outdoor recreation issues. I felt strongly that local news media was not fully representing the variety of perspectives on these important issues – many of which are important in other parts of the country as well. Continue reading

Hyperlocal voices: Jon Clarke (Beckenhamtown.us)

hyperlocal site Beckenhamtown.us

Jon Clarke launched the UK hyperlocal site Beckenhamtown.us 2 years ago using the social network builder Ning. He sees the site as differing from traditional publishers in offering everyone a free voice, as well as providing a space to play out local debates around issues such as academy schools and parking zones. Here’s the interview in full:

Who were the people behind the blog, and what were their backgrounds before setting it up?

Me, and no one else, I’ve been in digital media at various ad agencies for over 10 years and therefore am au fait with lots of the ways to create and promote a website.

What made you decide to set up the blog?

The main reason was that I thought Beckenham was not well served with a ‘live’ and ‘community’ based website, there just weren’t any for what is quite a neighbourly area for neighbours to talk and share local things.

When did you set up the blog and how did you go about it?

The site was set up in August 2008. I’m not a programmer or web designer so I used the Ning.com community website platform that allows one to cut and paste and move various features around to make a good community site. I then used my knowledge to bring in lots of dynamic content, widgets and RSS feeds to pad out the site and bring it alive.

I wanted to use a co.uk address but it was gone so I plumped instead for a .US address. I thought it best represented who the website was for and about – all of US in Beckenham Town. Continue reading

A brilliant Donald Duck mashup – Right Wing Radio Duck

Jonathan McIntosh of Rebellious Pixels has just published a mashup of Donald Duck cartoons matched to a mashed-up Glenn Beck (of Fox News) voice track, called “Right Wing Radio Duck”.

[youtube:http://www.youtube.com/watch?v=HfuwNU0jsk0%5D

Jonathan has taken dozens of segments from the cartoon archives, and dozens of voice clips from Glenn Back, to create a new jigsaw from existing pieces, satirising the North American Right.

This is work of studio quality. Alternatively, it can be produced by an individual in their bedroom, and can potentially in this case be a career-creating “splash”.

Either way, it demonstrates how high the bar can be raised. It also illustates the advantages of having a liberal set of copyright laws. How difficult would it be to make this in the UK?

Here’s the Youtube blurb:

“This is a re-imagined Donald Duck cartoon remix constructed using dozens of classic Walt Disney cartoons from the 1930s to 1960s. Donald’s life is turned upside-down by the current economic crisis and he finds himself unemployed and falling behind on his house payments. As his frustration turns into despair Donald discovers a seemingly sympathetic voice coming from his radio named Glenn Beck.

“This transformative remix work constitutes a fair-use of any copyrighted material as provided for in section 107 of the US copyright law. “Right Wing Radio Duck” by Jonathan McIntosh is licensed under a Creative Commons BY-NC-SA 3.0 License – permitting non-commercial sharing with attribution.”

As a contrast, this below is an agitprop video produced by Lib Dem campaigners within a few hours of Gordon Brown’s decision to back away from holding an Election in Autumn 2007. This one was made so quickly, that they used a US version of “The Grand Old Duke of York”.

[youtube:http://www.youtube.com/watch?v=l22kHO5jdRU%5D

This video did not circulate outside the political/media community.

Open data meets FOI via some nifty automation

OpenlyLocal generated FOI request

Now this is an example of what’s possible with open data and some very clever thinking. Chris Taggart blogs about a new tool on his OpenlyLocal platform that allows you to send a Freedom of Information (FOI) request based on a particular item of spending. “This further lowers the barriers to armchair auditors wanting to understand where the money goes, and the request even includes all the usual ‘boilerplate’ to help avoid specious refusals.”

It takes around a minute to generate an FOI request.

The function is limited to items of spending above £10,000. Cleverly, it’s also all linked so you can see if an FOI request has already been generated and answered.

Although the tool sits on OpenlyLocalFrancis Irving at WhatDoTheyKnow gets enormous credit for making their side of the operation work with it.

Once again you have to ask why a media organisation isn’t creating these sorts of tools to help generate journalism beyond the walls of its newsroom.