Reflections on the Birmingham Hacks & Hackers Hackday (#hhhbrum)

map of GP surgeries - first thousand results

Last week I spent a thoroughly fascinating day at a hackday for journalists and web developers organised by Scraperwiki. It’s an experience that every journalist should have, for reasons I’ll explore below but which can be summed up thus: it will challenge the way you approach information as a journalist.

Disappointingly, the mainstream press and broadcast media were represented by only one professional journalist. This may be due to the time of year, but then that didn’t prevent journalists attending last week’s Liverpool event in droves. Senior buy-in is clearly key here – and I fear the Birmingham media are going to left behind if this continues.

Because on the more positive side there was a strong attendance from local bloggers such as Michael Grimes, Andy Brightwell (Podnosh), Clare White (Talk About Local) and Nicola Hughes (Your Local Scientist) – plus Martin Moore from the Media Standards Trust and some journalism students.

How it worked

After some brief scene-setting presentations, and individual expressions of areas of interest, the attendees split into 5 topic-based groups. They were:

The data behind the cancellation of Building for Schools projects
Leisure facilities in the Midlands.
Issues around cervical smear testing
Political donations
And our group, which decided to take on the broad topic of ‘health’, within the particular context of plans to give spending power to GP consortia.

By the end of the day all groups had results – which I’ll outline at the end of the post – but more interesting to me was the process of developers and journalists working together, and how it changed both camp’s working practices.

The work process

This is a genuinely collaborative process – not the linear editorial-and-production divide that so many journalists are used to.

Developers and journalists are continually asking each other for direction as the project develops: while the developers are shaping data into a format suitable for interpretation, the journalist might be gathering related data to layer on top of it or information that would illuminate or contextualise it.

This made for a lot of hard journalistic work – finding datasets, understanding them, and thinking of the stories within them, particularly with regard to how they connected with other sets of data and how they might be useful for users to interrogate themselves.

It struck me as a different skill to that normally practised by journalists – we were looking not for stories but for ‘nodes’: links between information such as local authority or area codes, school identifiers, and so on. Finding a story in data is relatively easy when compared to a project like this, and it did remind me more of the investigative process than the way a traditional newsroom works.

Team roles

Afterwards I wondered what an effective hackday team might consist of, organisationally. Typically hackdays split people into journalists and coders, but in practice the skillsets are more subtle than that. Being a journalist, for example, doesn’t guarantee that you can find datasets; likewise the range of data we came across made it necessary for someone to keep track of it all (social bookmarking helps) and ensure we avoided distraction – but also were able to adapt and change if a more interesting discovery became possible.

So here is an initial attempt to outline the sort of roles a hackday team might work best with:

Coders, obviously – some knowledge of datasets is advantage; having pre-cleaned a dataset even better. For this reason it may be worth using a wiki to lead up to the hackday to identify questions and datasets of interest so that work might be done in advance.
A project manager of sorts. Someone to keep track of the focus of the project and the datasets involved or needed (and the progress with those). Again, a wiki might suit this well – or a Posterous blog so that those outside the hackday can more easily comment.
Someone with computer assisted reporting (CAR) skills to find datasets.
A journalist, blogger or expert who understands the data, e.g. codes, context, etc. – and/or who to speak to to get more data or context
Someone to do visualisation. If no one is able, then an alternative might be to assign someone the role of creating the visualisation and as part of that, researching visualisation tools [LINK] and examples.

I’m probably missing roles – particularly within the subtleties of coding – so please post a comment if you can think of others.

What we did

Broadly we had gathered because of a shared interest in health. In discussion we decided that the key topical element was the plan to hand spending power to GP consortia. I’m not sure whether this was too broad a topic or whether that actually made for a longer-term project that we will return to. The initial thought was that we would create a resource that would become increasingly useful as the handover took place.

The first thing we needed was a clean list of all GP practices in GB. By midday Rob Styles had compiled such a list in RDF format, including location, their relationship to the Primary Care Trust (PCT), and their LSOA (Lower Layer Super Output Area) Code. In addition this had a unique URI for each practice code and a URI for each PCT code. These things are important in linking this data to other datasets.

Clare White, meanwhile, was identifying indicators & data to add on, such as life expectancy, income, etc.

Keith Alexander was turning life expectancy into another RDF with area codes as unique URIs, with a view to match to codes from other area using geocoding.

Martin Moore was also tracking down data; Mark Bentley tracked down PCT and SHA populations; and I used my Twitter network to find other sources, while using advanced search techniques to track down a range of NHS data websites (there are, it appears, a lot), including earnings data and translations of industry jargon. Andrew Mackenzie actually used a telephone, calling people at West Midlands Observatory, getting contextual information and leads for NHS data, and working out ONS subgroup area codes & ONS group areas.

If that paragraph has not completely geeked you out, you’re doing well.

I bookmarked most of the data sources I found at http://delicious.com/paulb/data+health. Here are just a selection:

Online GP practice results database
Health Poverty Index allows you to compare poverty indicators in 2 different areas (click on ‘Tabled data’ on the left to get it in more scrapable form).
NHS Comparators allows you to look at differences in referral and access rates at various levels including GP practice but requires you to register – Andrew Mckenzie found that it would be a week or so before that was processed.
Hospital episodes statistics (HES) and
HESonline
NHS Choices API
West Midlands Public Health Observatory

The results

By the end of the day we had a national map of all 8,000 GP practices (the first 1,000 shown in the image above) – quite an achievement in itself, but not headline-grabbing yet. To give us an editorial angle for the end-of-day presentations, Rob searched for a spreadsheet of mental health prevalence and came up with the headlines that Manchester was the most ‘phobic’ place in Britain, and the Isle of Wight the least.

The next step would be to add GP population information (taken from an FOI response on What Do They Know) and Quality and Outcomes Framework data (scraped from the GP practice results database). There may be ways to match results from the GMC database of medical practitioners to practices too, while FOI requests could add more specific information, and I’ve since been sent links to other potentially interesting datasets.

The Scraperwiki blog gives more detail on what the other projects produced – all of them were hugely impressive. Andy Brightwell blogged about their leisure data project – which “hoped to find out more about the relationship between population density, deprivation and the provision of leisure facilities” here. And Michael Grimes has created a page using HTML5 to visualise how the majority of the cancelled Building Schools for the Future were in Labour-controlled areas.

Both links provide further insight into the possibilities of data journalism in just one day.

Thankfully, it’s not going to be just one day. Podnosh have already organised a Speed Data event, and there are further hack days planned for Birmingham – specifically around health – by Talis and NHSlocal.

8 thoughts on “Reflections on the Birmingham Hacks & Hackers Hackday (#hhhbrum)”

Peter Demain July 27, 2010 at 2:58 pm

Something about a Liverpool Echo journalist complimenting others who came along to this event as ‘very bright and talented’ is laughably ironic. The Echo itself – like the Hull Daily Mail, Manchester Evening News – and other rags have precious little talent within them themselves; they don’t want to know it since talent likes to express itself – and they disregard expression.

For those who aren’t familiar with the Echo; it is like what Orwell referred to in a famous writing that projected a future for the working class; newspapers full of “crime, sport, astrology and intrigue” – I’m misquoting though this is mostly correct.

It’s a poor rag, based in Runcorn as they don’t really give a crap for Liverpool itself what with being one of Mirror Group’s rags. Money takes precedent over principle as per the usual – but their circulation took a big hit as a result of this. It really isn’t ever beyond the quality of the Mirror, and completely misses news that is valuable just like any churnalist factory.

If any Birmingham journo deigns to take his tips from that place in a ‘hackday’ then he should probably not even be in the profession in the first place.

– Pete @ dirtygarnet.com

Reply ↓
1. Paul Bradshaw July 27, 2010 at 3:02 pm
  
  A bit harsh to write off every employee of a company that way, Pete. We’re not defined by our employers – now that would be Orwellian.
  
  Reply ↓
Peter Demain July 27, 2010 at 3:15 pm

I never quite said that: Doesn’t it strike you as ironic? Cashing paycheques from a local bastion of churnalistic demerit one minute whilst masquerading as an educator of ‘talented, bright’ hacks the next?

Also the term ‘Orwellian’ is refers to totalitarian. He actually did many esssays on soceity as well as those famous satirical books versus authority…so it’s a bit of a stretch to connote that he means that this style of journalism compares to the Orwellian pejorative. As opposed to being just simple rubbish.

A lot of people who ostensibly ‘teach’ people about journalism in these events need to practice what they preach. If they do not then they’re either putting on a false show of things; are garden-variety hypocrites; or more likely both.

-Pete @ dirtygarnet.com

Reply ↓
simon gray July 27, 2010 at 4:49 pm

@peter – ok, so you’ve made your comment about the Liverpool Echo (& Trinity Mirror Group generally) – what did you think of the actual event itself?

Reply ↓
Peter Demain July 27, 2010 at 5:41 pm

To a great extent I believe fine journalists, like most good writers, are born and not made. Prior to all these events cropping up you actually had journalism in a better shape than at present. Now we’ve a trade bloated with filler columnists, people who never leave the office to gather/report, not to mention the soul-sucking PA wire vigil so many keep – this and technological change means circulations are falling with a certain pigheadedness present in many rags that value quantity over quality.

If you are a young talent then it really is upto you to read up on old school reportage. Quality stuff, stuff that requires going out and snapping photos, interviewing people in friendliness with your press card for legimatacy’s sake. It isn’t all that ‘hard’ to do, it’s just being economically viable and making enough money to do it for all the time invested. In a nutshell: Getting old school journalism which takes time and effort to keep oneself fed, watered, housed and with a modestly comfortable standard of living is just too hard for youngsters starting out nowadays.

I’m biased in this direction since I’m in that camp myself; I personally view the cheap, easy and cannibilizing culture of local and national press as an illness that journalism may not recover from in my lifetime. If you’re a News Corp/Trinity Mirror editor who values profit over quality and principles then rarely do I sell you material because of what I just outlined – so basically I’m marginalized, and that’s the story among many journos who value personal investigative stuff away from the desk and wire.

So I disregard these events because you need to go out and get experienced. Skills like those in old school type gathering cannot be taught in some seminar by some posturing hack from the Liverpool Echo or any other urine-poor newspaper simply because they don’t really know or endorse good journalism themselves.

Pete @ dirtygarnet.com

Reply ↓
1. simon gray July 27, 2010 at 6:04 pm
  
  So what you’re saying is, you’re passing judgment about an event you never actually attended? That doesn’t seem like very good journalism to me.
  
  Reply ↓
2. Paul Bradshaw July 27, 2010 at 7:08 pm
  
  I clearly haven’t communicated the event effectively enough, Peter, since it wasn’t a “seminar” and the Liverpool event wasn’t taught by Liverpool hacks. In fact, the day is about actually doing journalism instead of talking about it, in other words “going out and getting experienced” as you recommend. It means poring through official documents, FOI responses, talking to sources, and getting to the heart of the issue. In the short term that means a story or two; longer term it means allowing people to scrutinise power themselves without a middle man.
  
  Reply ↓
Peter Demain July 27, 2010 at 6:20 pm

Well you probably knew that already since I’d have already mentioned it had I been there; tad redundant to facetiously point that out don’t you think? Paul never posted this just for attendees to comment on.

I have been to similar functions where journos meet up to share stuff and tips although none have received the moniker of ‘hackday’ – but these things in my experience are a close mimickry of a chat over a couple of pints in a pub minus the alcohol. I don’t learn much from talk with others over the generalized subject of ‘good journalism’, some is inevitably irrelevant, and often we just drift onto other topics. Some get a little more from it perhaps…but I’d never term it a valuable day towards one’s efforts anymore than a social call in the boozer would be.

By default it isn’t ‘good journalism’ to report on some obscure meeting between those in the trade since it has limited audience interest. Smacks of conceit and pretence: That’s why Paul’s article links to a local rag’s blog and not a more widely read publication.

-Pete @ dirtygarnet.com

Reply ↓

Online Journalism Blog

Comment, analysis and links covering online journalism and online news, citizen journalism, blogging, vlogging, photoblogging, podcasts, vodcasts, interactive storytelling, publishing, Computer Assisted Reporting, User Generated Content, searching and all things internet.

Reflections on the Birmingham Hacks & Hackers Hackday (#hhhbrum)

How it worked

The work process

Team roles

What we did

The results

8 thoughts on “Reflections on the Birmingham Hacks & Hackers Hackday (#hhhbrum)”

Leave a comment Cancel reply

How it worked

The work process

Team roles

What we did

The results

Share this:

Related

8 thoughts on “Reflections on the Birmingham Hacks & Hackers Hackday (#hhhbrum)”

Leave a comment Cancel reply