New UK site launches to tackle lobbying data

Who's Lobbying treemap

I’ve been waiting for the launch of Who’s Lobbying ever since they stuck up that little Post-It note on a holding page in the run-up to the general election. Well now the site is live – publishing and visualising lobbying data, beginning with information about “ministerial meetings with outside interests, based on the reports released by UK government departments in October.”

This information is presented on the homepage very simply: with 3 leaderboards and a lovely search interface.

Who's Lobbying homepage

There are also a couple of treemaps to explore, for a more visual (and clickable) kick.

These allow you to see more quickly any points of interest in particular areas. The Who’s Lobbying blog notes, for instance, that “the treemap shows about a quarter of the Department of Energy and Climate Change meetings are with power companies. Only a small fraction are with environmental or climate change organisations.”

It also critically notes in another post that

“The Number 10 flickr stream calls [its index to transparency] a “searchable online database of government transparency information”. However it is really just a page of links to department reports. Each report containing slightly different data. The reports are in a mix of PDF, CSV, and DOC formats.

“Unfortunately Number 10 and the Cabinet Office have not mandated a consistent format for publishing ministerial meeting information.

“The Ministry of Defence published data in a copy-protected PDF format, proventing copy and paste from the document.

DEFRA failed to publish the name of each minister in its CSV formatted report.

“The Department for Transport is the only department transparent enough to publish the date of each meeting.

“All other departments only provided the month of each meeting – was that an instruction given centrally to departments? Because of this it isn’t possible to determine if two ministers were at the same meeting. Our analysis is likely to be double counting meetings with two ministers in attendance.

“Under the previous Labour government, departments had published dates for individual meetings. In this regard, are we seeing less transparency under the Conservative/Lib Dem coalition?”

When journalists start raising these questions then something will really have been achieved by the open data movement. In the meantime, we can look at Who’s Lobbying as a very welcome addition to a list of sites that feels quite weighty now: MySociety’s family of tools as the grandaddy, and ElectionLeaflets.org (formerly The Straight Choice), OpenlyLocal, Scraperwiki, Where Does My Money Go? and OpenCharities as the new breed (not to mention all the data-driven sites that sprung up around this year’s election). When they find their legs, they could potentially be quite powerful.

Hyperlocal Voices: Hedon Blog (Ray Duffill)

Hyperlocal voices: Hedon Blog

The Hedon Blog covers communities in Hedon, East Yorkshire. Established by Ray Duffill at the beginning of last year, he has since gone on to launch the HU12 site as well. This post is part on the ongoing Hyperlocal Voices series.

Who were the people behind the blog, and what were their backgrounds?

I set the Hedon Blog up after being made redundant from a career in Community Development.

What made you decide to set up the blog?

The Hedon Blog was set up as a hobby to keep my ‘hand-in’ with new social media tools I’d discovered on the web whilst working in my previous job as a Community Development Manager in Blackpool.

Specifically, I wanted to find out if Hedon had any community and voluntary groups operating in the area. On the surface it seemed that very little community activity was going on in the town. That was my initial impression and a view shared by neighbours and relatives who had lived in the area much longer.

The process of setting up the blog and nurturing its development has enabled me to re-discover my home town. Hedon is no longer just the place I live – it’s a place I’m proud of and love!

When did you set up the blog and how did you go about it?

I set up the blog on WordPress.com. It took me two minutes to set up and think of the highly original “Hedon Blog” title.

The first post was written in February 2009. I pressed ‘publish’ and thought “What next?”. I had no plan and no real objectives or goals to aim towards. This is not a model to follow!

Using my legs, eyes and ears I explored and unearthed the ‘undiscovered country’ of a small but thriving community infrastructure in the town. I reported back on my findings on the blog. And, as the ‘word-of-mouth’ spread, then people began sending me in notices of community events and other activities in the town.

What other blogs, bloggers or websites influenced you?

Whilst working in Blackpool I had found about Nick Booth‘s (Podnosh) ‘Social Media Surgeries‘ taking place in Birmingham. Inspired by those, I made an early commitment that I would only use social-media tools that were free, easy to use and share, and that could be easily taught to others.

The internet should be about liberating community news and information. I abide by these ideas with the Hedon Blog. Any community can do what I do – you don’t need shed-loads of dosh in order to obtain an effective online voice. Having financial backing and a friendly geek obviously helps – but they are not essential.

The next major influence was Talk About Local and its first ‘un-conference’ in Stoke. From being an isolated individual I was suddenly part of a major phenomenon that involved people from across the country and the world. We even had a name for what we were doing – hyperlocal!

Adam Westbrook has been the other major influence on the blog’s development. I heard him speak and was inspired by his views on the future of journalism.

Locally, in Hull, digital developer Jon Moss has helped through setting up Hull Digital. Individuals met through this network have offered me enormous encouragement and support.

How did – and do – you see yourself in relation to a traditional news operation?

I obtained Adam Westbrook’s e-book on Newsgathering for Hyperlocal Websites and now run the site as a news gathering operation.

Learning from some of the journalistic methods described in that publication has enabled me to put the blog on a professional footing and achieve a credibility in the eyes of public and private sector organisations (as well as voluntary and community groups) who now regularly supply me with press releases and other material.

In this sense I have ‘borrowed’ from the traditional media those things that can help me promote, inform and help build communities in my town.

What have been the key moments in the blog’s development editorially?

The Hedon Blog now sits as part of a wider website family under the www.hu12.net banner. This means I can concentrate community news via the Hedon Blog but now have an outlet for more contentious and controversial material – and a means to obtain some advertising income.

What sort of traffic do you get and how has that changed over time?

I have grown a local audience largely by word of mouth. In my fist month of operation I got 213 visits (WordPress stats) but get those figures and more every day now with occasional daily spikes of over 500 – 800 visits.

I never approached this from a business or journalist point of view – but rather as a civic duty or community activity. The downside of this approach is the obvious: a lack of income to re-invest in the project and to pay for its main motivating force: Me!

This activity has brought me great pleasure but has been draining on time and personal resources.

An exercise in interactive thinking

I’ve just run through an exercise with my class of students from the MA in Television and Interactive Content at Birmingham City University. The exercises are intended to get them to think about the web as more than just a repository of content, but a platform that people use in different ways depending on who they are and what they want to do.

I thought I would share them here – for my own record if nothing else.

What’s your topic – who is your userbase?

Most editorial productions begin with a topic, and an angle on that topic. They also have a particular audience in mind, which dictates the tone that is taken in its production: a documentary aimed at 5-year-olds is going to have a very different tone to one aimed at 25-year-olds, even if the topic is the same.

This gives you the People bit of the POST method I’ve written about previously – the starting point for everything that follows.

From there then you can identify

  • The objectives of those people*.
  • How you might help those people meet those objectives (the strategy and technology)
  • Which of those strategies match your own objectives – or those of the person you’re pitching to

*In the original POST method the objectives are yours, but I would suggest starting with users’ objectives because you need a mutually beneficial outcome.

Here’s how it works in practice: Continue reading

Data cleaning tool relaunches: Freebase Gridworks becomes Google Refine

When I first saw Freebase Gridworks I was a very happy man. Here was a tool that tackled one of the biggest problems in data journalism: cleaning dirty data (and data is invariably dirty). The tool made it easy to identify variations of a single term, and clean them up, to link one set of data to another – and much more besides.

Then Google bought the company that made Gridworks, and now it’s released a new version of the tool under a new name: Google Refine.

It’s notable that Google are explicitly positioning Refine in their video (above) as a “data journalism” tool.

You can download Google Refine here.

Further videos below. The first explains how to take a list on a webpage and convert it into a cleaned-up dataset – a useful alternative to scraping:

The second video explains how to link your data to data from elsewhere, aka “reconciliation” – e.g. extracting latitude and longitude or language.

What inflation has to do with the price of fish

Inflation image by Gregor Rohrig

Inflation image by Gregor Rohrig - click to see source

One of the forms of data that journalists frequently have to deal with is prices. And while it’s one thing to say that things are getting more expensive, making a meaningful comparison between what things cost now and what things cost then is a different kettle of fish altogether.

Factoring in inflation can make all the difference between arbitrary comparisons that provide no insight whatsoever, and genuinely meaningful reporting.

Thanks to computing power it’s actually quite easy for journalists to factor inflation into their reporting – by using an inflation calculator. It’s also easier to find historical price data with data-driven search engines like Wolfram Alpha.

But inflation is only half of the calculation you need. The other is earnings.

Professor Ian Stewart illustrates this perfectly in this article in The Telegraph:

“[A] 1991 pint cost around £1.40, which is £1.80 in today’s money. The current price is around £2.80, so beer really is more expensive. On the other hand, the average salary in 1991 was £19,000, and today it is £38,000. Relative to what we earn, a pint costs exactly the same as it did 19 years ago.

“Our house? That would be £125,000 today, so it has gone up by 84 per cent. Relative to average earnings, however, the increase is only 10 per cent.

“The Guardian knows about inflation, and said that the pub pint has increased by 68 per cent in real terms. But this compares the real increase in new money with the original price in old money. If I did the calculation like that for my house it would have gone up by 850 per cent. Calculated sensibly, the rise in the price of beer is about 55 per cent relative to inflation, and zero per cent relative to earnings.”

Of course the danger in averages is that they only illustrate aggregate change, and if you’re talking about a purchase that a particular section of the population makes – or you’re only talking to a particular region – then a national average may not be as meaningful a comparison to make.

If the poor are getting poorer and the rich richer then a pint of beer really is more expensive for some – and cheaper for others – than it used to be. Likewise, particular parts of the country might be suffering more from house price increases than others because of local average wages and local house prices.

It’s also worth pointing out that, when talking about financial data, a median is a much more useful measure to take than a mean.

Finally, aside from the statistical considerations it’s worth coming back to some of the basics of pricing. Ian again:

“There are two things to remember about prices. One is basic economics: if something gets too expensive for people to buy it, they don’t. So prices and wages have to stay in step, broadly speaking – though with big fluctuations in some commodities, such as housing. The other is inflation. We all know it exists, but we forget that when we start comparing prices. ‘My God! A Ford Anglia cost only £295 in 1940!’ True, but the average salary then was £370. The equivalent price today is £30,000, which will buy you a Jaguar XF.”

Cooks Source anger moves on to Dairy Goat Journal’s Dave Belanger

Cooks Source fake Facebook page discusses Dairy Goat Journal

UPDATE 2 – from Cathy in the comments (Nov 11): Dave Belanger has now paid the fee.

UPDATE – thanks to Vicki in the comments (Nov 11): Dave Belanger has responded to Suzanne, reinstating the image on their website with a credit and link, and offering to pay. However, he has refused to pay the amount requested by Suzanne, and Suzanne is now planning to take the magazine to court. Her reasoning is admirable, and it’s fair to say that contributions of commenters have helped her to make a well-informed stance:

“Countryside Publications is a five million dollar company. He accused me of being opportunistic by asking for an increased fee for the unauthorized and uncredited use.

“This is not about money. I may never see the $2100. If I do, it will be a long time from now. If I wanted to make a quick buck, I’d take the $500 [offered]. (I could use it.) But if I let him not only steal the photo but pay no penalty for it, there’s no reason for him to not steal again. After all, what did it cost him? He can steal photos all he wants and only pay for them (at a price he sets) if he’s caught. Just who is opportunistic? He published my photo without authorization or credit then says, here, take $500 or NOTHING.”

There’s also some detail about the possible impact on the publishers from Internet users:

“P.S. He mentioned receiving phone calls and emails from my readers and said he was not concerned about it. He admitted there had also been some subscription cancellations, but that people cancelled subscriptions and started subscriptions every day and that he had no reason to believe any subscription cancellations were related to his treatment of my work.”

The original post:

Oh dear. It appears another magazine editor is about to feel the force of a thousand emails following a blogger’s complaint of breach of copyright and – more importantly – said editor’s response to their request for fair payment and acknowledgement of authorship.

The editor in question is Dave Belanger who – apparently – hung up on Suzanne McMinn when she called to ask that her photo – used in Dairy Goat Journal – was properly credited.

With 80 comments already – many of them saying they have called and written to the magazine – and the case also being discussed on the fake Cooks Source Facebook page – you can only hope Dave looks at the Cooks Source and reacts quickly.

*All about this that I can find looks credible, but I’m extra cautious of this being an opportunistic hoax.

via Ulrike in the comments.

Facebook ‘mentions’ – a ‘talking point’ engine

Facebook mentions mockup

Mockup by Oliver Chiang - click to see original article

Oliver Chiang reports on a new feature being tested by Facebook that he dubs ‘mentions’. It tells you if more than one of your friends mentions the same thing – a celebrity, for example, or an event, or organisation.

I’ve often said that Facebook is the ultimate news publisher. ‘What my friends are doing’ was powerful enough, and they have since expanded their editorial proposition (with Pages) into ‘What companies and personalities I am interested in are doing’ too. The ‘mentions’ feature appears to extend that concept further, essentially identifying what is a ‘talking point’ in your social circle and circles of interest, and bringing that to your attention.

That is, of course, one of the things that journalists do.

Of additional interest are the obvious commercial applications of this technology. In fact, the focus at the moment on linking these ‘talking points’ to official ‘Pages’ drives that commercial application home rather forcefully. This overly-commercial application may in fact be a weakness – and it will be interesting to see if it is tweaked before being rolled out (Facebook’s history suggests they are more likely to tweak it than withdraw it).

In the meantime, publishers should be watching closely to see how advertisers respond to the potential of ‘mentions’ – and if there is any way they might adapt similar ideas for their own users and advertisers.

Hyperlocal voices: Mike Rawlins, Pits N Pots (Stoke)

The Hyperlocal Voices series continues with a look at Pits n Pots, a site with its own Wikipedia entry. The site – set up in frustration at the lack of an opeb public forum in the local media – is frequently given as an example of the best of hyperlocal blogging.

Who were the people behind the blog?

Tony Walley & Mike Rawlins, we don’t have interesting ‘job’ titles, we are simply Mike & Tony.

Tony Walley is a company director & broadcaster. Tony has run a successful aluminium stockholding firm in Staffordshire for around 20 years and is politically active. He has worked as a local radio broadcaster mainly covering sport, off and on for around 10 years.

Mike Rawlins is a serial web meddler who had been working in Transport & Logistics for around 18 years. He decided to leave the rat race and to live the dream of being a photographer and a full time serial web meddler 2008.

We have 3 other casual writers who cover various subjects for us.

What made you decide to set up the blog?

The coverage of local politics in the local media at the time was poor to say the least. Commenting on anything that was published in the local media was subject to a policy of censorship rather than moderation. Tony decided that there needed to be a forum where people could discuss local politics in an adult manner without being censored.

When did you set up the blog and how did you go about it?

Tony Walley and I founded the site in September 2008.

What other blogs, bloggers or websites influenced you?

Pits n Pots was one of the first sites in the political scrutiny forum so it is difficult to name other sites as being influencers.

How did – and do – you see yourself in relation to a traditional news operation?

We like to think we have the same ethos as traditional news operations, wanting to provide news and views to interested parties, but without the need to sex up or sensationalise stories.

We don’t carry any significant advertising and don’t need to sell papers.

We research and write our articles because we feel there is a gap in the market.

How are we different? We run very light and are able to react to stories very quickly where the local press can only really publish once a day even on their website. We are able to be far more experimental with new platforms and technologies than the traditional media.

How are we the same? – We are the same in so far as we have a desire to put news in to the public domain.

What have been the key moments in the blog’s development editorially?

The article about the Polish Spitfire being used by the BNP was quite a big one that made all the national press.

Our coverage of the EDL rally in Stoke-on-Trent was another key moment. This was the first time we really worked closely with the police. Our YouTube channel was the second most viewed news channel for 2 days after the rally.

What sort of traffic do you get and how has that changed over time?

Depending on the stories on the day we can get anywhere between 3000 & 5000 unique visitors each day.

Flow vs stock – and how people consume news online

I’ve only just come across this post by Robin Sloan applying the economic concepts of flow and stock to online news:

There are two kinds of quantities in the world. Stock is a static value: money in the bank, or trees in the forest. Flow is a rate of change: fifteen dollars an hour, or three-thousand toothpicks a day. Easy. Too easy.

But I actually think stock and flow is the master metaphor for media today. Here’s what I mean:

Flow is the feed. It’s the posts and the tweets. It’s the stream of daily and sub-daily updates that remind people that you exist.

Stock is the durable stuff. It’s the content you produce that’s as interesting in two months (or two years) as it is today. It’s what people discover via search. It’s what spreads slowly but surely, building fans over time.

Sloan’s argument is that ‘flow’ – constant updates – is very much the focus of attention at the moment, but that ‘stock’ should not be neglected.

It’s a very appealing metaphor – not least because research suggests that users’ consumption of online news matches it. A previous study by Associated Press backs Sloan’s argument, finding that users were overwhelmed by constant updates but hungered for more depth. It recommended that publishers invest more resources in deeper reporting – in other words, in ‘stock’.

Pablo Boczkowski’s new book, News At Work, explores many of these issues too. In looking at how people consume news at work he identifies how users check the news online:

“The first visit [of the day] to an online news site is undertaken routinely in a double sense: each individual usually conducts this visit during the same part of the day and looks at news sites according to a navigation process that varies little from one day to the next …

“Subsequent visits differ from the initial visit in several dimensions. They take place at nonroutine times and intervals. They do not occur at fixed times of the day … but depend on the availability of downtime …

“These visits are often motivated by the need for a distraction or for more information after learning about an event … Subsequent visits … are not comprehensive, methodical, long, or marked by the strong presence of print[-originated] content. Instead they are limited, disorganised, brief and focused on breaking news or some other form of novel content.”

This is the focus on flow. However, it’s hard to tell whether that user behaviour might change if some of the context surrounding it changed. For example, Boczkowski’s focus is on consumption at work, where users are taking a quick break from what they should be doing. Elsewhere in the same chapter he identifies how many users avoid consuming news online outside of work because the computer reminds them of their work. Would this change with news consumption on more consumer-focused platforms such as mobile phones, tablets, web-enabled televisions and games consoles? Similarly, as the nature of work itself changes and more people work from home, will that change their behaviour? (Boczkowkski’s own findings suggest this is indeed the case for those people).

h/t Noah Brief.

On publishing – and deleting – allegations online

TechCrunch’s Paul Carr has a thoughtful piece on “cyber-vigilantism” where citizens witness or experience a crime and go online to chase it down, name the alleged perpetrators, or pressure the authorities out of complacency:

“[W]hen that naming happens, the case is over before it’s begun: no matter whether the accused is guilty or innocent, they are handed a life sentence. Until the day they die, whenever a potential employer or a new friend Googles their name – up will come the allegation. And, prison terms notwithstanding, that allegation carries the same punishment as guilt – a lifetime as an unemployable, unfriendable, outcast. There’s a reason why the Internet is a great way to ruin someone with false allegations – and it’s the same reason why falsely accused people are just as likely to harm themselves as guilty people.”

The post was written after TechCrunch decided to delete a story about an alleged sexual assault and is a useful read in provoking us as journalists in any medium to reflect on how we treat stories of this type.

There are no hard rules of course, and associated legal issues vary from country to country.

In the Judith Griggs case, for example, was I right to post on the story? My decision was based on a few factors: firstly, I was reporting on the actions of those on her magazine’s Facebook page, rather than the ‘crime’ itself (which was hardly the first time a publisher has lifted). Secondly, I waited to see if Griggs responded to the allegations before publishing. Thirdly, I evaluated the evidence myself to see the weight of the allegations. Still, I’d be interested in your thoughts.