How to: clean up spreadsheet headings that run across multiple rows using Open Refine

Something that infuriates me often with government datasets is the promiscuous heading. This is when a spreadsheet doesn’t just have its headings across one row, but instead splits them across two, three or more rows.

To make matters worse, there are often also extra rows before the headings explaining the spreadsheet more generally. Here’s just one offender from the ONS:

A spreadsheet with promiscuous headings

A spreadsheet with promiscuous headings

To clean this up in Excel takes several steps – but Open Refine (formerly Google Refine) does this much more quickly. In this post I’m going to walk through the five minute process there that can save you unnecessary effort in Excel. Continue reading

Guest post: Student journalists are not “journalists”, they are students #Jcarn

Martin Hirst has written a thoughtful response to my post on the ‘student journalist’ title which he also offered as a guest post. I’m happy to cross-publish it here. You can see my comments on Martin’s version.

A few days ago, my English colleague Paul Bradshaw wrote a piece “There’s no such thing as a ‘student journalist’” on his Online Journalism blog. He argues that there should be no distinction between journalists or students of journalism (presumably training to be employed as journalists after graduation) because they are both publishers of information and the students carry out the actions of journalists — they are effectively “doing” journalism — while they learn the skills, technologies and attitudes of the profession.

Students are experiencing first hand the culture of journalism, the experience of journalism and the social consequences of what they do. Paul writes:

There is no such thing as a ‘student journalist’.

Students of journalism no longer practise their work in the seclusion of a classroom. They do not write solely for lecturers, or even for each other.

Any student on a course with some awareness of the modern media world publishes their own blogs; their student media isaccessible around the world. They contribute to networks, and build communities.

Even if their course provides no opportunities to do any of these things, they will have Twitter accounts, or Facebook accounts.

All of which means that they are publishers.

I don’t disagree with this in principle. Certainly any journalism course worthy of the name would be requiring students to participate in what I like to call “live fire” news exercises. These are usually done under close supervision. However, writing a blog as part of coursework (and for many students it is an onerous requirement of their study, rather than something they enjoy or immediately see the benefits of) is not journalism. Blogging is not journalism and I thought that debate was settled years ago. Continue reading

We’re not just teaching journalists any more – we’re teaching citizens #Jcarn

Kathy Gill has written a rather wonderful post about the public service responsibilities of journalists educated with public money. It’s worth reading in full (that summary doesn’t do it justice) – and I wanted to add my own experiences around a change in journalism education which I only realised a few years ago.

It is often overlooked when people talk about journalism and media degrees that many students realise during the course of their studies that the journalism profession is not for them.

It might be the pay (it often is the pay), the culture, the conflict between perception and reality, or simply that they discover something else they enjoy even more.

For others, the decision is a forced one: circumstances take them into another career – often, again, because the pay in journalism is not enough, either to raise a family on, or to persuade them to switch from the job they got ‘until I break into the media industry’.

All of those people may not be professional journalists, but they remain citizens, and the skills they learned in their studies are there, waiting to be activated.

When they become parents, and they have concerns about the way their child’s school is governed.

When they or their loved ones become ill, and they want to ask why they were not treated with dignity.

When their local community is under threat from development – or the lack of it.

When they or their loved ones are subject to power exercised without responsibility.

It is easy as a lecturer to mentally write off students who ‘will never make it’ in journalism as a profession. But that ignores a broader responsibility: they will always be citizens.

As a society we rely on those people to tell us their stories, to scrutinise power, to suggest the questions that might be asked, and to raise the alarm.

Martin Hirst touches on these issues in his response to my own Carnival of Journalism post, and he’s right: the have a duty of care to teach not only journalists, but citizens too.

There’s no such thing as a ‘student journalist’

Learner plate image by Michael Summers

Learner plate image by Michael Summers

The Carnival of Journalism is back, and this month is looking at student media. That gives me an excuse to talk about something I seem to find myself ranting every year: “You are not student journalists”.

It’s on Twitter profiles, blog ‘about’ pages, LinkedIn profiles and business cards. And it’s an anachronism.

There is no such thing as a ‘student journalist’.

Students of journalism no longer practise their work in the seclusion of a classroom. They do not write solely for lecturers, or even for each other.

Any student on a course with some awareness of the modern media world publishes their own blogs; their student media is accessible around the world. They contribute to networks, and build communities.

Even if their course provides no opportunities to do any of these things, they will have Twitter accounts, or Facebook accounts.

All of which means that they are publishers.

Ignorance is bliss?

Describing yourself as a student journalist suggests that you haven’t noticed this.

But worse, it reinforces a similar ignorance in the people you talk to as you go about your business.

These are the press officers that say “We don’t deal with student journalists” and the election officers who stop you at the doors of the count – but also the sources who say “I didn’t realise what I said was going to be published.”

Journalism students need to be honest with the latter and forceful with the former. A large part of that means making a mental shift from ‘this is just an exercise’ to ‘this is a real story with real implications’. In other words that move from ‘I am a student’ to ‘I am a journalist-publisher’.

Not just an exercise

For a start, as a publisher you have to be aware of contempt of court, libel, and copyright. This is not an option – and the number one reason you can never think your work is ‘just an exercise’.

You also have to think about syndication: who you might supply your content to. I encourage my students to work as freelancers, and often put them in touch with different news organisations depending on the story.

I set up the Birmingham Datablog as just one way of facilitating that, but the ‘teaching hospital’ model of journalism schooling can be misleading: wherever students publish they are part of the same content ecosystem as traditional publishers.

So there is no such thing as a student journalist. There are only publishers, and non-publishers. Your story can be seen by a million people, or only one – but you should always prepare for the former. As should the press officers. And your sources.

So change that Twitter biography; that About page. And take your job seriously: because if you don’t, no one else will.

UPDATE: Martin Hirst replies in a guest cross-post here.

“In my view, if we do not acknowledge the student status of our students (no, that’s not a tautology), we are not being diligent in our duty of care (the pastoral role of all teachers at all levels) to ensure that we “first do no harm”. Yes, we have to, as Paul rightly points out, engage our students in the daily routines and socialisation of newsroom practice and we have to move beyond the newsroom model too; but in doing so, we have to be constantly mindful that our pupils must be kept safe.”

 

This prompted Victoria Baranetsky to publish a response of her own:

“Student journalists who are not afforded the rights of citizens nor the rights of journalists must be given some protection.  Thus, it is important we acknowledge their actions may transcend their status – whatever it may be.”

Should journalists learn how to code? They already do. (And yes, they should)

Shorthand - and you think coding is bad?

Shorthand image by Mike Atherton

So Olga Khazan had a bad experience with learning how to code (more on that later) and Steve Buttry can think of 6 reasons why journalists should learn how to do just that. The zombie debate ‘Should journalists learn to code?’ stiffens and groans once more, so I thought I’d prod it a little.

Journalists already learn to code. In the UK they learn shorthand – possibly the most esoteric code there ever was. We also learn a particular coding language: English. This language is taught in schools and involves using a series of 26 characters to encode objects, actions, and descriptions. You may have a similar language you have to learn in your own country. What a drag.

Why do we learn these languages? To save time, and to improve accuracy – two things that should be important to every journalist. Continue reading

Welcome to journalism. Now delete your history.

Yesterday an 18-year-old journalism student told me he’d deleted his entire Twitter history using TweetDelete. The same day I noticed that another had changed his Twitter username to remove a reference to Newcastle United.

I was not an innocent bystander – I have to admit: I’d sort of advised them to do this…

Full circle in five years

Some history: I’ve been training journalists and student journalists to use Twitter for almost five years now, and have seen an enormous shift in that time.

In those early classes – between 2008 and 2010 – the difficulty was getting people to write more informally: almost no one had a Twitter account, so they approached it as a professional tool, with professionalism very much in mind.

By the third year, however, things were starting to change. By then around half would typically have pre-existing Twitter accounts, and many were using them in a personal capacity. The problem was not using Twitter in the first place, but how to combine the professional with the personal. “Should I have a different account for personal use?” Yes, I used to say.

Now I don’t.

There’s no such thing as a personal Twitter account

I no longer suggest having separate professional and personal accounts because, aside from the difficulty of running two accounts, frankly there is no such thing as a truly personal, even private, account if you are a journalist.

Some manage the balance: Joanna Geary, who maintains @guardianJoanna and @joannaGeary, springs to mind. But Joanna is able to do that because her ‘personal’ account is barely distinguishable from her ‘work’ account: she acts professionally; she talks about things that interest many of the same people who follow her ‘professionally’.

Joanna, in other words, is the exception.

In the movement from one audience (close friends) to another (strangers who may be judging our credibility as reporters) the harsh truth is that we will be judged unfairly against a standard we never anticipated.

And so I ended up showing TweetDelete to a class of 18-year-olds.

And I only had to mention SnapChat, and sexting for them to get it.

Welcome to the world of permanence. Please keep an eye on your past. For the sake of convenience, you may want to delete it (at least TweetDelete will give you an archived copy).

Note: Ross Hawkes has a fascinating exercise on the same subject: he will find tweets by members of the class and present them back to the class with the name removed. What would they think? “But it’s out of context!” Exactly.

Related: Why you might not ever get a job again… if you swear a lot on the internet

That free online data journalism course I’m involved in

I’m happy to announce that I’ll be part of the delivery team for a free data journalism course online early next year that is being hosted by The European Journalism Centre. Continue reading

“I’m rubbish with technology” – an excuse that doesn’t cut it any more

Wall Mural in Yellow Springs, Ohio - image by UGArdener

Rubbish with *which* technology? Image by UGArdener

Last week when I wrote about things you should never say in a newsroom I really wanted to add this one. But I decided it deserved a whole post of its own. I’m talking about people who say…

“I’m rubbish with technology”?

People actually do say this in newsrooms – particularly when they want someone else to do something for them.

But that old excuse is wearing a bit thin now. And it’s time to put a stop to it. Continue reading

It’s not just journalism that has to add more value now – advertising does too

Newspaper ad revenue has gone into a precipitous free fall - image from AEIdeas

image from AEIdeas

There’s a growing awareness in journalism that simply reprocessing content from elsewhere – whether press releases or newswires – isn’t going to be viable in a world where publishers are no longer gatekeepers. ‘Do what you do best and link to the rest‘.

Now advertising seems, finally, to be waking up to the same reality. Continue reading

New ebook now ready! Learn basic spreadsheet skills with Data Journalism Heist

Data journalism book Data Journalism Heist

I’ve written a short ebook for people who are looking to get started with data journalism but need some help.

Data Journalism Heist covers two simple techniques for finding story leads in spreadsheets: pivot tables and advanced filters.

Neither technique requires any formulae, and there are dozens of local datasets (and one international one) to use them on.

In addition the book covers how to follow leads from data, and tell the resulting story, with tips on visualisation and plenty of recommendations for next steps.

You can buy it from Leanpub here. Comments welcome as always.