Tag Archives: ebooks

Over 1000 journalists are now exploring scraping techniques. Incredible.

Scraping for Journalists book coverLast week the number of people who have bought my ebook Scraping for Journalists passed the 1,000 mark. That is, to me, incredible. A thousand journalists interested enough in scraping to buy a book? What happened?

When I first began writing the book I imagined there might be perhaps 100 people in the world who would be interested in buying it. It was such a niche subject I didn’t even consider pitching it to my normal publishers.

Now it’s so mainstream that the 1000th ‘book’ was actually 12: purchased by a university which wanted multiple copies for its students to borrow – one of a number of such institutions to approach me to do so.  Continue reading

Why I stopped working with print publishers (for a while)

Scraping for Journalists book

This was first published on the BBC College of Journalism website:

I have just spent 10 months publishing an ebook. Not ‘writing’, or ‘producing’, but 10 months publishing. Just as the internet helped flatten the news industry – making reporters into publishers and distributors – it has done the same to the book industry. The question I wanted to ask was: how does that change the book?

Having written books for traditional publishers before, my plunge into self-publishing was prompted when I decided I wanted to write a book for journalists about scraping: the technique of grabbing and combining information from online documents. Continue reading

Hyperlocal Voices: Matt Brown, Londonist

The fifth in our new series of Hyperlocal Voices explores the work done by the team behind the Londonist. Despite having a large geographic footprint – Londonist covers the whole of Greater London – the site is full of ultra-local content, as well as featuring stories and themes which span the whole of the capital.

Run by two members of staff and a raft of volunteers, Editor Matt Brown gave Damian Radcliffe an insight into the breadth and depth of the site. Continue reading

A case study in online journalism part 3: ebooks (investigating the Olympic torch relay)


8000 Holes - How the 2012 Olympic Torch Relay lost its way book cover

In part one I outlined some of the data journalism processes involved in the Olympic torch relay investigation, in part 2 I explained how verification, SEO and ‘passive aggressive newsgathering’ played a role. This final part looks at how ebooks offered a new opportunity to tell the story in depth – and publish while the story was still topical.

Ebooks – publishing before the event has even finished

After a number of stories from a variety of angles I reached a fork in the road. It felt like we had been looking at this story from every angle. More than one editor, when presented with an update, said that they’d already ‘done the torch story’. I would have done the same.

But I thought of a quote on persistence from Ian Hislop that I’d published on the Help Me Investigate blog previously. “It is saying the same true thing again and again and again and again until the penny drops.”

Although it sometimes felt like we might be boring people with our insistence on continuing to dig we needed, I felt, to say the same thing again. Not the story of ‘Executive carries the torch’ but how that executive and so many others came to carry it, why that mattered, and what the impact was. A longform report.

Traditionally there would have been so space for this story. It would be too long for a newspaper or magazine, far too short for a book – where the production timescale would have missed any topicality anyway.

But we didn’t have to worry about that – because we had e-publishing.

It still seems incredible to me that we could write up and publish a book on the missed promises of the Olympic torch relay before the relay had even finished. Indeed: to also publish the day before the book’s main case study was likely to run.

But if we wanted to do that, we had about a week to hit that deadline, with important holes in our narrative, and working largely in our spare time.

First, we needed a case study to represent the human impact of the corporate torchbearers and open our book. Quite a few had been mentioned in local newspapers when they discovered that less-than-inspirational individuals had taken their place, but HMI contributor Carol Miers found one who couldn’t have been more deserving: Jack Binstead had received the maximum number of nominations; he was just 15 (half of torchbearer places were supposed to go to young people – they didn’t); and he was tipped to go to the next Paralympics.

We also needed to find out if there was an impact on the genuinely inspirational people who did get to carry the torch – I had been chasing a couple when Geoff Holt came through the site’s comments (see above). That was our ending.

For the middle we needed to pin down some of the numbers around the relay. Comments from earlier stories had indicated that some people didn’t see why it was important that executives were carrying the torches – unaware, perhaps, that promises had been made about where places would go, and what sort of stories torchbearers should have.

In particular, the organisers had promised that 90% of places would be available to the general public and that 50% of places would go to young people aged 12-24. I had to nail down where each chunk of tickets had gone – and at how many points they had been taken away from availability to the ‘general public’. Ultimately, the middle of the book would describe how that 90% got chipped away until it was more like 75%.

That middle would then be fleshed out with the themes around what happened to the other 25%: essentially some of the stories we’d already told, plus some others that filled out the picture.

Writing in this way allowed us to go beyond the normal way of writing – shock at a revelation – to identifying where things went wrong and how. For all the anger at corporate sponsors for their allocation of torch relay places, it was ultimately LOCOG’s responsibility to approve nominations, to publish 8,000 “inspirational” nomination stories, and to meet the promises that they had made about how they would be allocated. The buck stopped there.

Thanks to the iterative way we had worked so far – publishing each story as it came, asking questions in public, building an online ‘footprint’ that others could find, establishing collaborative relationships and bookmarking to create an archive – we met our deadline.

It was a timescale which allowed us to tap into interest in the relay while it was still topical, and while executive torchbearers were still carrying the torch.

8,000 Holes: How the 2012 Olympic Torch Relay Lost its Way was published on day 66 of the 70-day Olympic torch relay. All proceeds went to the Brittle Bone Society, of which Jack is an ambassador. The publishers – Leanpub – agreed to give their commission on the book to the charity as well. This was all organised over email in 24 hours a couple days before the book went live.

We organised an interview with Jack Binstead which was published in The Guardian the day after – the day that the torch was to go through his home town and the day that he would be flying out of the country to avoid it. An interview with Journalism.co.uk on the ebook itself – Help Me Investigate’s first – was published the same day.

We published data on where torchbearer places went in The Guardian’s datablog two days after that, and serialised the book throughout the week, along with some additional pieces – for example, on how LOCOG missed their target of 50% of places going to young people by other 1,000 places. A lengthier interview with Jack and his mother was published at the end of the week.

In theory this should have captured interest in the torch relay at just the right time – but I think we misjudged two factors.

The first was beyond our control: the weather changed.

Until now, the weather had been awful. When it changed, the mood of the country changed, and there was less interest in the missed promises of the Olympic torch relay. But it also coincided with another change: the final week of the torch relay was also the last few days before the opening ceremony – and as the weather changed, attention shifted to the Olympic Games itself.

The torch relay, which had been squeezed dry of every possible angle for nine weeks, was – finally – yesterday’s news. It was no longer about who was carrying the torch, but about where that torch was going, and who might carry the last one.

Still, the book raised money for a deserving charity, and its story is not over. There’s a long tail of interest to tap into here, which having an ebook increases. When the next torch relay comes around, I wonder, will it benefit from a resurgence of interest?

Get the free ebook for the full story: 8,000 Holes: How the 2012 Olympic Torch Relay Lost its Way - Leanpub.com/8000holes

A case study in online journalism part 3: ebooks (investigating the Olympic torch relay)

8000 Holes - book cover

In part one I outlined some of the data journalism processes involved in the Olympic torch relay investigation, in part 2 I explained how verification, SEO and ‘passive aggressive newsgathering’ played a role. This final part looks at how ebooks offered a new opportunity to tell the story in depth – and publish while the story was still topical.

Ebooks – publishing before the event has even finished

After a number of stories from a variety of angles I reached a fork in the road. It felt like we had been looking at this story from every angle. More than one editor, when presented with an update, said that they’d already ‘done the torch story’. I would have done the same.

But I thought of a quote on persistence from Ian Hislop that I’d published on the Help Me Investigate blog previously. “It is saying the same true thing again and again and again and again until the penny drops.”

Although it sometimes felt like we might be boring people with our insistence on continuing to dig we needed, I felt, to say the same thing again. Not the story of ‘Executive carries the torch’ but how that executive and so many others came to carry it, why that mattered, and what the impact was. A longform report. Continue reading

20 free ebooks on journalism (for your Xmas Kindle)

For some reason there are two versions of this post on the site – please check the more up to date version here.

20 free ebooks on journalism (for your Xmas Kindle) {updated to 65}

Journalism 2.0 cover

As many readers of this blog will have received a Kindle for Christmas I thought I should share my list of the free ebooks that I recommend stocking up on.

Online journalism and multimedia ebooks

Starting with more general books, Mark Briggs‘s book Journalism 2.0 (PDF*) is a few years old but still provides a good overview of online journalism to have by your side. Mindy McAdams‘s 42-page Reporter’s Guide to Multimedia Proficiency (PDF) adds some more on that front, and Adam Westbrook‘s Ideas on Digital Storytelling and Publishing (PDF) provides a larger focus on narrative, editing and other elements.

After the first version of this post, MA Online Journalism student Franzi Baehrle suggested this free book on DSLR Cinematography, as well as Adam Westbrook on multimedia production (PDF). And Guy Degen recommends the free ebook on news and documentary filmmaking from ImageJunkies.com.

The Participatory Documentary Cookbook [PDF] is another free resource on using social media in documentaries.

A free ebook on blogging can be downloaded from Guardian Students when you register with the site, and Swedish Radio have produced this guide to Social Media for Journalists (in English).

The Traffic Factories is an ebook that explores how a number of prominent US news organisations use metrics, and Chartbeat’s role in that. You can download it in mobi, PDF or epub format here.

Continue reading

Bed, knee and breakfast: designing for the iPad

Bed, knee and breakfast: the Bibliotype template

Craig Mod has written a lengthy and well-informed piece on A List Apart about the problems of designing for the iPad and other “browser”-based interfaces. He makes some particularly important points about the differences between products which have a spine as the “axis of symmetry” (e.g. books, magazines), and digital products where the axis is hard to place:

“If the axis of symmetry for a book is the spine, where is it on an iPad? On one hand, designers can approach tablets as if they were a single sheet of “paper,” letting the physicality of the object define the central axis of symmetry—straight down the middle.

“On the other hand, the physicality of these devices doesn’t represent the full potential of content space. The screen becomes a small portal to an infinite content plane, or “infinite canvas,” as so well illustrated by Scott McCloud.”

The core of his article is a design template for long form tablet reading, for which Mod breaks tablet reading distances into three main categories: Bed, Knee, and Breakfast

  • “Bed (Close to face): Reading a novel on your stomach, lying in bed with the iPad propped up on a pillow.
  • “Knee (Medium distance from face): Sitting on the couch or perhaps the Eurostar on your way to Paris, the iPad on your knee, catching up on Instapaper.
  • “Breakfast (Far from face): The iPad, propped up by the Apple case at a comfortable angle, behind your breakfast coffee and bagel, allowing for handsfree news reading as you wipe cream cheese from the corner of your mouth.”

An image of the template in action is shown above. It’s released under the MIT licence.

Although the article is written with ebooks in mind, the principles can obviously also be applied to magazine and news apps. Worth a read.