There have been quite a few scraping-related stories that I’ve been meaning to blog about – so many I’ve decided to write a round up instead. It demonstrates just the increasing role that scraping is playing in journalism – and the possibilities for those who don’t know them:
Scraping company information
Chris Taggart explains how he built a database of corporations which will be particularly useful to journalists and anyone looking at public spending:
“Let’s have a look at one we did earlier: the Isle of Man (there’s also one for Gibraltar, Ireland, and in the US, the District of Columbia) … In the space of a couple of hours not only have we liberated the data, but both the code and the data are there for anyone else to use too, as well as being imported in OpenCorporates.”
OpenCorporates are also offering a bounty for programmers who can scrape company information from other jurisdictions.