Training: scraping in the Netherlands

I’m delivering a course on scraping in Utrecht, in the Netherlands, on April 2. The booking page, with more details about the location and so on, is here. A broad breakdown of the course is below:

  • Scraping for journalism: ideas and examples
  • Scraping basics: finding structure in HTML and URLs; what’s possible with programming
  • Simple scraping jobs: how to write a basic scraper in 5 minutes
  • Scraping tools: OutWit Hub and Import.io
  • How to scrape dozens of public webpages
  • Scraping databases with empty searches
  • How to understand scrapers on ScraperWiki: scraping PDFs, lists of URLs, and databases with specific searches
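
To give a flavour of what "finding structure in HTML" and "a basic scraper" mean in practice, here is a minimal sketch in Python. It is not taken from the course materials: the sample table, the `TableScraper` class and the `scrape_rows` function are all made up for illustration, and a real scraper would download the page rather than use a hard-coded string.

```python
from html.parser import HTMLParser

# Hypothetical sample page (an assumption for this sketch);
# a real scraper would fetch this HTML from a URL instead.
SAMPLE_HTML = """
<table>
  <tr><th>Name</th><th>City</th></tr>
  <tr><td>Anna</td><td>Utrecht</td></tr>
  <tr><td>Ben</td><td>Amsterdam</td></tr>
</table>
"""


class TableScraper(HTMLParser):
    """Collects the text of every table cell, grouped by row."""

    def __init__(self):
        super().__init__()
        self.rows = []         # finished rows, each a list of cell strings
        self._row = None       # the row currently being read, if any
        self._in_cell = False  # True while inside a <td> or <th>

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._in_cell = True
            self._row.append("")

    def handle_endtag(self, tag):
        if tag in ("td", "th"):
            self._in_cell = False
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._in_cell = False

    def handle_data(self, data):
        # Only keep text that sits inside a table cell.
        if self._in_cell and self._row:
            self._row[-1] += data.strip()


def scrape_rows(html):
    """Return the table rows found in an HTML string as lists of text."""
    parser = TableScraper()
    parser.feed(html)
    parser.close()
    return parser.rows


if __name__ == "__main__":
    for row in scrape_rows(SAMPLE_HTML):
        print(row)
```

The idea is the same whatever tool you use: spot the repeating structure (here, table rows and cells) and pull out the text that sits inside it.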