<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: SFTW: Scraping data with Google Refine</title>
	<atom:link href="http://onlinejournalismblog.com/2012/01/13/sftw-scraping-data-with-google-refine/feed/" rel="self" type="application/rss+xml" />
	<link>http://onlinejournalismblog.com/2012/01/13/sftw-scraping-data-with-google-refine/</link>
	<description>A conversation.</description>
	<lastBuildDate>Mon, 21 May 2012 12:30:09 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
	<item>
		<title>By: Paul Bradshaw</title>
		<link>http://onlinejournalismblog.com/2012/01/13/sftw-scraping-data-with-google-refine/#comment-371045</link>
		<dc:creator>Paul Bradshaw</dc:creator>
		<pubDate>Fri, 20 Apr 2012 15:56:11 +0000</pubDate>
		<guid isPermaLink="false">http://onlinejournalismblog.com/?p=15674#comment-371045</guid>
		<description>Good point - thanks for pointing that out.</description>
		<content:encoded><![CDATA[<p>Good point &#8211; thanks for pointing that out.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Thad Guidry</title>
		<link>http://onlinejournalismblog.com/2012/01/13/sftw-scraping-data-with-google-refine/#comment-370995</link>
		<dc:creator>Thad Guidry</dc:creator>
		<pubDate>Fri, 20 Apr 2012 13:39:22 +0000</pubDate>
		<guid isPermaLink="false">http://onlinejournalismblog.com/?p=15674#comment-370995</guid>
		<description>Hi Paul,

Incidentally, the reason that you probably had the .0 on the end of the ID numbers might have been that perhaps, upon import of your spreadsheet of IDs, you forgot to uncheck the importer option to Parse as numbers ?</description>
		<content:encoded><![CDATA[<p>Hi Paul,</p>
<p>Incidentally, the reason that you probably had the .0 on the end of the ID numbers might have been that perhaps, upon import of your spreadsheet of IDs, you forgot to uncheck the importer option to Parse as numbers ?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: SFTW: Scraping data with Google Refine - Just another My blog Sites site - technologycreditunionsunnyvale</title>
		<link>http://onlinejournalismblog.com/2012/01/13/sftw-scraping-data-with-google-refine/#comment-313272</link>
		<dc:creator>SFTW: Scraping data with Google Refine - Just another My blog Sites site - technologycreditunionsunnyvale</dc:creator>
		<pubDate>Tue, 31 Jan 2012 11:47:38 +0000</pubDate>
		<guid isPermaLink="false">http://onlinejournalismblog.com/?p=15674#comment-313272</guid>
		<description>[...] In this instance,Read more&#8230;  Read more: [...]</description>
		<content:encoded><![CDATA[<p>[...] In this instance,Read more&#8230;  Read more: [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: SFTW: Scraping data with Google Refine &#124; Online Journalism Blog &#124; Computational and Data Journalism &#124; Scoop.it</title>
		<link>http://onlinejournalismblog.com/2012/01/13/sftw-scraping-data-with-google-refine/#comment-292632</link>
		<dc:creator>SFTW: Scraping data with Google Refine &#124; Online Journalism Blog &#124; Computational and Data Journalism &#124; Scoop.it</dc:creator>
		<pubDate>Wed, 18 Jan 2012 04:51:38 +0000</pubDate>
		<guid isPermaLink="false">http://onlinejournalismblog.com/?p=15674#comment-292632</guid>
		<description>[...] jQuery(&quot;#errors*&quot;).hide(); window.location= data.themeInternalUrl; } }); }        onlinejournalismblog.com  - Today, 5:51 [...]</description>
		<content:encoded><![CDATA[<p>[...] jQuery(&#8220;#errors*&#8221;).hide(); window.location= data.themeInternalUrl; } }); }        onlinejournalismblog.com  &#8211; Today, 5:51 [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Free school meals in Scottish primary schools &#8211; data visualisation &#124; Help Me Investigate&#8230; Education</title>
		<link>http://onlinejournalismblog.com/2012/01/13/sftw-scraping-data-with-google-refine/#comment-287516</link>
		<dc:creator>Free school meals in Scottish primary schools &#8211; data visualisation &#124; Help Me Investigate&#8230; Education</dc:creator>
		<pubDate>Sat, 14 Jan 2012 14:48:25 +0000</pubDate>
		<guid isPermaLink="false">http://onlinejournalismblog.com/?p=15674#comment-287516</guid>
		<description>[...] The data was obtained by scraping the data from over 3,000 pages on the Education Scotland website, using Scraperwiki (you can see the scraper here). As the page for each school had its own URL based on the school&#8217;s ID, the scraper had to generate URLs for each school from a list of codes obtained by Jennifer O&#8217;Mahoney of The Scotsman. You can read more about this process and how it can also be done using Google Refine on the Online Journalis.... [...]</description>
		<content:encoded><![CDATA[<p>[...] The data was obtained by scraping the data from over 3,000 pages on the Education Scotland website, using Scraperwiki (you can see the scraper here). As the page for each school had its own URL based on the school&#8217;s ID, the scraper had to generate URLs for each school from a list of codes obtained by Jennifer O&#8217;Mahoney of The Scotsman. You can read more about this process and how it can also be done using Google Refine on the Online Journalis&#8230;. [...]</p>
]]></content:encoded>
	</item>
</channel>
</rss>

