<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Online Journalism Blog &#187; open platform</title>
	<atom:link href="http://onlinejournalismblog.com/tag/open-platform/feed/" rel="self" type="application/rss+xml" />
	<link>http://onlinejournalismblog.com</link>
	<description>A conversation.</description>
	<lastBuildDate>Sat, 11 Feb 2012 12:06:28 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
<cloud domain='onlinejournalismblog.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
		<item>
		<title>The OJB guide to open news APIs &#8211; part 1: Guardian, NYT and Daylife</title>
		<link>http://onlinejournalismblog.com/2009/04/23/the-ojb-guide-to-open-news-apis-part-1-guardian-nyt-and-daylife/</link>
		<comments>http://onlinejournalismblog.com/2009/04/23/the-ojb-guide-to-open-news-apis-part-1-guardian-nyt-and-daylife/#comments</comments>
		<pubDate>Thu, 23 Apr 2009 08:55:23 +0000</pubDate>
		<dc:creator>Paul Bradshaw</dc:creator>
				<category><![CDATA[online journalism]]></category>
		<category><![CDATA[api]]></category>
		<category><![CDATA[daylife]]></category>
		<category><![CDATA[duckduckgo]]></category>
		<category><![CDATA[Guardian]]></category>
		<category><![CDATA[New York Times]]></category>
		<category><![CDATA[open api]]></category>
		<category><![CDATA[open platform]]></category>
		<category><![CDATA[peter clark]]></category>
		<category><![CDATA[yahoo boss]]></category>

		<guid isPermaLink="false">http://onlinejournalismblog.com/?p=2562</guid>
		<description><![CDATA[In the first of a series, Peter Clark, founder of Broadersheet, takes a look at three of the leading APIs for people looking to build news-based web projects and mashups. About six months ago, a friend of mine released a new search engine called Duckduckgo. Duckduckgo was based on the much hyped (free) Yahoo BOSS search engine platform, it was<br /><span class="read_more"><a href="http://onlinejournalismblog.com/2009/04/23/the-ojb-guide-to-open-news-apis-part-1-guardian-nyt-and-daylife/">Read more...</a></span>]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fonlinejournalismblog.com%2F2009%2F04%2F23%2Fthe-ojb-guide-to-open-news-apis-part-1-guardian-nyt-and-daylife%2F" onclick="urchinTracker('/outgoing/api.tweetmeme.com/share?url=http_3A_2F_2Fonlinejournalismblog.com_2F2009_2F04_2F23_2Fthe-ojb-guide-to-open-news-apis-part-1-guardian-nyt-and-daylife_2F&amp;referer=');"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fonlinejournalismblog.com%2F2009%2F04%2F23%2Fthe-ojb-guide-to-open-news-apis-part-1-guardian-nyt-and-daylife%2F&amp;style=normal&amp;b=2" height="61" width="50" /><br />
			</a>
		</div>
<p><em>In the first of a series, <strong><a href="http://blog.omgponi.es/" onclick="urchinTracker('/outgoing/blog.omgponi.es/?referer=');">Peter Clark</a></strong>, founder of <a href="http://broadersheet.com/" onclick="urchinTracker('/outgoing/broadersheet.com/?referer=');">Broadersheet</a></em><em>, takes a look at three of the leading APIs for people looking to build news-based web projects and mashups.</em></p>
<p>About six months ago, a friend of mine released a new search engine called <a href="http://duckduckgo.com/" onclick="urchinTracker('/outgoing/duckduckgo.com/?referer=');">Duckduckgo</a>. Duckduckgo was based on the much hyped (free) <a href="http://developer.yahoo.com/search/boss/" onclick="urchinTracker('/outgoing/developer.yahoo.com/search/boss/?referer=');">Yahoo BOSS search engine platform</a>, it was well received and now serves hundreds of thousands of searches a day.</p>
<p>Yahoo recently announced BOSS was going to be a paid-for service &#8211; surprising a lot of developers. When you’ve built a popular (albeit non-profitable) service on a free platform, and that platform suddenly becomes rather expensive &#8211; that eats into your ramen budget.</p>
<p>So when various news agencies announced content delivery developer platforms, I was particularly interested in where they were headed.</p>
<p>There are various services &#8211; some free, some paid-for &#8211; that developers can use to extract content and valuable information from news agencies. My friend was developing a web application that took content from The Guardian, and automatically printed a bespoke newspaper each day about your favourite topics. He expressed displeasure about The Guardian restricting developers from doing this:</p>
<blockquote><p>&#8220;You will not: Use Open Guardian Platform Content in any printed format&#8221;</p></blockquote>
<p>We’re entering a new age of restrictions and jumping through hoops and loopholes to make awesome content platforms for users.</p>
<p>There are three top platforms for news content which I explore below. I&#8217;ll discuss what you can and can’t do <em>technically</em>.<span id="more-2562"></span></p>
<p>In the future I’ll attempt to translate terms of services into understandable answers to questions like “Can I legally do X?”</p>
<h3>NYTimes Developer Network</h3>
<p>The primary NYTimes API is the <a href="http://developer.nytimes.com/docs/article_search_api" onclick="urchinTracker('/outgoing/developer.nytimes.com/docs/article_search_api?referer=');">Article Search API</a>. It allows you to extract headlines, abstracts and multimedia from any news article published since 1981. You can’t, unlike The Guardian, extract full content &#8211; only leading paragraphs or extracts.</p>
<p>The <a href="http://developer.nytimes.com/docs/timestags_api/" onclick="urchinTracker('/outgoing/developer.nytimes.com/docs/timestags_api/?referer=');">TimesTags API</a> is an internal-to-NYTimes taxonomy of tags and their relationships. You can easily find related tags to an article or tag.</p>
<p>There is significant value in these taxonomies: if you wanted to aggregate any content that discussed the Boston Celtics but didn’t necessarily include the phrase “Boston Celtics” you’d need to use a tag taxonomy &#8211; and these are a real headache to compile from scratch.</p>
<p>The final, and most socially exciting API is the <a href="http://developer.nytimes.com/docs/timespeople_api/" onclick="urchinTracker('/outgoing/developer.nytimes.com/docs/timespeople_api/?referer=');">TimesPeople API</a>. This allows you to lookup the tens of millions of daily users on the NYTimes.com website, and extract information as to what their activity is on the website. If they’ve recommended articles, if they’ve rated stories, and profile information such as age and location.</p>
<h3>The Guardian Open Platform</h3>
<p>When <a href="http://www.guardian.co.uk/open-platform" onclick="urchinTracker('/outgoing/www.guardian.co.uk/open-platform?referer=');">The Guardian says “Open</a>” they really mean it. You can extract full articles, their author and category. The caveat, naturally, is that in the future you’ll have to join The Guardian advertising network, or pay for the content privilege. It turns out that journalists do have to eat. Outrageous.</p>
<p>The Guardian has also spent significant time creating beautiful tag taxonomies for their content. Running a query for listings of tags about “Obama” returns 4: Obama Administration, Barack Obama, Obama Inauguration and Michelle Obama. You can then extract the articles, and further filter them via additional tags or dates.</p>
<p>If you wanted all articles written by Charlie Brooker about ponies dated before April 2007 this would be simple, thanks to The Guardian.</p>
<p>An interesting caveat is that you must not store any content for longer than 24 hours. That isn’t as large an issue as it sounds if you consider how irrelevant news is after 24 hours, and that you can still pull the content a user requires it after it expires.</p>
<p>Further content you can syndicate includes polls, competitions and the rich interactive flash content The Guardian has on its site.</p>
<h3>Daylife</h3>
<p><a href="http://www.daylife.com/" onclick="urchinTracker('/outgoing/www.daylife.com/?referer=');">Daylife</a> is the daddy of content developer platforms. A great example of what you can achieve via Daylife is the <a href="http://www.zemanta.com/" onclick="urchinTracker('/outgoing/www.zemanta.com/?referer=');">Zemanta</a>-related content blogging utility: by utilizing the Daylife Image library, it can extract high quality, free-to-use images about what you’re writing about.</p>
<p>Daylife is heavily used by web agencies for media organizations to create sections of content &#8211; a one page overview of Barack Obama, referencing content from your news agency, from Wikipedia, Youtube, and related content from other news agencies.</p>
<p>The massive benefit of these content platforms over traditional “scraping” or RSS feeds is that the data is wrapped in information that’ll make it trivial to create a hierarchy of content, find related content and build up “value added content” that readers will demand if you’re trying to persuade them not to go to the original news provider website.</p>
<p>The massive downside is that you’re at the mercy of the news publishers and their developer frameworks. There is no such thing as a free lunch, especially if you’re syndicating content from a news publisher whose revenues have decreased more than 10% in the past year. You’re going to be expected to drive them traffic and subsidize their printing presses.</p>
<p>What are your favourite mashups of the content platforms you’ve seen so far?</p>
<iframe src="http://www.facebook.com/plugins/like.php?href=http%3A%2F%2Fonlinejournalismblog.com%2F2009%2F04%2F23%2Fthe-ojb-guide-to-open-news-apis-part-1-guardian-nyt-and-daylife%2F&amp;layout=standard&amp;show_faces=true&amp;width=450&amp;action=like&amp;colorscheme=light&amp;height=80" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:450px; height:80px;" allowTransparency="true"></iframe>]]></content:encoded>
			<wfw:commentRss>http://onlinejournalismblog.com/2009/04/23/the-ojb-guide-to-open-news-apis-part-1-guardian-nyt-and-daylife/feed/</wfw:commentRss>
		<slash:comments>13</slash:comments>
		</item>
		<item>
		<title>Guardian joins New York Times in releasing open API</title>
		<link>http://onlinejournalismblog.com/2009/03/10/guardian-joins-new-york-times-in-releasing-open-api/</link>
		<comments>http://onlinejournalismblog.com/2009/03/10/guardian-joins-new-york-times-in-releasing-open-api/#comments</comments>
		<pubDate>Tue, 10 Mar 2009 14:13:25 +0000</pubDate>
		<dc:creator>Paul Bradshaw</dc:creator>
				<category><![CDATA[newspapers]]></category>
		<category><![CDATA[advertising]]></category>
		<category><![CDATA[api]]></category>
		<category><![CDATA[apimaps.org]]></category>
		<category><![CDATA[content tagger]]></category>
		<category><![CDATA[crowdsourcing]]></category>
		<category><![CDATA[Guardian]]></category>
		<category><![CDATA[mashups]]></category>
		<category><![CDATA[New York Times]]></category>
		<category><![CDATA[open api]]></category>
		<category><![CDATA[open platform]]></category>
		<category><![CDATA[RSS]]></category>
		<category><![CDATA[sculptors]]></category>
		<category><![CDATA[tagging]]></category>
		<category><![CDATA[techcrunch]]></category>
		<category><![CDATA[zemanta]]></category>

		<guid isPermaLink="false">http://onlinejournalismblog.com/?p=2345</guid>
		<description><![CDATA[Will 2009 be the year news organisations finally went open? Barely a month after the New York Times allowed users to build on 28 years of content with its articles API (with immediate results), The Guardian is opening up over a million articles to developers for free as part of its own &#8216;Open Platform&#8216;. TechCrunch reports: &#8220;The Guardian is effectively letting<br /><span class="read_more"><a href="http://onlinejournalismblog.com/2009/03/10/guardian-joins-new-york-times-in-releasing-open-api/">Read more...</a></span>]]></description>
			<content:encoded><![CDATA[<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fonlinejournalismblog.com%2F2009%2F03%2F10%2Fguardian-joins-new-york-times-in-releasing-open-api%2F" onclick="urchinTracker('/outgoing/api.tweetmeme.com/share?url=http_3A_2F_2Fonlinejournalismblog.com_2F2009_2F03_2F10_2Fguardian-joins-new-york-times-in-releasing-open-api_2F&amp;referer=');"><br />
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fonlinejournalismblog.com%2F2009%2F03%2F10%2Fguardian-joins-new-york-times-in-releasing-open-api%2F&amp;style=normal&amp;b=2" height="61" width="50" /><br />
			</a>
		</div>
<p>Will 2009 be the year news organisations finally went open? Barely a month after <a href="http://onlinejournalismblog.com/2009/02/06/new-york-times-lets-users-build-things-with-its-content-open-api/">the New York Times allowed users to build on 28 years of content with its articles API</a> (with immediate<a href="http://onlinejournalismblog.com/2009/02/23/adding-value-to-the-archives-suburbifiedcom-mashes-up-nyt-real-estate-articles/"> results</a>), The Guardian is opening up over a million articles to developers for free as part of its own &#8216;<a href="http://www.guardian.co.uk/open-platform" onclick="urchinTracker('/outgoing/www.guardian.co.uk/open-platform?referer=');">Open Platform</a>&#8216;.<span id="more-2345"></span></p>
<p>TechCrunch <a href="http://uk.techcrunch.com/2009/03/10/the-guardian-launches-open-api-for-all-content-but-they-still-control-the-ads/" onclick="urchinTracker('/outgoing/uk.techcrunch.com/2009/03/10/the-guardian-launches-open-api-for-all-content-but-they-still-control-the-ads/?referer=');">reports</a>:</p>
<blockquote><p>&#8220;The Guardian is effectively letting control of its content go in order to maximise its reach &#8211; and therefore the number of eyeballs that see its brand/content &#8211; across the Web.</p>
<p>&#8220;the API does not prevent developers from running “commercial applications” using the API &#8211; but I’d read the <a href="http://www.guardian.co.uk/open-platform/terms-and-conditions" onclick="urchinTracker('/outgoing/www.guardian.co.uk/open-platform/terms-and-conditions?referer=');">fine detail</a> first. They do <a href="http://www.guardian.co.uk/open-platform/partner-programs" onclick="urchinTracker('/outgoing/www.guardian.co.uk/open-platform/partner-programs?referer=');">say</a>: “You can display your own ads and keep your own revenue. We will require that you join our ad network in the future.”</p>
<p>&#8220;&#8230; The APIs feature ‘full fat’ feeds and other content including video, audio and photo galleries. You can combine free text search and combine tags to create feeds based on XML, JSON and Atom. </p>
<p>&#8220;&#8230; Some early examples include <a href="http://labs.zemanta.com/guardian/" onclick="urchinTracker('/outgoing/labs.zemanta.com/guardian/?referer=');">Zemanta’s Guardian topic research demo</a>. It’s a simple app that searches the vast database of Guardian articles via its API and then uses Zemanta’s API to get links to related concepts.</p>
<p>&#8220;There is also <a href="http://www.contenttagger.org/" onclick="urchinTracker('/outgoing/www.contenttagger.org/?referer=');">Content Tagger</a>, an application to provide user-generated tagging on guardian.co.uk content. <a href="http://guardian.apimaps.org/" onclick="urchinTracker('/outgoing/guardian.apimaps.org/?referer=');">ApiMaps.org</a>, built by Stamen Design, is designed to crowdsource geodata about Guardian articles. <a href="http://www.sculpture.org.uk/" onclick="urchinTracker('/outgoing/www.sculpture.org.uk/?referer=');">Cass Sculpture Foundation</a> is using the Open Platform to insert lists of articles from the Guardian about its sculptors into their biography and home pages.</p></blockquote>
<p>Expect more news organisations to follow suit.</p>
<iframe src="http://www.facebook.com/plugins/like.php?href=http%3A%2F%2Fonlinejournalismblog.com%2F2009%2F03%2F10%2Fguardian-joins-new-york-times-in-releasing-open-api%2F&amp;layout=standard&amp;show_faces=true&amp;width=450&amp;action=like&amp;colorscheme=light&amp;height=80" scrolling="no" frameborder="0" style="border:none; overflow:hidden; width:450px; height:80px;" allowTransparency="true"></iframe>]]></content:encoded>
			<wfw:commentRss>http://onlinejournalismblog.com/2009/03/10/guardian-joins-new-york-times-in-releasing-open-api/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
	</channel>
</rss>

