<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	>

<channel>
	<title>Open Knowledge Foundation Blog &#187; Open Science</title>
	<atom:link href="http://blog.okfn.org/taxonomy/open-science/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.okfn.org</link>
	<description></description>
	<pubDate>Mon, 06 Sep 2010 10:51:28 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.7</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>The Power of Open Data</title>
		<link>http://blog.okfn.org/2010/09/01/the-power-of-open-data/</link>
		<comments>http://blog.okfn.org/2010/09/01/the-power-of-open-data/#comments</comments>
		<pubDate>Wed, 01 Sep 2010 10:11:16 +0000</pubDate>
		<dc:creator>Jonathan Gray</dc:creator>
		
		<category><![CDATA[Guest post]]></category>

		<category><![CDATA[Open Access]]></category>

		<category><![CDATA[Open Data]]></category>

		<category><![CDATA[Open Science]]></category>

		<guid isPermaLink="false">http://blog.okfn.org/?p=3716</guid>
		<description><![CDATA[The following guest post is from David Bollier, independent policy strategist, journalist, and author of Viral Spiral. It was originally posted at the On the Commons blog.

Science has always recognized the power of sharing in developing new knowledge.  But in the search for treatments and cures for diseases like Alzheimer’s and Parkinson’s, the sprawling [...]


Related posts:<ol><li><a href='http://blog.okfn.org/2004/10/30/the-medical-innovation-convention-a-new-global-framework-for-healthcare-research-and-development/' rel='bookmark' title='Permanent Link: The Medical Innovation Convention: A New Global Framework for Healthcare Research and Development'>The Medical Innovation Convention: A New Global Framework for Healthcare Research and Development</a></li><li><a href='http://blog.okfn.org/2007/09/04/articles-in-ctwatch-quarterly/' rel='bookmark' title='Permanent Link: Articles in CTWatch Quarterly'>Articles in CTWatch Quarterly</a></li><li><a href='http://blog.okfn.org/2010/06/08/biomed-central-open-data-award-winner-to-be-announced-this-week/' rel='bookmark' title='Permanent Link: BioMed Central Open Data Award: winner to be announced this week!'>BioMed Central Open Data Award: winner to be announced this week!</a></li></ol>]]></description>
			<content:encoded><![CDATA[<p><strong>The following guest post is from <a href="http://www.bollier.org/">David Bollier</a>, independent policy strategist, journalist, and author of <a href="http://www.viralspiral.cc/">Viral Spiral</a>. It was originally posted at the <a href="http://onthecommons.org/power-open-data-0">On the Commons</a> blog.</strong></p>

<p>Science has always recognized the power of sharing in developing new knowledge.  But in the search for treatments and cures for diseases like Alzheimer’s and Parkinson’s, the sprawling bodies of highly diverse research data are not easily shared.  Either they are considered proprietary resources for making money, or they are hidden in academic databases that others may not know about, often inaccessible because of incompatible software formats.  No single researcher really has the resources or incentive to develop an overarching regime  to enable cooperation and sharing.  And so dozens of academics, nonprofits and pharmaceutical companies have continued their research in relative&nbsp;isolation.</p>

<p> “Companies were caught in a prisoner’s dilemma,” a research at the University of Pennsylvania recently told the <a href="https://www.nytimes.com/2010/08/13/health/research/13alzheimer.html?_r=1&amp;scp=1&amp;sq=rare%20data%20sharing%20alzheimer%27s&amp;st=cse">New York Times.</a>  “They all wanted to move the field forward, but no one wanted to take the risks of doing&nbsp;it.”  </p>

<p> But ten years ago, Dr. Neil S. Buckholtz, who oversees dementia research at the National Institutes of Health, realized that the sharing of research data was a collective action problem that might be solved through concerted leadership.  He helped instigate a plan by which the NIH stepped up to serve as an “honest broker” between the pharmaceutical industry and academics.  The goal was to ensure that all research would be shared openly and freely, and published on the Internet immediately, so that anyone could use it.  You could publish a research paper and you could develop new treatments, but no one would own the data.  Researchers would even be free to make mistakes or misguided interpretations &#8212; because who is to say at the outset that something is necessarily&nbsp;incorrect?</p>

<p> Seven years ago, the NIH persuaded scientists from the FDA, the drug industries, medical-imaging companies, academia and nonprofit groups to cooperate in an ambitious scheme to affirmatively <em>share</em> their findings with each other.  As the <em>Times</em> reports (August 13, 2010), the sharing of data is now starting to show results.  Scientists studying Alzheimer’s disease routinely share their findings about “biological markers” that indicate the progression of the disease.  This has led to recent scientific papers suggesting the value of PET scans and tests of spinal fluids as ways to make early diagnoses of&nbsp;Alzheimer’s. </p>

<p> The conventional business response to such radical ideas of “sharing” is that no company would have adequate incentive to invest in risky research unless they could be assured exclusive ownership of the results, in order to create a revenue-generating “product” (i.e., medical treatment or drug).  But the drug industry has had to concede that diseases such as Alzheimer’s are just too scientifically complicated for any single research entity to tackle; the most fruitful way forward is to pursue an “open source” approach that places the basic building-blocks of knowledge into the commons – while sanctioning the private patenting of more refined medical innovations that build on the fruits of the&nbsp;commons.</p>

<p> It’s so common-sensical that it seems faintly ridiculous that a story of this sort should merit lead-story treatment in the <em>New York&nbsp;Times.</em></p>
<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fblog.okfn.org%2F2010%2F09%2F01%2Fthe-power-of-open-data%2F">
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fblog.okfn.org%2F2010%2F09%2F01%2Fthe-power-of-open-data%2F&amp;style=normal&amp;service=bit.ly&amp;service_api=okfn%3AR_fe0b7ee16cbd10dd919dea5f4ffd24a3" height="61" width="50" />
			</a>
		</div><p class="akst_link"><a href="http://blog.okfn.org/?p=3716&amp;akst_action=share-this"  title="E-mail this, post to del.icio.us, etc." id="akst_link_3716" class="akst_share_link" rel="nofollow">Share This</a>
</p>

<p>Related posts:<ol><li><a href='http://blog.okfn.org/2004/10/30/the-medical-innovation-convention-a-new-global-framework-for-healthcare-research-and-development/' rel='bookmark' title='Permanent Link: The Medical Innovation Convention: A New Global Framework for Healthcare Research and Development'>The Medical Innovation Convention: A New Global Framework for Healthcare Research and Development</a></li><li><a href='http://blog.okfn.org/2007/09/04/articles-in-ctwatch-quarterly/' rel='bookmark' title='Permanent Link: Articles in CTWatch Quarterly'>Articles in CTWatch Quarterly</a></li><li><a href='http://blog.okfn.org/2010/06/08/biomed-central-open-data-award-winner-to-be-announced-this-week/' rel='bookmark' title='Permanent Link: BioMed Central Open Data Award: winner to be announced this week!'>BioMed Central Open Data Award: winner to be announced this week!</a></li></ol></p>]]></content:encoded>
			<wfw:commentRss>http://blog.okfn.org/2010/09/01/the-power-of-open-data/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Introducing the Panton Papers</title>
		<link>http://blog.okfn.org/2010/07/26/introducing-the-panton-papers/</link>
		<comments>http://blog.okfn.org/2010/07/26/introducing-the-panton-papers/#comments</comments>
		<pubDate>Mon, 26 Jul 2010 19:22:20 +0000</pubDate>
		<dc:creator>Jonathan Gray</dc:creator>
		
		<category><![CDATA[Ideas]]></category>

		<category><![CDATA[OKF]]></category>

		<category><![CDATA[Open Data]]></category>

		<category><![CDATA[Open Knowledge]]></category>

		<category><![CDATA[Open Science]]></category>

		<category><![CDATA[WG Open Data in Science]]></category>

		<category><![CDATA[Working Groups]]></category>

		<guid isPermaLink="false">http://blog.okfn.org/?p=3504</guid>
		<description><![CDATA[Peter Murray-Rust &#8212; Cambridge University chemist, Open Knowledge Foundation Advisory Board member and tireless advocate for open data in chemistry &#8212; has recently started a series of blog posts about open data, focusing on issues related to the Panton Principles for open data in science.

The first is called Open Data: why I need the Open [...]


Related posts:<ol><li><a href='http://blog.okfn.org/2010/02/19/launch-of-the-panton-principles-for-open-data-in-science/' rel='bookmark' title='Permanent Link: Launch of the Panton Principles for Open Data in Science and &#8216;Is It Open Data?&#8217; Web Service'>Launch of the Panton Principles for Open Data in Science and &#8216;Is It Open Data?&#8217; Web Service</a></li><li><a href='http://blog.okfn.org/2010/06/23/panton-principle-authors-win-sparc-innovator-prize/' rel='bookmark' title='Permanent Link: Panton Principle authors win SPARC Innovator prize!'>Panton Principle authors win SPARC Innovator prize!</a></li><li><a href='http://blog.okfn.org/2010/03/25/comments-on-the-panton-principles-and-data-licensing/' rel='bookmark' title='Permanent Link: Comments on the Panton Principles and Data Licensing'>Comments on the Panton Principles and Data Licensing</a></li></ol>]]></description>
			<content:encoded><![CDATA[<p><a href="http://en.wikipedia.org/wiki/Peter_Murray-Rust">Peter Murray-Rust</a> &#8212; Cambridge University chemist, Open Knowledge Foundation Advisory Board member and tireless advocate for open data in chemistry &#8212; has recently started a series of blog posts about open data, focusing on issues related to the <a href="http://pantonprinciples.org/">Panton Principles</a> for open data in science.</p>

<p>The first is called <a href="http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=2471">Open Data: why I need the Open Knowledge Foundation</a>, and in it he introduces some of the issues he wishes to discuss and gives his vision for the role he hopes the OKF community will play in relation to open data. He writes:</p>

<blockquote>
  <p>After a period of silence on this blog (but <a href="http://lists.okfn.org/mailman/listinfo">not on the Open Knowledge Foundation lists</a>) I hope to publish a flurry of ideas on Open Data. There is no doubt that “Open Data” has arrived and there is enormous interest. (By contrast when I started to investigate it 5 years ago there was nothing). It’s desperately important, more complex than I ever imagined, and it’s critical to address it immediately, responsibly, dispassionately and inclusively. If we manage to set out the concerns now, we may manage to avoid the worst problems that were encountered by the Open Source and later Open Access movements. [They have made enormous progress and without their footsteps Open Data would fall into many of the same pitfalls. But Open Data is Difficult – a phrase I shall repeat frequently.]</p>
  
  <p>I am putting my faith and energy into the <a href="http://www.okfn.org">Open Knowledge Foundation</a> – its people and its infrastructure. This is because it’s an organisation which is wideranging (it deals with open content of all sorts, open metadata, services, etc.). It has great expertise in legal problems and solutions (where these are necessary) and also how to find alternative approaches. It’s neutral (apart from urging Openness and developing the infrastructure). It’s very professional, and realises that ideas without implementation have less weight. So there is an impressive range of software and information skills. I am reminded of my favourite motto (from the IETF) – “rough consensus and running code”, one the greatest productive mantras of our time.</p>
  
  <p>The enthusiasm is palpable. [Today I had a breakfast Skype session with Jonathan Gray (coordinator of OKF) and it's all about how we can make things happen fast and responsibly.] The OKF works through Working Groups and discussion lists, and so when I had a concern about Open Data I brought it to the OKF and – after a great deal of work – we emerged with the <a href="http://pantonprinciples.org/">Panton Principles</a> which have now been translated into several languages by OKF members.</p>
  
  <p><strong>Simply, the OKF amplifies the visions of individuals from the almost-impossible to the attainable.</strong></p>
  
  <p>So I am putting some ideas into the OKF melting pot to see what emerges.</p>
</blockquote>

<p>In the next post, titled <a href="http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=2474">Open Data: The concept of Panton Papers</a>, he lays out his ideas for the Panton Papers:</p>

<blockquote>
  <p>The current theme is “Panton Papers”. The idea is that part of the value of the <a href="http://www.pantonprinciples.org/">Panton Principles</a> is that the whole document is short and the key points are simply made. But the “Principles” can therefore only address the motivation and the procedures for Open data in a general manner, and many of the problems are in the details. I believe that many of the problems in Open Access (which is simpler than Open Data) arose because not enough communal effort was given to the practice of Open Access and I want to avoid as many OD problems as possible before they occur.</p>
  
  <p>Over the last 2 years (when Open Data has started to become important and discussed) I have seen several potentially difficult areas. I’ll simply list the ones I have thought of here and then outline the idea of the Panton Papers. This discussion is mirrored in part by the OKF <a href="http://lists.okfn.org/pipermail/open-science/2010-July/thread.html">open-science discussion list</a> and you may wish to subscribe. There’s also a regular <a href="http://wiki.okfn.org/wg/science">working group on open-science</a>. (Almost everything in OKF is Open, but it may take a little while to find out where you want to be!). The issues that I currently have are:</p>
  
  <ul>
  <li><strong>What is data?</strong> Images? Graphs? Tables? Equations? Accounts of experiments? This is a major problem and almost completely unexplored. Without solving this we are held back 10 year or more in our ability to re-use the primary scientific literature (e.g. by closed-access publishers who claim that factual graphs belong to them).</li>
  <li><strong>Why should data be open?</strong> (and when should it not be?). I’ve put forward ideas <a href="http://en.wikipedia.org/wiki/Open_science_data">here</a> and <a href="http://precedings.nature.com/documents/1526/version/1">here</a> . They range from moral, to legal/quasi-legal to utilitarian.</li>
  <li><strong>Who owns data?</strong> This is one of the trickiest areas – there is legal and contractual ownership and there is moral ownership. Generally there is far far too much “ownership” of data.</li>
  <li><strong>When should data be released?</strong> This is a key question (see <a href="http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=2473">here</a> for an example). Some communities have solved it – most haven’t addressed it and will have to go through the rigour of working out release protocols.</li>
  <li><strong>How and where should data be exposed?</strong> I am strongly of the opinion that we need domain-specific repositories (which could be national or international) and the Institutional Repositories are almost never the best place to expose data (I expect and welcome alternative opinions). The “how” depends on understanding what the data and metadata are and is increasingly dependent on specialist software and information standards. “Archival” is often the wrong word to use.</li>
  <li><strong>Datamining and textmining.</strong> Most authors, publishers, repository owners are unaware of the enormous power of automated analysis of the literature. Some closed access publishers expressly forbid these activities. We have to liberate the right of the scientific community to do this enthusiastically and efficiently.</li>
  <li><strong>Reproducibility.</strong> Science is based on reproducibility – we expect to be able to replicate the “materials and methods” of an experiment and to try to falsify its claims. Physical materials are beyond the immediate discussion (though this may change) but much science is now based on computing. It should be possible to replicate simulations, data cleaning, data analysis, model fitting etc. This is a tricky area. It is difficult (though with virtualization and the cloud is becoming easier) to reproduce the computing environment. Large or complex data sets are a major problem but must be addressed. This is not without monetary cost. </li>
  </ul>
  
  <p>I may add more.</p>
  
  <p>The idea is that each of these is a “Panton Paper”. It may or may not be crafted in Pantonia (the hectare of the Chemistry Department, The OKF headquarters, and the Panton Arms in Cambridge UK). Everything I now write is mutable.</p>
  
  <p>Each paper will have a top level document of similar form to the Panton Principles, i.e. 3-8 ideas, with short explanatory paragraph(s). This document will be crafted by the OKF in public view on a wiki or Ether/Piratepad. Anyone can take part. We shall welcome contributions from a wide range of disciplines (in fact this is essential). At some stage version 1.0 of the paper will be frozen and will be formally published. We have an offer from a major publisher to do this and I am hoping we can announce this at Open Science Summit.</p>
  
  <p>The Paper should carry a wider range of links to other essays in Open Data and should carry examples from different disciplines. For example there is a well tried and accepted process in many areas of bioscience and astronomy as to what when and how data get published.</p>
</blockquote>

<p>Peter has started drafting ideas for the first two of these at:</p>

<ul>
<li><a href="http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=2475">What is Scientific Data?</a></li>
<li><a href="http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=2480">Why Scientific Data should be Open</a></li>
<li><a href="http://wwmm.ch.cam.ac.uk/blogs/murrayrust/?p=2484">Who owns Scientific Data?</a></li>
</ul>

<p>If you&#8217;d like to get stuck in, please head on over to the <a href="http://lists.okfn.org/mailman/listinfo/open-science">open-science</a> list and say hi! <img src='http://blog.okfn.org/wp-includes/images/smilies/icon_smile.gif' alt=':-)' class='wp-smiley' /> </p>
<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fblog.okfn.org%2F2010%2F07%2F26%2Fintroducing-the-panton-papers%2F">
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fblog.okfn.org%2F2010%2F07%2F26%2Fintroducing-the-panton-papers%2F&amp;style=normal&amp;service=bit.ly&amp;service_api=okfn%3AR_fe0b7ee16cbd10dd919dea5f4ffd24a3" height="61" width="50" />
			</a>
		</div><p class="akst_link"><a href="http://blog.okfn.org/?p=3504&amp;akst_action=share-this"  title="E-mail this, post to del.icio.us, etc." id="akst_link_3504" class="akst_share_link" rel="nofollow">Share This</a>
</p>

<p>Related posts:<ol><li><a href='http://blog.okfn.org/2010/02/19/launch-of-the-panton-principles-for-open-data-in-science/' rel='bookmark' title='Permanent Link: Launch of the Panton Principles for Open Data in Science and &#8216;Is It Open Data?&#8217; Web Service'>Launch of the Panton Principles for Open Data in Science and &#8216;Is It Open Data?&#8217; Web Service</a></li><li><a href='http://blog.okfn.org/2010/06/23/panton-principle-authors-win-sparc-innovator-prize/' rel='bookmark' title='Permanent Link: Panton Principle authors win SPARC Innovator prize!'>Panton Principle authors win SPARC Innovator prize!</a></li><li><a href='http://blog.okfn.org/2010/03/25/comments-on-the-panton-principles-and-data-licensing/' rel='bookmark' title='Permanent Link: Comments on the Panton Principles and Data Licensing'>Comments on the Panton Principles and Data Licensing</a></li></ol></p>]]></content:encoded>
			<wfw:commentRss>http://blog.okfn.org/2010/07/26/introducing-the-panton-papers/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Pollen data in the New and Old World</title>
		<link>http://blog.okfn.org/2010/07/14/pollen-data-in-the-new-and-old-world/</link>
		<comments>http://blog.okfn.org/2010/07/14/pollen-data-in-the-new-and-old-world/#comments</comments>
		<pubDate>Wed, 14 Jul 2010 08:00:55 +0000</pubDate>
		<dc:creator>steko</dc:creator>
		
		<category><![CDATA[Guest post]]></category>

		<category><![CDATA[Open Data]]></category>

		<category><![CDATA[Open Science]]></category>

		<category><![CDATA[Open/Closed]]></category>

		<category><![CDATA[Technical]]></category>

		<category><![CDATA[WG Archaeology]]></category>

		<category><![CDATA[WG Open Data in Science]]></category>

		<guid isPermaLink="false">http://blog.okfn.org/?p=3337</guid>
		<description><![CDATA[The following guest post is from Stefano Costa at the University of Siena. He is Founder of the IOSA initiative and Coordinator of the Open Knowledge Foundation&#8217;s  Working Group on Open Data in Archaeology. Stefano wishes to thank Thomas Kluyver and David Jones for their help in reviewing the post.

Since the 19th century, the [...]


Related posts:<ol><li><a href='http://blog.okfn.org/2010/04/20/world-bank-opens-up-development-data/' rel='bookmark' title='Permanent Link: World Bank Opens Up Development Data'>World Bank Opens Up Development Data</a></li><li><a href='http://blog.okfn.org/2008/12/02/biblios-worlds-largest-database-of-open-bibliographic-data-goes-beta/' rel='bookmark' title='Permanent Link: Biblios - &#8220;world&#8217;s largest database&#8221; of open bibliographic data goes beta!'>Biblios - &#8220;world&#8217;s largest database&#8221; of open bibliographic data goes beta!</a></li><li><a href='http://blog.okfn.org/2010/02/25/open-data-in-archaeology/' rel='bookmark' title='Permanent Link: Open Data in Archaeology'>Open Data in Archaeology</a></li></ol>]]></description>
			<content:encoded><![CDATA[<p><strong>The following guest post is from <a href="http://www.iosa.it/">Stefano Costa</a> at the University of Siena. He is Founder of the <a href="http://www.iosa.it/">IOSA initiative</a> and Coordinator of the <a href="http://www.okfn.org">Open Knowledge Foundation</a>&#8217;s  <a href="http://wiki.okfn.org/wg/archaeology">Working Group on Open Data in Archaeology</a>. Stefano wishes to thank <a href="http://www.shef.ac.uk/aps/apsrtp/aps-rtp-2010/kluyver-thomas/">Thomas Kluyver</a> and <a href="http://drj11.wordpress.com/">David Jones</a> for their help in reviewing the post.</strong></p>

<p>Since the 19th century, the study of archaeobotanical remains has been very important for combining “strict” archaeological knowledge with environmental data. Pollen data enable assessing the introduction of certain domesticated species of plants, or the presence of other species that grow typically where humans dwell. Not all pollen data come from archaeological fieldwork, and pollen analysis is often done by ecologists without a particular focus on human-associated plants. However, from an archaeologist&#8217;s perspective the relationship among the two sets is strong enough to take an interested look at pollen data worldwide, their availability and most importantly their openness, for which we follow the <a href="http://www.opendefinition.org/okd/">Open Knowledge Definition</a>.</p>

<p>We found that there is a serious misunderstanding by universities and research centers of their role in society as places of research, innovation that is <em>available for everyone</em>. As for <a href="http://www.iosa.it/content/open-source-dendrochronology-not-yet">dendrochronological data</a>, academia is a closed system producing data (at very high costs for society) that are only available inside its walls, but it’s all done with public money.</p>

<h2>Finding pollen data</h2>

<p>The starting point for finding pollen data is the <a href="http://www.ncdc.noaa.gov/paleo/pollen.html">NOAA website</a>.</p>

<p>The <a href="http://www.ncdc.noaa.gov/paleo/gpd.html">Global Pollen Database </a>hosted by the NOAA is a good starting point, but apparently its coverage is quite limited outside the US. Furthermore, data from 2005 onwards aren’t available via <a href="ftp://ftp.ncdc.noaa.gov/pub/data/paleo/pollen/">FTP</a> in <a href="http://www.ncdc.noaa.gov/paleo/pollen/polfileformat.html">simple documented formats</a>, but are instead downloadable <em>as Access databases</em> from <a href="http://www.neotomadb.org/">another external website</a>. Defining Access databases as a Bad Choice™ for data exchange is perhaps an euphemism.</p>

<p>Unfortunately, a large number of databases covering single continents or smaller regions is growing, and the approaches to data dissemination show marked differences.</p>

<h2>Americas</h2>

<p>For both North and South America, you can get data from more than one thousand sites directly via FTP. There are no explicit terms of use. Usually, data retrieved from federal agencies are public domain data.</p>

<p>The <a href="ftp://ftp.ncdc.noaa.gov/pub/data/paleo/pollen/readme_pollen.txt">README document</a> only states <code>NOTE: PLEASE CITE ORIGINAL REFERENCES WHEN USING THIS DATA!!!!!</code>. Fair enough, the requirement for attribution is certainly compatible with the <a href="http://www.opendefinition.org/okd/">Open Knowledge Definition</a>.</p>

<ul>
    <li><a href="http://www.ncdc.noaa.gov/paleo/napd.html">North American Pollen Database</a></li>
    <li><a href="http://www.ncdc.noaa.gov/paleo/lapd.html">Latin American Pollen Database</a></li>
    <li><a href="http://www.ncdc.noaa.gov/paleo/gpd.html">Global Pollen Database</a></li>
</ul>

<h2>Europe</h2>

<p>From the GPD website we can easily reach the <a href="http://www.europeanpollendatabase.net/">European Pollen Database</a>, that is found at another website tough (and things can be even more confusing, provided that the NOAA website has some dead links).</p>

<p>You can download EPD data in PostgreSQL dump format (one file for each table, with a separate SQL script <code>create_epd_db.sql</code>). Data in the EPD can be restricted or unrestricted. That’s fine, let’s see how many unrestricted datasets there are. Following the database documentation, the <code>P_ENTITY</code> table contains the use status of each dataset:</p>

<div class="CodeRay">
<div class="code">
<pre>steko@gibreel:~/epd-postgres-distribution-20100531$ cat p_entity.dump |
awk -F "\t" {' print $5 '} | sort | uniq -c
    154 R
   1092 U</pre>
</div>
</div>

<p>which is pretty good because almost 88% of them are unrestricted (NB I write most of my programs in Python but I <em>love</em> one liners that involve <code>awk</code>, <code>sort</code> and <code>uniq</code>). We could easily create an “unrestricted” subset and make it available for easy download to all those who don’t want to mess up with restricted data.</p>

<p>But what do “unrestricted” mean for EPD data? Let’s take a more careful <a href="http://www.europeanpollendatabase.net/wiki/doku.php?id=new">look</a> (emphasis mine):</p>

<blockquote>
<ol>
    <li>Data will be classified as restricted or unrestricted. All data will be available in the EPD, although restricted data can be used only as provided below.</li>
    <li>Unrestricted data are available <em>for all uses</em>, and are included in the EPD on various electronic sites.</li>
    <li>Restricted data may be used only by permission of the data originator. Appropriate and ethical use of restricted data is the responsibility of the data user.</li>
    <li>Restrictions on data will expire three years after they are submitted to the EPD. Just prior to the time of expiration, the data originator will be contacted by the EPD database manager with a reminder of the pending change. The originator may extend restricted status for further periods of three years by so informing the EPD each time a three-year period expires.</li>
</ol>
</blockquote>

<p>Sounds quite good, doesn’t it? “for all uses” is reassuring and the short time limit is a good trade off. The horror comes a few paragraphs below with the following scary details:</p>

<blockquote>
<ol>
    <li>The data are available only to non-profit-making organizations and for research.</li>
</ol>
Profit-making organizations may use the data, even for legitimate uses, only with the written consent of the EPD Board, who will determine or negotiate the payment of any fee required.</blockquote>

<p>Here the false assumption that only academia is entitled to perform research is taken for granted. And there are even more <a href="http://www.europeanpollendatabase.net/wiki/doku.php?id=new#d._users">rules</a> about the “normal ethics”: basically if you use EPD data in a publication the original data author should be listed among the authors of the work. I always thought citation and attribution were invented just for that exact purpose, but it looks like they have distinctly different approach to <em>attribution</em>. The EPD is even deciding what are “legitimate” uses of pollen data (I can hardly think of any possible unlegitimate use).</p>

<ul>
    <li><a href="http://www.europeanpollendatabase.net/">European Pollen Database</a></li>
</ul>

<h2>Africa</h2>

<p>For “Africa” read “Europe” again, because most research projects are from French and English universities. For this reason, the situation is almost the same. What is even worst is that in developing countries there are far less people or organizations that can afford buying those data, notwithstanding the fact that in regions under rapid development the study and preservation of environmental resources are of major importance.</p>

<p>Data are downloadable for individual sites using a <a href="http://medias3.mediasfrance.org/apd/tools/fossile/index.html">search engine</a>, in Tilia format (not ASCII unfortunately). The problems come out with the license:</p>

<p>The wording is almost exactly the same as for the EPD seen above:</p>

<blockquote class="posterous_medium_quote">Normal ethics pertaining to co-authorship of publications applies. The contributor should be invited to be a co-author if a user makes significant use of a single contributor’s site, or if a single contributor’s data comprise a substantial portion of a larger data set analysed, or if a contributor makes a significant contribution to the analysis of the data or to the interpretation of the results. The data will be available only to non-profit-making organisations and for research. Profit-making organisations may use the data for legitimate purposes, only with the written consent of the majority of the members of the Advisory board, who will determine or negotiate the payment of any fee required. Such payment will be credited to the APD.</blockquote>

<ul>
    <li><a href="http://www.ncdc.noaa.gov/paleo/apd.html">African Pollen Database at NOAA</a></li>
    <li><a href="http://medias3.mediasfrance.org/apd/">Banque Africaine de Données Polliniques/African Pollen Database</a></li>
</ul>

<h2>Conclusions</h2>

<p>The only positive bit of the story, if any, is that these datasets are nevertheless available on the web, and their terms of use are clearly stated, no matter how restrictive. It would be just impossible to write a similar article about archaeological pottery, or zooarchaeological finds.</p>

<h2>Appendix: Using pollen data</h2>

<p>Pollen data are usually presented in forms of synthetic charts where both stratigraphic data and quantitative pollen data are easily readable. Each “column” of the chart stands for a species or <em>genus</em>. You <em>can</em> create this kind of visualization with free software tools.</p>

<p>The <a href="http://cran.r-project.org/web/packages/stratigraph/index.html">stratigraph</a> package for <a href="http://www.r-project.org/">R</a> can be used for</p>

<blockquote class="posterous_medium_quote">plotting and analyzing paleontological and geological data distributed through through time in stratigraphic cores or sections. Includes some miscellaneous functions for handling other kinds of palaeontological and paleoecological data.</blockquote>

<p>See the chart for an example of how they look like.</p>

<p style="text-align: center; "><img class="posterous_plugin_object posterous_plugin_object_image aligncenter" src="http://posterous.com/getfile/files.posterous.com/steko/cIv3JoK9mAHYsO150N3xkIJWkP3xfd8ZR2hYFOX3iATPHYPWJEjeUaBpTyqa/mohawk.png.thumb100.jpg?content_part=gR0C6TvYZx80AwKgSocW" alt="" width="100" height="100" /></p>
<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fblog.okfn.org%2F2010%2F07%2F14%2Fpollen-data-in-the-new-and-old-world%2F">
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fblog.okfn.org%2F2010%2F07%2F14%2Fpollen-data-in-the-new-and-old-world%2F&amp;style=normal&amp;service=bit.ly&amp;service_api=okfn%3AR_fe0b7ee16cbd10dd919dea5f4ffd24a3" height="61" width="50" />
			</a>
		</div><p class="akst_link"><a href="http://blog.okfn.org/?p=3337&amp;akst_action=share-this"  title="E-mail this, post to del.icio.us, etc." id="akst_link_3337" class="akst_share_link" rel="nofollow">Share This</a>
</p>

<p>Related posts:<ol><li><a href='http://blog.okfn.org/2010/04/20/world-bank-opens-up-development-data/' rel='bookmark' title='Permanent Link: World Bank Opens Up Development Data'>World Bank Opens Up Development Data</a></li><li><a href='http://blog.okfn.org/2008/12/02/biblios-worlds-largest-database-of-open-bibliographic-data-goes-beta/' rel='bookmark' title='Permanent Link: Biblios - &#8220;world&#8217;s largest database&#8221; of open bibliographic data goes beta!'>Biblios - &#8220;world&#8217;s largest database&#8221; of open bibliographic data goes beta!</a></li><li><a href='http://blog.okfn.org/2010/02/25/open-data-in-archaeology/' rel='bookmark' title='Permanent Link: Open Data in Archaeology'>Open Data in Archaeology</a></li></ol></p>]]></content:encoded>
			<wfw:commentRss>http://blog.okfn.org/2010/07/14/pollen-data-in-the-new-and-old-world/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Panton Principle authors win SPARC Innovator prize!</title>
		<link>http://blog.okfn.org/2010/06/23/panton-principle-authors-win-sparc-innovator-prize/</link>
		<comments>http://blog.okfn.org/2010/06/23/panton-principle-authors-win-sparc-innovator-prize/#comments</comments>
		<pubDate>Wed, 23 Jun 2010 15:33:10 +0000</pubDate>
		<dc:creator>Jonathan Gray</dc:creator>
		
		<category><![CDATA[News]]></category>

		<category><![CDATA[OKF]]></category>

		<category><![CDATA[Open Data]]></category>

		<category><![CDATA[Open Science]]></category>

		<category><![CDATA[WG Open Data in Science]]></category>

		<category><![CDATA[Working Groups]]></category>

		<guid isPermaLink="false">http://blog.okfn.org/?p=3225</guid>
		<description><![CDATA[We are delighted to announce that the authors of the Panton Principles have been awarded the SPARC Innovator prize!

The principles are currently maintained by the Open Knowledge Foundation&#8217;s Working Group on Open Data in Science.

From the announcement:


  Science is based on building on, reusing, and openly criticizing the published body of scientific knowledge. For [...]


Related posts:<ol><li><a href='http://blog.okfn.org/2010/02/19/launch-of-the-panton-principles-for-open-data-in-science/' rel='bookmark' title='Permanent Link: Launch of the Panton Principles for Open Data in Science and &#8216;Is It Open Data?&#8217; Web Service'>Launch of the Panton Principles for Open Data in Science and &#8216;Is It Open Data?&#8217; Web Service</a></li><li><a href='http://blog.okfn.org/2010/07/26/introducing-the-panton-papers/' rel='bookmark' title='Permanent Link: Introducing the Panton Papers'>Introducing the Panton Papers</a></li><li><a href='http://blog.okfn.org/2008/04/25/sparc-europe-seal-for-open-access-journals/' rel='bookmark' title='Permanent Link: SPARC Europe Seal for Open Access Journals'>SPARC Europe Seal for Open Access Journals</a></li></ol>]]></description>
			<content:encoded><![CDATA[<p>We are delighted to announce that the authors of the <a href="http://www.pantonprinciples.org/">Panton Principles</a> have been awarded the <a href="http://www.arl.org/sparc/innovator/">SPARC Innovator prize</a>!</p>

<p>The principles are currently maintained by the Open Knowledge Foundation&#8217;s <a href="http://wiki.okfn.org/wg/science">Working Group on Open Data in Science</a>.</p>

<p>From the announcement:</p>

<blockquote>
  <p>Science is based on building on, reusing, and openly criticizing the published body of scientific knowledge. For science to effectively function, and for society to reap the full benefits from scientific endeavors, it is crucial that science data be made open.</p>
  
  <p>That’s the belief of four leaders who have put forth a groundbreaking set of recommendations for scientists to more easily share their data – The Panton Principles – and who have been named the latest SPARC Innovators for their work. [...]</p>
  
  <p>The authors advocate making data freely available on the Internet for anyone to download, copy, analyze, reprocess, pass to software or use for any purpose without financial, legal or technical barriers. Through the Principles, the group aimed to develop clear language that explicitly defines how a scientist’s rights to his own data could be structured so others can freely reuse or build on it. The goal was to craft language simple enough that a scientist could easily follow it, and then focus on doing science rather than law.</p>
  
  <p>The Panton Principles were publicly launched in February of 2010, with a Web site at www.pantonprinciples.org to spread the word and an invitation to endorse. About 100 individuals and organizations have endorsed the Principles so far.</p>
  
  <p>“This is the first time we’re seeing diverse viewpoints crystallize around the pragmatic idea that we have to start somewhere, agree on the basics, and set the tone,” says Heather Joseph, Executive Director of SPARC (the Scholarly Publishing and Academic Resources Coalition). “The authors are all leading thinkers in this area – as well as producers and consumers of data. They each approached the idea of open data from different directions, yet with the same drive to open up science, and ended up on common ground.”</p>
  
  <p>According to Pollock, “It&#8217;s commonplace that we advance by building on the work of colleagues and predecessors – standing on the shoulders of giants. In a digital age, to build on the work of others we need something very concrete: access to the data of others and the freedom to use and reuse it. That&#8217;s what the Panton Principles are about.”</p>
</blockquote>

<p>Further details, including more background information and comments from other leading voices in the open science community, are available at:</p>

<ul>
<li><a href="http://www.arl.org/sparc/innovator/panton.shtml">http://www.arl.org/sparc/innovator/panton.shtml</a></li>
</ul>

<p>Alma Swan comments, quite rightly, that:</p>

<blockquote>
  <p>Coming up with the Principles is not going to cut the mustard by itself. They will need to be advocated and promoted so that scientists are interested in debating them.</p>
</blockquote>

<p>If you&#8217;re interested in helping to promote the principles to scientists in different domains, to research funding bodies, or to the general public, please introduce yourself on our <a href="http://lists.okfn.org/mailman/listinfo/open-science">open-science mailing list</a>.</p>

<div align="center"><img src="http://farm2.static.flickr.com/1194/4727141613_775cc8d470.jpg" alt="" /></div>
<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fblog.okfn.org%2F2010%2F06%2F23%2Fpanton-principle-authors-win-sparc-innovator-prize%2F">
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fblog.okfn.org%2F2010%2F06%2F23%2Fpanton-principle-authors-win-sparc-innovator-prize%2F&amp;style=normal&amp;service=bit.ly&amp;service_api=okfn%3AR_fe0b7ee16cbd10dd919dea5f4ffd24a3" height="61" width="50" />
			</a>
		</div><p class="akst_link"><a href="http://blog.okfn.org/?p=3225&amp;akst_action=share-this"  title="E-mail this, post to del.icio.us, etc." id="akst_link_3225" class="akst_share_link" rel="nofollow">Share This</a>
</p>

<p>Related posts:<ol><li><a href='http://blog.okfn.org/2010/02/19/launch-of-the-panton-principles-for-open-data-in-science/' rel='bookmark' title='Permanent Link: Launch of the Panton Principles for Open Data in Science and &#8216;Is It Open Data?&#8217; Web Service'>Launch of the Panton Principles for Open Data in Science and &#8216;Is It Open Data?&#8217; Web Service</a></li><li><a href='http://blog.okfn.org/2010/07/26/introducing-the-panton-papers/' rel='bookmark' title='Permanent Link: Introducing the Panton Papers'>Introducing the Panton Papers</a></li><li><a href='http://blog.okfn.org/2008/04/25/sparc-europe-seal-for-open-access-journals/' rel='bookmark' title='Permanent Link: SPARC Europe Seal for Open Access Journals'>SPARC Europe Seal for Open Access Journals</a></li></ol></p>]]></content:encoded>
			<wfw:commentRss>http://blog.okfn.org/2010/06/23/panton-principle-authors-win-sparc-innovator-prize/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Open Science Summit 2010, July 29-31, Berkeley</title>
		<link>http://blog.okfn.org/2010/06/22/open-science-summit-2010-july-29-31-berkeley/</link>
		<comments>http://blog.okfn.org/2010/06/22/open-science-summit-2010-july-29-31-berkeley/#comments</comments>
		<pubDate>Tue, 22 Jun 2010 14:16:56 +0000</pubDate>
		<dc:creator>Lisa Evans</dc:creator>
		
		<category><![CDATA[Open Data]]></category>

		<category><![CDATA[Open Science]]></category>

		<category><![CDATA[WG Open Data in Science]]></category>

		<guid isPermaLink="false">http://blog.okfn.org/?p=3179</guid>
		<description><![CDATA[The following guest post is from Joseph Jackson, one of the main organisers behind the Open Science Summit.

Dear Champions of Open Science,

Please join us in gathering stakeholders seeking to liberate our scientific and technological commons to enable a new era of decentralized, distributed innovation.


http://opensciencesummit.com/


While there are many great organizations and talented people thinking about these [...]


Related posts:<ol><li><a href='http://blog.okfn.org/2008/08/19/interview-with-science-commons-for-their-voices-from-the-future-of-science/' rel='bookmark' title='Permanent Link: Interview with Science Commons for their Voices from the Future of Science'>Interview with Science Commons for their Voices from the Future of Science</a></li><li><a href='http://blog.okfn.org/2010/07/07/hacks-and-hackers-birmingham-23rd-july-2010/' rel='bookmark' title='Permanent Link: Hacks and Hackers, Birmingham, 23rd July 2010'>Hacks and Hackers, Birmingham, 23rd July 2010</a></li><li><a href='http://blog.okfn.org/2010/02/19/launch-of-the-panton-principles-for-open-data-in-science/' rel='bookmark' title='Permanent Link: Launch of the Panton Principles for Open Data in Science and &#8216;Is It Open Data?&#8217; Web Service'>Launch of the Panton Principles for Open Data in Science and &#8216;Is It Open Data?&#8217; Web Service</a></li></ol>]]></description>
			<content:encoded><![CDATA[<p><strong>The following guest post is from Joseph Jackson, one of the main organisers behind the <a href="http://opensciencesummit.com/">Open Science Summit</a>.</strong></p>

<p>Dear Champions of Open Science,</p>

<p>Please join us in gathering stakeholders seeking to liberate our scientific and technological commons to enable a new era of decentralized, distributed innovation.</p>

<ul>
<li><a href="http://opensciencesummit.com/">http://opensciencesummit.com/</a></li>
</ul>

<p>While there are many great organizations and talented people thinking about these problems, the discussion is usually spread out across email lists, blogs, friend feed, etc. Experts in one domain (Open Data, Open
Access publishing) are not always engaged with those working on policy issues in another (patent reform, funding mechanisms to incentivize openness).  To remedy this, I ambitiously (perhaps too ambitiously) decided to attempt to bring everyone together in an event that I hope will help build cohesion between the many groups and individuals striving to transform science.</p>

<p>Over two and a half days, from July 29-31st at Berkeley, we will consider the major challenges and policy questions that must be addressed to successfully update the institutions, practices, and worldviews that comprise our global scientific governance system.  Topics include personal genomics, gene patents, open data, the future of scientific publication, peer review, and reputation, citizen science in biology, and many more.  We ask you to join us in imagining a new paradigm, one in which a virtuous circle of mutually reinforcing shifts toward transparency and collaboration could unleash hitherto untapped reserves of human ingenuity.  It will take tremendous coordinated effort to achieve this vision.</p>

<p>This event is for everyone who cares deeply about making science work more effectively to benefit all humanity.  There is a distinct focus on biology and life sciences at the conference because these fields are experiencing rapid change and are essential to our health, environment, and food security.  In future years we will try to include greater coverage from other disciplines, for example the exciting &#8220;collective science&#8221; data sharing efforts emerging in astronomy, math, and physics (arxiv.org being a leading example of open science principles in action).</p>

<p>We ask you to help us in making this first conference a success so that we can build to the next one.</p>

<p>Thank you,</p>

<p>Joseph and OSS volunteers.</p>
<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fblog.okfn.org%2F2010%2F06%2F22%2Fopen-science-summit-2010-july-29-31-berkeley%2F">
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fblog.okfn.org%2F2010%2F06%2F22%2Fopen-science-summit-2010-july-29-31-berkeley%2F&amp;style=normal&amp;service=bit.ly&amp;service_api=okfn%3AR_fe0b7ee16cbd10dd919dea5f4ffd24a3" height="61" width="50" />
			</a>
		</div><p class="akst_link"><a href="http://blog.okfn.org/?p=3179&amp;akst_action=share-this"  title="E-mail this, post to del.icio.us, etc." id="akst_link_3179" class="akst_share_link" rel="nofollow">Share This</a>
</p>

<p>Related posts:<ol><li><a href='http://blog.okfn.org/2008/08/19/interview-with-science-commons-for-their-voices-from-the-future-of-science/' rel='bookmark' title='Permanent Link: Interview with Science Commons for their Voices from the Future of Science'>Interview with Science Commons for their Voices from the Future of Science</a></li><li><a href='http://blog.okfn.org/2010/07/07/hacks-and-hackers-birmingham-23rd-july-2010/' rel='bookmark' title='Permanent Link: Hacks and Hackers, Birmingham, 23rd July 2010'>Hacks and Hackers, Birmingham, 23rd July 2010</a></li><li><a href='http://blog.okfn.org/2010/02/19/launch-of-the-panton-principles-for-open-data-in-science/' rel='bookmark' title='Permanent Link: Launch of the Panton Principles for Open Data in Science and &#8216;Is It Open Data?&#8217; Web Service'>Launch of the Panton Principles for Open Data in Science and &#8216;Is It Open Data?&#8217; Web Service</a></li></ol></p>]]></content:encoded>
			<wfw:commentRss>http://blog.okfn.org/2010/06/22/open-science-summit-2010-july-29-31-berkeley/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Some final - and future - thoughts on the BioMed Central Open Data award</title>
		<link>http://blog.okfn.org/2010/06/22/some-final-and-future-thoughts-on-the-biomed-central-open-data-award/</link>
		<comments>http://blog.okfn.org/2010/06/22/some-final-and-future-thoughts-on-the-biomed-central-open-data-award/#comments</comments>
		<pubDate>Tue, 22 Jun 2010 06:00:53 +0000</pubDate>
		<dc:creator>Jonathan Gray</dc:creator>
		
		<category><![CDATA[External]]></category>

		<category><![CDATA[Guest post]]></category>

		<category><![CDATA[Open Data]]></category>

		<category><![CDATA[Open Science]]></category>

		<category><![CDATA[WG Open Data in Science]]></category>

		<guid isPermaLink="false">http://blog.okfn.org/?p=3193</guid>
		<description><![CDATA[The following guest post is from Iain Hrynaszkiewicz who is Managing Editor at leading open access publisher BioMed Central and a member of the Open Knowledge Foundation’s Working Group on Open Data in Science.

Having already been scooped by some encouraging and eloquent activity 
in the open data blogosphere here, here, here and here (perhaps there [...]


Related posts:<ol><li><a href='http://blog.okfn.org/2010/06/08/biomed-central-open-data-award-winner-to-be-announced-this-week/' rel='bookmark' title='Permanent Link: BioMed Central Open Data Award: winner to be announced this week!'>BioMed Central Open Data Award: winner to be announced this week!</a></li><li><a href='http://blog.okfn.org/2010/04/21/thoughts-from-the-glas-possibilities-of-real-time-data-conference/' rel='bookmark' title='Permanent Link: Thoughts from the GLA&#8217;s Possibilities of Real Time Data conference'>Thoughts from the GLA&#8217;s Possibilities of Real Time Data conference</a></li><li><a href='http://blog.okfn.org/2007/12/19/response-to-the-future-of-bibliographic-control-draft-from-the-library-of-congress/' rel='bookmark' title='Permanent Link: Response to &#8216;The Future of Bibliographic Control&#8217; draft from the Library of Congress'>Response to &#8216;The Future of Bibliographic Control&#8217; draft from the Library of Congress</a></li></ol>]]></description>
			<content:encoded><![CDATA[<p><strong>The following guest post is from Iain Hrynaszkiewicz who is Managing Editor at leading open access publisher <a href="http://www.biomedcentral.com/">BioMed Central</a> and a member of the <a href="http://www.okfn.org/">Open Knowledge Foundation</a>’s <a href="http://wiki.okfn.org/wg/science">Working Group on Open Data in Science</a>.</strong></p>

<p>Having already been scooped by some encouraging and eloquent activity 
in the open data blogosphere <a href="http://cameronneylon.net/blog/the-bmc-10th-anniversary-celebrations-and-open-data-prize/" target="_blank">here</a>, <a href="http://bit.ly/94cpFm" target="_blank">here</a>, <a href="http://blogs.msdn.com/b/msr_er/" target="_blank">here</a> and <a href="http://blogs.openaccesscentral.com/blogs/bmcblog/entry/biomed_central_celebrates_excellence_in" target="_blank">here</a> (perhaps there are more), a few final comments 
and future thoughts on the 2009 BioMed Central <a href="http://www.biomedcentral.com/researchawards/opendata/" target="_blank">Open Data award</a>. <br /></p>

<p>Our internal scientific, and external 
academic, editors were vital in the candidate selection process this 
year, but next year we hope to involve more readers. Any article published 
in a BioMed Central journal (including those on physics and chemistry 
portals) in 2010 can be considered for the next award so we encourage 
all our readers – and editors – to <a href="mailto:iain.hrynaszkiewicz@biomedcentral.com" target="_blank">contact 
us</a> if they read or, even 
better, re-use any useful data published in 2010. <br />
</p>

<p>Thanks again to the judging panel – 
Lee Dirks, John Wilbanks, Peter Murray-Rust, Rufus Pollock and Cameron 
Neylon – and to Microsoft Research for supporting the award. 
Without their contributions the award simply would not have been possible. <br />
</p>

<p>As Cameron Neylon acknowledges in his <a href="http://cameronneylon.net/blog/the-bmc-10th-anniversary-celebrations-and-open-data-prize/" target="_blank">blog about the awards</a> licensing of open data is an important issue, 
and heterogeneity in approaches to data licensing was identified in 
the examples of original data published in BioMed Central’s journals 
identified during the selection process. We’re keen to develop a consensus 
and code of practice on licensing and re-use that best serves the needs 
of authors, editors and the open data community, in the many different 
fields incorporated in BioMed Central’s journals. <br />

</p>

<p>Another important issue we hope will 
be addressed by future nominees is standardization of data. Not only 
do we want articles that include raw data, but we are also looking for 
data in fully harvestable, machine readable and re-usable formats where 
possible, and appropriate. Over the coming weeks and months <a href="http://www.biomedcentral.com/bmcresnotes/" target="_blank"><i>BMC Research Notes</i></a> will be ratcheting up its involvement in helping 
to define domain-specific data file standards, including a publication 
of guidance and best practice. Any potential contributors are, again, 
encouraged to <a href="mailto:iain.hrynaszkiewicz@biomedcentral.com" target="_blank">get 
in touch</a>.</p>
<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fblog.okfn.org%2F2010%2F06%2F22%2Fsome-final-and-future-thoughts-on-the-biomed-central-open-data-award%2F">
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fblog.okfn.org%2F2010%2F06%2F22%2Fsome-final-and-future-thoughts-on-the-biomed-central-open-data-award%2F&amp;style=normal&amp;service=bit.ly&amp;service_api=okfn%3AR_fe0b7ee16cbd10dd919dea5f4ffd24a3" height="61" width="50" />
			</a>
		</div><p class="akst_link"><a href="http://blog.okfn.org/?p=3193&amp;akst_action=share-this"  title="E-mail this, post to del.icio.us, etc." id="akst_link_3193" class="akst_share_link" rel="nofollow">Share This</a>
</p>

<p>Related posts:<ol><li><a href='http://blog.okfn.org/2010/06/08/biomed-central-open-data-award-winner-to-be-announced-this-week/' rel='bookmark' title='Permanent Link: BioMed Central Open Data Award: winner to be announced this week!'>BioMed Central Open Data Award: winner to be announced this week!</a></li><li><a href='http://blog.okfn.org/2010/04/21/thoughts-from-the-glas-possibilities-of-real-time-data-conference/' rel='bookmark' title='Permanent Link: Thoughts from the GLA&#8217;s Possibilities of Real Time Data conference'>Thoughts from the GLA&#8217;s Possibilities of Real Time Data conference</a></li><li><a href='http://blog.okfn.org/2007/12/19/response-to-the-future-of-bibliographic-control-draft-from-the-library-of-congress/' rel='bookmark' title='Permanent Link: Response to &#8216;The Future of Bibliographic Control&#8217; draft from the Library of Congress'>Response to &#8216;The Future of Bibliographic Control&#8217; draft from the Library of Congress</a></li></ol></p>]]></content:encoded>
			<wfw:commentRss>http://blog.okfn.org/2010/06/22/some-final-and-future-thoughts-on-the-biomed-central-open-data-award/feed/</wfw:commentRss>
		</item>
		<item>
		<title>BioMed Central Open Data Award: winner to be announced this week!</title>
		<link>http://blog.okfn.org/2010/06/08/biomed-central-open-data-award-winner-to-be-announced-this-week/</link>
		<comments>http://blog.okfn.org/2010/06/08/biomed-central-open-data-award-winner-to-be-announced-this-week/#comments</comments>
		<pubDate>Tue, 08 Jun 2010 15:44:25 +0000</pubDate>
		<dc:creator>Jonathan Gray</dc:creator>
		
		<category><![CDATA[External]]></category>

		<category><![CDATA[Guest post]]></category>

		<category><![CDATA[Open Data]]></category>

		<category><![CDATA[Open Science]]></category>

		<category><![CDATA[WG Open Data in Science]]></category>

		<category><![CDATA[Working Groups]]></category>

		<guid isPermaLink="false">http://blog.okfn.org/?p=3065</guid>
		<description><![CDATA[The following guest post is from Iain Hrynaszkiewicz who is Managing Editor at leading open access journal BioMed Central and a member of the Open Knowledge Foundation’s Working Group on Open Data in Science.

All followers of this blog will know that science publishing now goes much beyond the traditional, IMRaD, journal article. As collective consciences [...]


Related posts:<ol><li><a href='http://blog.okfn.org/2010/06/22/some-final-and-future-thoughts-on-the-biomed-central-open-data-award/' rel='bookmark' title='Permanent Link: Some final - and future - thoughts on the BioMed Central Open Data award'>Some final - and future - thoughts on the BioMed Central Open Data award</a></li><li><a href='http://blog.okfn.org/2007/07/14/voyages-into-publisher-copyright/' rel='bookmark' title='Permanent Link: &#8216;Voyages into publisher copyright&#8217;'>&#8216;Voyages into publisher copyright&#8217;</a></li><li><a href='http://blog.okfn.org/2008/04/25/sparc-europe-seal-for-open-access-journals/' rel='bookmark' title='Permanent Link: SPARC Europe Seal for Open Access Journals'>SPARC Europe Seal for Open Access Journals</a></li></ol>]]></description>
			<content:encoded><![CDATA[<p><strong>The following guest post is from Iain Hrynaszkiewicz who is Managing Editor at leading open access journal <a href="http://www.biomedcentral.com/">BioMed Central</a> and a member of the <a href="http://www.okfn.org/">Open Knowledge Foundation</a>’s <a href="http://wiki.okfn.org/wg/science">Working Group on Open Data in Science</a>.</strong></p>

<p>All followers of this blog will know that science publishing now goes much beyond the traditional, IMRaD, journal article. As collective consciences are raised about data sharing it becomes an increasingly important part of the research and publication process, but great heterogeneity in the availability of data underlying peer-reviewed articles remains – arguably even more so in the availability of truly <a href="http://www.opendefinition.org/">open data</a>.</p>

<p>So in recognition of the challenges overcome by scientists who have shown leadership in openly sharing scientific data, particularly when sharing goes against cultural or community norms, the open access publisher <a href="http://www.biomedcentral.com/">BioMed Central</a> last month launched its <a href="http://www.biomedcentral.com/researchawards/opendata/">Open Data Award</a>.</p>

<p>BioMed Central teamed up with <a href="http://research.microsoft.com/en-us/">Microsoft Research</a> and <a href="http://pantonprinciples.org/">Panton Principles</a> to introduce the award, which will be presented to the winning author – of an article published by BioMed Central in 2009 – at BioMed Central’s 4th Annual Research Awards on Thursday 10th June. The support of Microsoft Research and Panton Principles for data-intensive science, and BioMed Central’s commitment to data via journals such as <a href="http://www.biomedcentral.com/bmcresnotes/">BMC Research Notes</a> is a logical, and exciting, collaboration.</p>

<p>The focus of the shortlisted articles, which were announced last week, is primary research articles that include potentially re-usable data as either additional files with the journal article or via a link to publicly available data hosted elsewhere. As well as many examples from fields that have collectively begun to recognize the benefits of openness, such as genomics, nominations were received across the full spectrum of BioMed Central’s 200+ journals. A number of articles presenting tools, policies or software to encourage data sharing were also published by BioMed Central in 2009, and this is reflected in the make-up of the shortlist.</p>

<p>A further update will follow after the winner is announced, at London’s Gherkin landmark, later this week.</p>

<p><img src="http://farm5.static.flickr.com/4018/4681936065_4d8fb480d4.jpg" alt="Screenshot of Open Data Awards" align="center" /></p>
<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fblog.okfn.org%2F2010%2F06%2F08%2Fbiomed-central-open-data-award-winner-to-be-announced-this-week%2F">
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fblog.okfn.org%2F2010%2F06%2F08%2Fbiomed-central-open-data-award-winner-to-be-announced-this-week%2F&amp;style=normal&amp;service=bit.ly&amp;service_api=okfn%3AR_fe0b7ee16cbd10dd919dea5f4ffd24a3" height="61" width="50" />
			</a>
		</div><p class="akst_link"><a href="http://blog.okfn.org/?p=3065&amp;akst_action=share-this"  title="E-mail this, post to del.icio.us, etc." id="akst_link_3065" class="akst_share_link" rel="nofollow">Share This</a>
</p>

<p>Related posts:<ol><li><a href='http://blog.okfn.org/2010/06/22/some-final-and-future-thoughts-on-the-biomed-central-open-data-award/' rel='bookmark' title='Permanent Link: Some final - and future - thoughts on the BioMed Central Open Data award'>Some final - and future - thoughts on the BioMed Central Open Data award</a></li><li><a href='http://blog.okfn.org/2007/07/14/voyages-into-publisher-copyright/' rel='bookmark' title='Permanent Link: &#8216;Voyages into publisher copyright&#8217;'>&#8216;Voyages into publisher copyright&#8217;</a></li><li><a href='http://blog.okfn.org/2008/04/25/sparc-europe-seal-for-open-access-journals/' rel='bookmark' title='Permanent Link: SPARC Europe Seal for Open Access Journals'>SPARC Europe Seal for Open Access Journals</a></li></ol></p>]]></content:encoded>
			<wfw:commentRss>http://blog.okfn.org/2010/06/08/biomed-central-open-data-award-winner-to-be-announced-this-week/feed/</wfw:commentRss>
		</item>
		<item>
		<title>A tour of climate data at CKAN</title>
		<link>http://blog.okfn.org/2010/02/24/a-tour-of-climate-data-at-ckan/</link>
		<comments>http://blog.okfn.org/2010/02/24/a-tour-of-climate-data-at-ckan/#comments</comments>
		<pubDate>Wed, 24 Feb 2010 08:00:36 +0000</pubDate>
		<dc:creator>David Jones</dc:creator>
		
		<category><![CDATA[CKAN]]></category>

		<category><![CDATA[Guest post]]></category>

		<category><![CDATA[Open Science]]></category>

		<guid isPermaLink="false">http://blog.okfn.org/?p=2009</guid>
		<description><![CDATA[The following guest post is by David Jones who is, among other things, a curator of the climate data group on CKAN (the OKF&#8217;s open source registry of open data) and co-founder of Clear Climate Code (which was previously featured on our blog here and here).

Take a tour of some of the additions we&#8217;ve made [...]


Related posts:<ol><li><a href='http://blog.okfn.org/2008/09/17/clearer-climate-code/' rel='bookmark' title='Permanent Link: Clearer Climate Code'>Clearer Climate Code</a></li><li><a href='http://blog.okfn.org/2010/01/28/clear-climate-code-and-data/' rel='bookmark' title='Permanent Link: Clear Climate Code, and Data'>Clear Climate Code, and Data</a></li><li><a href='http://blog.okfn.org/2009/12/05/climate-change-climate-sceptics-and-open-data/' rel='bookmark' title='Permanent Link: Climate Change, Climate Sceptics and Open Data'>Climate Change, Climate Sceptics and Open Data</a></li></ol>]]></description>
			<content:encoded><![CDATA[<p><strong>The following guest post is by <a href="http://drj11.wordpress.com/">David Jones</a> who is, among other things, a curator of the <a href="http://www.ckan.net/group/climatedata">climate data group</a> on <a href="http://www.ckan.net/">CKAN</a> (the OKF&#8217;s open source registry of open data) and co-founder of <a href="http://clearclimatecode.org/">Clear Climate Code</a> (which was previously featured on our blog <a href="http://blog.okfn.org/2008/09/17/clearer-climate-code/">here</a> and <a href="http://blog.okfn.org/2010/01/28/clear-climate-code-and-data/">here</a>).</strong></p>

<p>Take a tour of some of the additions we&#8217;ve made to the <a href="http://www.ckan.net/group/climatedata">climate data group</a> at <a href="http://www.ckan.net/">OKF&#8217;s CKAN</a>.</p>

<p>The Mauna Loa observatory, Hawaii, has the longest period of continual recording of the amount of CO<sub>2</sub> (carbon dioxide) in the air, the <em>airborne fraction</em>.  The <a href="http://www.ckan.net/package/co2-trends">data are available in CKAN</a>, and here they are in chart form:
<img src="http://blog.okfn.org/wp-content/uploads/2010/02/co2.png" alt="co2" title="co2" width="400" height="300" class="alignnone size-full wp-image-2010" /></p>

<p>CO<sub>2</sub> is a relatively well mixed gas in the atmosphere, but even so, it would be unwise to rely on a single location for measurements.  The <a href="http://www.ckan.net/package/cdiac">Carbon Dioxide Analysis Center</a> maintain a global network of stations collecting CO<sub>2</sub>.  In fact Mauna Loa is not used for the global average because its height gives it a CO<sub>2</sub> fraction that is lower than the surface average by 1 to 2 ppm.</p>

<p>What about reconstructing historical CO<sub>2</sub> levels?  One source is ice cores.  On the Antarctic ice sheet snow falls every year and never melts.  New snow falls on the snow from the previous year, building up in layers.  Eventually the snow builds up to a thickness where it compresses the snow beneath it into solid ice, ice that is impermeable to gas.  At that point, air at the surface becomes trapped in little bubbles in the ice.</p>

<p><img src="http://blog.okfn.org/wp-content/uploads/2010/02/vostok400.png" alt="vostok400" title="vostok400" width="400" height="492" class="alignnone size-full wp-image-2011" /></p>

<p>By drilling down through the ice we can reach older and older ice.  Vostok Station sits on the Antarctic ice sheet, above Lake Vostok.  Researchers have drilled down through the ice to a depth of 3623m and reached ice that is about 400,000 years old.  Drilling was stopped, tantalisingly close to the lake surface, because the Scientific Committee on Antarctic Research (SCAR) raised concerns that life in Lake Vostok, potentially forming a unique biome, may be contaminated.</p>

<p>By measuring the CO<sub>2</sub> content of the gas trapped in the ice core, we can reconstruct the historical levels.  Of course, <a href="http://www.ckan.net/package/vostok-co2">the data are in CKAN</a>.  Here&#8217;s a chart:
<img src="http://blog.okfn.org/wp-content/uploads/2010/02/vostokco2.png" alt="vostokco2" title="vostokco2" width="400" height="300" class="alignnone size-full wp-image-2013" /></p>

<p>(Other <a href="http://www.ckan.net/package/vostok-deuterium-temperature">data from the Vostok ice core</a> are also available)</p>

<p>Vostok is well known for being the coldest place on Earth.  Vostok Station was established in 1957, and since that time weather records have been kept by researchers working there.  The temperature record for Vostok Station is just one of the many thousands of records made available in the <a href="http://www.ckan.net/package/ghcn">Global Historical Climate Network (GHCN)</a>.  Here&#8217;s Vostok&#8217;s temperature record for the last three decades (more data are available, but three decades fits nicely):</p>

<p><img src="http://blog.okfn.org/wp-content/uploads/2010/02/vostoktemp.png" alt="vostoktemp" title="vostoktemp" width="436" height="287" class="alignnone size-full wp-image-2036" /></p>

<p>The different colours are because for a particular station the whole series can be comprised of individual records that cover only part of the range (due to different equipment, different reporting procedures, and so on); each record gets a different colour (unfortunately the records often overlap, confusing the colours).</p>

<p>The station records in GHCN, sometimes augmented by other similar datasets such as SCAR&#8217;s <a href="http://www.ckan.net/package/scar-reader">READER</a>, are used to reconstruct global temperature anomalies, like the <a href="http://www.ckan.net/package/jma-temperature">Japan Meteorological Agency Global Surface Temperature Anomaly</a>, <a href="http://www.ckan.net/package/uk-metoffice-hadcrut3">HadCRUT3 from the UK&#8217;s Met Office and the University of East Anglia&#8217;s Climate Research Unit</a>, and, perhaps most famously, <a href="http://www.ckan.net/package/gistemp">GISTEMP</a> from NASA:</p>

<p><img src="http://chart.apis.google.com/chart?cht=lc&#038;chtt=GISTEMP+Global+Temperature+Anomaly&#038;chds=-100,100&#038;chd=t:-28,-21,-26,-27,-32,-32,-29,-36,-27,-17,-39,-27,-32,-33,-33,-25,-14,-11,-25,-15,-7,-14,-24,-30,-34,-24,-19,-39,-33,-35,-33,-34,-32,-30,-15,-10,-30,-39,-33,-20,-19,-14,-26,-22,-22,-17,-2,-15,-12,-26,-8,-2,-8,-19,-7,-12,-5,7,10,1,4,10,3,9,19,6,-5,0,-4,-7,-15,-4,3,11,-10,-10,-17,8,8,6,-1,7,4,8,-21,-11,-3,-1,-4,8,3,-10,0,14,-8,-5,-16,12,1,8,18,26,4,26,9,5,12,26,31,19,37,35,12,13,23,37,29,39,56,32,33,48,56,55,48,62,54,57,43,57,-999&#038;chxt=x,y,r&#038;chxl=0:%7C1880%7C%7C%7C%7C%7C%7C%7C%7C%7C%7C1890%7C%7C%7C%7C%7C%7C%7C%7C%7C%7C1900%7C%7C%7C%7C%7C%7C%7C%7C%7C%7C1910%7C%7C%7C%7C%7C%7C%7C%7C%7C%7C1920%7C%7C%7C%7C%7C%7C%7C%7C%7C%7C1930%7C%7C%7C%7C%7C%7C%7C%7C%7C%7C1940%7C%7C%7C%7C%7C%7C%7C%7C%7C%7C1950%7C%7C%7C%7C%7C%7C%7C%7C%7C%7C1960%7C%7C%7C%7C%7C%7C%7C%7C%7C%7C1970%7C%7C%7C%7C%7C%7C%7C%7C%7C%7C1980%7C%7C%7C%7C%7C%7C%7C%7C%7C%7C1990%7C%7C%7C%7C%7C%7C%7C%7C%7C%7C2000%7C%7C%7C%7C%7C%7C%7C%7C%7C%7C2010%7C1:%7C%7C-0.5%7C+0.0%7C+0.5%7C%7C2:%7C%7C-0.5%7C+0.0%7C+0.5%7C&#038;chco=ff0000,000000&#038;chs=400x300"/></p>

<p>The seasonal cycle is evident in the Mauna Loa CO<sub>2</sub> (it&#8217;s caused by photosynthesis of plants, mostly in the Northern Hemisphere, drawing down more CO<sub>2</sub> from the atmosphere), and also in the Vostok temperature record.  In some cases the seasonal signal and the long term trend are easily visible, in others in takes effort to recover the long term trend.  The GISTEMP graph can be thought of as recovering the long term trend from many thousands of individual station records.</p>

<p>Another well known climate data series with both seasonal and long term trends is the <a href="http://www.ckan.net/package/nsidc">National Snow and Ice Data Centre&#8217;s</a> Arctic Sea Ice Extent:</p>

<p><img src="http://blog.okfn.org/wp-content/uploads/2010/02/arcticseaice.png" alt="arcticseaice" title="arcticseaice" width="400" height="300" class="alignnone size-full wp-image-2041" /></p>

<p>The seasonal cycle in the Arctic sea ice is of course due to summer melt and winter freeze.</p>

<p>Another data set available as a CKAN package is the <a href="http://www.ckan.net/package/colorado-sealevelchange">Colorado Sea Level data</a>.  This is a measurement of global mean sea level obtained by a series of satellites: TOPEX, then JASON-1 and JASON-2.  The next satellite in this series is JASON-3, and <a href="http://news.bbc.co.uk/1/hi/8491116.stm">it has just secured funding from a European consortium</a>.  There is a seasonal cycle in this data too:</p>

<p><img src="http://blog.okfn.org/wp-content/uploads/2010/02/satellitemeansealevel.png" alt="satellitemeansealevel" title="satellitemeansealevel" width="400" height="300" class="alignnone size-full wp-image-2059" /></p>

<p>Now I <em>think</em> the seasonal cycle in mean sea level is due to the thermal expansion of the oceans.  In the summer the ocean warms and expands; both the northern hemisphere and the southern hemisphere are affected but the effects don&#8217;t balance so there is a seasonal cycle.  Please contact your tour guide (leave a comment!) if you can find a reliable explanation (I looked and was unable to find a good source).</p>

<p>The satellite era has been tremendously useful for earth observation and climate science, but of course the records from satellites are short. For example, the satellite data for sea level only goes back to 1993.  Since climate is often a matter of looking at events on long timescales we often have to find longer series from other measurements.</p>

<p>The UK&#8217;s Natural Environment Research Council maintains the Permanent Service for Mean Sea Level at the Proudman Oceanographic Laboratory.  Using a global network of about 2000 tide gauges they can reconstruct a global mean sea level record that documents sea level rise since 1880.  Here&#8217;s a chart of the <a href="http://www.ckan.net/package/psmsl">data available from the CKAN package</a>:</p>

<p><img src="http://blog.okfn.org/wp-content/uploads/2010/02/meansealevel.png" alt="meansealevel" title="meansealevel" width="400" height="300" class="alignnone size-full wp-image-2060" /></p>

<p>The tour is coming to an end now.  The data that I&#8217;ve shown here are just a selection of the data available, both generally and in <a href="http://www.ckan.net/">CKAN</a>.  Often there is much more detailed data (and more detailed science) behind each of these datasets, but one of the reasons I&#8217;ve selected many of these datasets is that they are <em>key indicators</em>.  They are the headline figures that show increased CO<sub>2</sub> emissions, rising sea levels, decreasing Arctic sea ice.  These are the data that a curious member of the public will want to engage with, and that reason makes it important that the <a href="http://opendefinition.org/">data are accessible and freely accessible</a>.</p>

<p>If you&#8217;d like to contribute to the climate data group then <a href="http://www.okfn.org/contact">please drop us an e-mail</a>.  If you&#8217;d like to continue the tour on your own you might want to try the <a href="http://www.ckan.net/package/redsea-sealevel">Red Sea Sea Level records</a> and the <a href="http://www.ckan.net/package/itrdb">Paleo Tree Ring records</a> which are just around the corner in the <a href="http://www.ckan.net/group/archaeology">Open Archeology wing</a>.</p>

<p>If you&#8217;re interested in promoting open data in climate science, you may wish to <a href="http://pantonprinciples.org/endorse/">endorse the Panton Principles</a>, which were <a href="http://blog.okfn.org/2010/02/19/launch-of-the-panton-principles-for-open-data-in-science/">launched last week</a>.</p>

<h3>References and Credits</h3>

<ul>
<li>The Mauna Loa data comes from <a href="http://www.esrl.noaa.gov/gmd/ccgg/trends/">NOAA/ESRL</a>.</li>
<li>Vostok Ice Core data comes from <a href="http://www.ncdc.noaa.gov/paleo/icecore/antarctica/vostok/vostok_co2.html">NOAA paleoclimatology</a>.</li>
<li>Vostok Station temperature data comes from <a href="http://www.ncdc.noaa.gov/oa/climate/ghcn-monthly/index.php">NOAA&#8217;s GHCN data</a>.</li>
<li>The GISTEMP global temperature anomaly comes from <a href="http://data.giss.nasa.gov/gistemp/">NASA GISS</a>.</li>
<li>Arctic Sea Ice data comes from the <a href="http://nsidc.org/">National Snow and Ice Data Center</a>.</li>
<li>Satellite Sea Level data comes from the <a href="http://sealevel.colorado.edu/">University of Colorado at Boulder</a>.</li>
<li>Tide Gauge Sea Level data comes from the <a href="http://www.pol.ac.uk/psmsl/">Proudman Oceanographic Laboratory</a>.</li>
<li>Charts were plotted using <a href="http://code.google.com/apis/charttools/index.html">Google Chart Tools</a> and programs written in <a href="www.python.org/">Python</a>; the programs for Vostok Station and the GISTEMP anomaly were borrowed from <a href="http://clearclimatecode.org/">Clear Climate Code</a>.</li>
</ul>
<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fblog.okfn.org%2F2010%2F02%2F24%2Fa-tour-of-climate-data-at-ckan%2F">
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fblog.okfn.org%2F2010%2F02%2F24%2Fa-tour-of-climate-data-at-ckan%2F&amp;style=normal&amp;service=bit.ly&amp;service_api=okfn%3AR_fe0b7ee16cbd10dd919dea5f4ffd24a3" height="61" width="50" />
			</a>
		</div><p class="akst_link"><a href="http://blog.okfn.org/?p=2009&amp;akst_action=share-this"  title="E-mail this, post to del.icio.us, etc." id="akst_link_2009" class="akst_share_link" rel="nofollow">Share This</a>
</p>

<p>Related posts:<ol><li><a href='http://blog.okfn.org/2008/09/17/clearer-climate-code/' rel='bookmark' title='Permanent Link: Clearer Climate Code'>Clearer Climate Code</a></li><li><a href='http://blog.okfn.org/2010/01/28/clear-climate-code-and-data/' rel='bookmark' title='Permanent Link: Clear Climate Code, and Data'>Clear Climate Code, and Data</a></li><li><a href='http://blog.okfn.org/2009/12/05/climate-change-climate-sceptics-and-open-data/' rel='bookmark' title='Permanent Link: Climate Change, Climate Sceptics and Open Data'>Climate Change, Climate Sceptics and Open Data</a></li></ol></p>]]></content:encoded>
			<wfw:commentRss>http://blog.okfn.org/2010/02/24/a-tour-of-climate-data-at-ckan/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Launch of the Panton Principles for Open Data in Science and &#8216;Is It Open Data?&#8217; Web Service</title>
		<link>http://blog.okfn.org/2010/02/19/launch-of-the-panton-principles-for-open-data-in-science/</link>
		<comments>http://blog.okfn.org/2010/02/19/launch-of-the-panton-principles-for-open-data-in-science/#comments</comments>
		<pubDate>Fri, 19 Feb 2010 10:25:13 +0000</pubDate>
		<dc:creator>Jonathan Gray</dc:creator>
		
		<category><![CDATA[News]]></category>

		<category><![CDATA[OKF]]></category>

		<category><![CDATA[OKF Projects]]></category>

		<category><![CDATA[Open Data]]></category>

		<category><![CDATA[Open Science]]></category>

		<category><![CDATA[WG Open Data in Science]]></category>

		<guid isPermaLink="false">http://blog.okfn.org/?p=2067</guid>
		<description><![CDATA[We are pleased to announce the Panton Principles for Open Data in Science:


http://www.pantonprinciples.org/


From the preamble:


  Science is based on building on, reusing and openly criticising the published body of scientific knowledge.
  For science to effectively function, and for society to reap the full benefits from scientific endeavours, it is crucial that science data [...]


Related posts:<ol><li><a href='http://blog.okfn.org/2010/06/23/panton-principle-authors-win-sparc-innovator-prize/' rel='bookmark' title='Permanent Link: Panton Principle authors win SPARC Innovator prize!'>Panton Principle authors win SPARC Innovator prize!</a></li><li><a href='http://blog.okfn.org/2010/03/25/comments-on-the-panton-principles-and-data-licensing/' rel='bookmark' title='Permanent Link: Comments on the Panton Principles and Data Licensing'>Comments on the Panton Principles and Data Licensing</a></li><li><a href='http://blog.okfn.org/2009/05/28/virtual-meeting-for-working-group-on-open-data-in-science/' rel='bookmark' title='Permanent Link: Virtual meeting for Working Group on Open Data in Science'>Virtual meeting for Working Group on Open Data in Science</a></li></ol>]]></description>
			<content:encoded><![CDATA[<p>We are pleased to announce the <a href="http://www.pantonprinciples.org/">Panton Principles for Open Data in Science</a>:</p>

<ul>
<li><a href="http://www.pantonprinciples.org/">http://www.pantonprinciples.org/</a></li>
</ul>

<p>From the preamble:</p>

<blockquote>
  <p>Science is based on building on, reusing and openly criticising the published body of scientific knowledge.
  For science to effectively function, and for society to reap the full benefits from scientific endeavours, it is crucial that science data be made <a href="http://www.opendefinition.org/">open</a>.</p>
  
  <p>By open data in science we mean that it is freely available on the public internet permitting any user to download, copy, analyse, re-process, pass them to software or use them for any other purpose without financial, legal, or technical barriers other than those inseparable from gaining access to the internet itself. To this end data related to published science should be explicitly placed in the public domain.</p>
</blockquote>

<p>The principles themselves state that legal status of scientific datasets should be made explicit and that content licenses are not appropriate for data. They strongly discourage non-commercial licenses and strongly encourage dedication to the public domain. You can endorse the principles at:</p>

<ul>
<li><a href="http://pantonprinciples.org/endorse/">http://pantonprinciples.org/endorse/</a></li>
</ul>

<p>We&#8217;d greatly appreciate any help in disseminating the principles - including blogging, microblogging and forwarding to relevant people!</p>

<p>The first draft of Panton Principles was written in July 2009 by Peter Murray-Rust, Cameron Neylon, Rufus Pollock and John Wilbanks at the Panton Arms on Panton Street in Cambridge, UK, just down from the Chemistry Faculty where Peter works. They were then refined with the help of the members of the <a href="http://wiki.okfn.org/wg/science">Open Knowledge Foundation Working Group on Open Data in Science</a>.</p>

<p>The <a href="http://www.isitopendata.org/">Is It Open Data?</a> web service is also now live. This allows anyone to make and publicly record an enquiry into the openness of a given (scientific) dataset. It is also an initiative of the <a href="http://wiki.okfn.org/wg/science">Working Group on Open Data in Science</a>.</p>

<ul>
<li><a href="http://www.isitopendata.org/">http://www.isitopendata.org/</a></li>
</ul>

<div align="center"><img src="http://farm3.static.flickr.com/2743/4370186974_363d182500.jpg" alt="Panton Principles" /></div>
<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fblog.okfn.org%2F2010%2F02%2F19%2Flaunch-of-the-panton-principles-for-open-data-in-science%2F">
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fblog.okfn.org%2F2010%2F02%2F19%2Flaunch-of-the-panton-principles-for-open-data-in-science%2F&amp;style=normal&amp;service=bit.ly&amp;service_api=okfn%3AR_fe0b7ee16cbd10dd919dea5f4ffd24a3" height="61" width="50" />
			</a>
		</div><p class="akst_link"><a href="http://blog.okfn.org/?p=2067&amp;akst_action=share-this"  title="E-mail this, post to del.icio.us, etc." id="akst_link_2067" class="akst_share_link" rel="nofollow">Share This</a>
</p>

<p>Related posts:<ol><li><a href='http://blog.okfn.org/2010/06/23/panton-principle-authors-win-sparc-innovator-prize/' rel='bookmark' title='Permanent Link: Panton Principle authors win SPARC Innovator prize!'>Panton Principle authors win SPARC Innovator prize!</a></li><li><a href='http://blog.okfn.org/2010/03/25/comments-on-the-panton-principles-and-data-licensing/' rel='bookmark' title='Permanent Link: Comments on the Panton Principles and Data Licensing'>Comments on the Panton Principles and Data Licensing</a></li><li><a href='http://blog.okfn.org/2009/05/28/virtual-meeting-for-working-group-on-open-data-in-science/' rel='bookmark' title='Permanent Link: Virtual meeting for Working Group on Open Data in Science'>Virtual meeting for Working Group on Open Data in Science</a></li></ol></p>]]></content:encoded>
			<wfw:commentRss>http://blog.okfn.org/2010/02/19/launch-of-the-panton-principles-for-open-data-in-science/feed/</wfw:commentRss>
		</item>
		<item>
		<title>Clear Climate Code, and Data</title>
		<link>http://blog.okfn.org/2010/01/28/clear-climate-code-and-data/</link>
		<comments>http://blog.okfn.org/2010/01/28/clear-climate-code-and-data/#comments</comments>
		<pubDate>Thu, 28 Jan 2010 17:22:02 +0000</pubDate>
		<dc:creator>David Jones</dc:creator>
		
		<category><![CDATA[Exemplars]]></category>

		<category><![CDATA[External]]></category>

		<category><![CDATA[Open Data]]></category>

		<category><![CDATA[Open Science]]></category>

		<guid isPermaLink="false">http://blog.okfn.org/?p=1784</guid>
		<description><![CDATA[The following guest post is by David Jones who is, among other things, a curator of the climate data group on CKAN (the OKF&#8217;s open source registry of open data) and co-founder of Clear Climate Code (which we blogged about back in 2008).

Clear Climate Code have been working on ccc-gistemp, a project to reimplement in [...]


Related posts:<ol><li><a href='http://blog.okfn.org/2008/09/17/clearer-climate-code/' rel='bookmark' title='Permanent Link: Clearer Climate Code'>Clearer Climate Code</a></li><li><a href='http://blog.okfn.org/2010/02/24/a-tour-of-climate-data-at-ckan/' rel='bookmark' title='Permanent Link: A tour of climate data at CKAN'>A tour of climate data at CKAN</a></li><li><a href='http://blog.okfn.org/2009/12/05/climate-change-climate-sceptics-and-open-data/' rel='bookmark' title='Permanent Link: Climate Change, Climate Sceptics and Open Data'>Climate Change, Climate Sceptics and Open Data</a></li></ol>]]></description>
			<content:encoded><![CDATA[<p><strong>The following guest post is by <a href="http://drj11.wordpress.com/">David Jones</a> who is, among other things, a curator of the <a href="http://www.ckan.net/group/climatedata">climate data group</a> on <a href="http://www.ckan.net/">CKAN</a> (the OKF&#8217;s open source registry of open data) and co-founder of <a href="http://clearclimatecode.org/">Clear Climate Code</a> (which we blogged about <a href="http://blog.okfn.org/2008/09/17/clearer-climate-code/">back in 2008</a>).</strong></p>

<p>Clear Climate Code have been working on <a href="http://clearclimatecode.org/gistemp/"><em>ccc-gistemp</em></a>, a project to reimplement in clear Python <a href="http://data.giss.nasa.gov/gistemp/">NASA&#8217;s GISTEMP</a>.  GISTEMP is a global historical temperature analysis, it produces, amongst other things, graphs like this, that tell you whether the Earth is getting warmer or cooler:</p>

<p><img src="http://data.giss.nasa.gov/gistemp/graphs/Fig.A2.lrg.gif" width="400" height="290" alt="Official GISTEMP global anomaly." /></p>

<p>Because this graph is important for studying the world&#8217;s climate (and determining the signature of global warming), there is a lot of public discussion about where this data comes from.  The raw data underlying the graph is surface weather station temperature records.  The raw data is <em>processed</em> to produce the data for the graph:</p>

<p><img src="http://blog.okfn.org/wp-content/uploads/2010/01/gistemp.png" alt="gistemp" title="gistemp" width="400" height="238" class="alignnone size-full wp-image-1823" /></p>

<p>The box in the middle, labelled &#8220;GISTEMP&#8221;, is a process that converts the raw station records into the data for the graph on the right, which is the global temperature anomaly.  There are descriptions of this process available, for example <a href="http://pubs.giss.nasa.gov/abstracts/1987/Hansen_Lebedeff.html">Hansen and Lebedeff, 1987</a>.  A description is one thing, but it might not tell you everything you need to know.  Perhaps the description is sufficiently clear and accurate for you to reproduce the process, perhaps not. The ultimate authority on the process is the <em>source code</em> that implements it, because It&#8217;s the source code that is executed in order to produce the processed data.  So if you want to know exactly what the process involves, you have to get hold of the source code.</p>

<p>In effect it is the source code that adds value to the raw data to produce processed data.  So in a sense, the value of the processed data is embodied in the source code.  That&#8217;s what makes the source code important.</p>

<p>The source code for GISTEMP is written mostly in Fortran by scientists at NASA, and is <a href="http://data.giss.nasa.gov/gistemp/sources/">available from them</a>.  This source code is the working code used by the NASA scientists, it is not necessarily the best source code for explaining how the process works (to an interested and competent member of the general public).  There is the question of whether NASA, a publicly funded body, should be paying someone to write code that makes a better tool for communicating with the public (for example by writing better documentation, or writing it in a more exemplary style).  I am not going to address that question.  The source code NASA use is the source code we have right now.</p>

<p>Our goal at Clear Climate Code is to take this code and produce a new version that is clearer, but does the same thing.  We have taken great steps forward towards this goal: We have recently <a href="http://clearclimatecode.org/ccc-gistemp-release-0-3-0/">released a version which is all in Python and which reproduces NASA&#8217;s results exactly</a>.  We think much of this code is already a great deal clearer than the starting material, but we continue to make it clearer.  Of course we would welcome your support.  If you want to help, please join <a href="http://groups.google.com/group/ccc-gistemp-discuss">our mailing list</a>, or you can follow our progress at <a href="http://clearclimatecode.org/">our blog</a> and on <a href="http://twitter.com/clear_climate">twitter</a>.</p>

<p>The reasons Clear Climate Code chose Python as the implementation language for ccc-gistemp are: <em>accessibility</em>, <em>clarity</em>, and <em>familiarity</em>.  By accessible I mean that there is a large community of Python programmers, but also there are several tutorials and other materials for learning Python should you be motivated.  <a href="http://lib5.leeds.ac.uk/rlists/broker/index.php?bbModuleId=200910_22366_COMP1550&#038;bbListId=_594307_1&#038;bbLastListId=_527086_1&#038;arc=host_vlebb&#038;hld=0&#038;sess=200910&#038;typ=PUB">Python is used to teach undergraduates programming</a>.  Python is relatively clear; it&#8217;s deliberately designed to be free of the clutter that imperils other programming languages.  It&#8217;s certainly possible for people who are not professional programmers to create small programs in Python, and examine and modify existing Python programs.  And lastly, it&#8217;s familiar; Nick Barnes and I already knew Python when we started the project.  This seems like a trivial consideration, but in fact Clear Climate Code is an unpaid project and it&#8217;s pretty easy to come up with reasons to do something else instead, so the fact that we already knew Python was important.</p>

<p>Hopefully Clear Climate Code illustrates how both code <em>and</em> data are central to the public understanding of science.  For an issue like global warming it is absolutely crucial that public are involved.  <a href="http://www.ckan.net/group/climatedata">CKAN&#8217;s climate data group</a> is a place where non-specialists can access scientist&#8217;s data more easily, and hopefully use it to innovate, do their own hobby science, or create visualisations to better communicate with the public.  I&#8217;m hoping to add more data sources to the climate data group in the near future, if you&#8217;re interested in adding more data to this group, please <a href="http://okfn.org/contact">get in touch</a>.</p>
<div class="tweetmeme_button" style="float: right; margin-left: 10px;">
			<a href="http://api.tweetmeme.com/share?url=http%3A%2F%2Fblog.okfn.org%2F2010%2F01%2F28%2Fclear-climate-code-and-data%2F">
				<img src="http://api.tweetmeme.com/imagebutton.gif?url=http%3A%2F%2Fblog.okfn.org%2F2010%2F01%2F28%2Fclear-climate-code-and-data%2F&amp;style=normal&amp;service=bit.ly&amp;service_api=okfn%3AR_fe0b7ee16cbd10dd919dea5f4ffd24a3" height="61" width="50" />
			</a>
		</div><p class="akst_link"><a href="http://blog.okfn.org/?p=1784&amp;akst_action=share-this"  title="E-mail this, post to del.icio.us, etc." id="akst_link_1784" class="akst_share_link" rel="nofollow">Share This</a>
</p>

<p>Related posts:<ol><li><a href='http://blog.okfn.org/2008/09/17/clearer-climate-code/' rel='bookmark' title='Permanent Link: Clearer Climate Code'>Clearer Climate Code</a></li><li><a href='http://blog.okfn.org/2010/02/24/a-tour-of-climate-data-at-ckan/' rel='bookmark' title='Permanent Link: A tour of climate data at CKAN'>A tour of climate data at CKAN</a></li><li><a href='http://blog.okfn.org/2009/12/05/climate-change-climate-sceptics-and-open-data/' rel='bookmark' title='Permanent Link: Climate Change, Climate Sceptics and Open Data'>Climate Change, Climate Sceptics and Open Data</a></li></ol></p>]]></content:encoded>
			<wfw:commentRss>http://blog.okfn.org/2010/01/28/clear-climate-code-and-data/feed/</wfw:commentRss>
		</item>
	</channel>
</rss>
