Support Us

You are browsing the archive for Open Definition.

Protecting the foundations of Open Knowledge

February 13, 2013 in OKF, Open Definition, Open Knowledge Definition, Open Standards

The foundations of the Foundation

The Open Knowledge Definition (OKD) was one of the Open Knowledge Foundation’s very first projects: drafted in 2005, 1.0 in 2006. By stipulating what Open means, the OKD has been foundational to the OKF’s work, as illustrated by this several-years-old diagram of the Open Knowledge “stack”.

Knowing your foundations seems a must in any field, but even more so in an explosively growing and cross-disciplinary one. The OKD has kept the OKF itself on-track, as it has started and facilitated dozens of projects over the last years.

Burgeoning movements for open access, culture, data, education, government, and more have also benefited from a shared understanding of Open in face of “openwashing” on one hand, and lack of understanding on another. In either case, when works and projects claimed or intended as Open are actually closed, society loses: closed doesn’t create an interoperable commons.

A selection of OKF blog posts from the past few years illustrates how the OKD plays a low-profile but essential role in setting the standard for Open in a variety of fields:

Recent developments

In 2008 an Advisory Council was inaugurated to steward the OKD and related defintions. I joined the council later in 2008, and recently agreed to serve as its chair for a year.

Since then we’ve discussed and provided feedback on intended-open licenses, in particular an Open Government License Canada proposal, iterated on an ongoing discussion about refinements needed in the next version of the OKD, and made our processes for approving licenses – as well as new council members – slightly more rigorous.

We’ve also taken the crucial step of adding new council members with deep expertise in Public Sector Information/Open Government Data, where we expect much of the “action” in Open and intended-open licenses in the next years to be. I’m very happy to welcome:

  • Baden Appleyard, National Programme Director at AusGOAL
  • Tariq Khokhar, Open Data Evangelist at the World Bank
  • Herb Lainchbury, Citizen, Developer and Founder of OpenDataBC.ca
  • Federico Morando, Managing Director at the Nexa Center
  • Andrew Stott, Former Director for Transparency and Digital Engagement and Co-Chair of the Open Government Data Working Group at the Open Knowledge Foundation.

While many of them will be well known to many of our readers, you may find their brief bios and websites on the Advisory Council page.

It is also time to thank three former council members for their service in years past:

  • Paul Jacobson
  • Rob Styles
  • John Wilbanks

Open movements will continue to grow rapidly (unless we fail miserably). You can help ensure we succeed splendidly! We could always use more help reviewing and providing feedback on licenses, but there are also roles for designers, programmers, translators, writers, and people committed to sound open strategy. See a recent get involved update for more.

Most of all, make sure your open access / culture / education / government / science project is truly open — OpenDefinition.org is a good place for you and your colleagues to start!

Making a Real Commons: Creative Commons should Drop the Non-Commercial and No-Derivatives Licenses

October 4, 2012 in Featured, Free Culture, Open Content, Open Data, Open Definition, Open Standards, Open/Closed, WG Open Licensing

Students for Free Culture recently published two excellent pieces about why Creative Commons should drop their Non-Commercial and No-Derivatives license variants:

As the first post says:

Over the past several years, Creative Commons has increasingly recommended free culture licenses over non-free ones. Now that the drafting process for version 4.0 of their license set is in full gear, this is a “a once-in-a-decade-or-more opportunity” to deprecate the proprietary NonCommercial and NoDerivatives clauses. This is the best chance we have to dramatically shift the direction of Creative Commons to be fully aligned with the definition of free cultural works by preventing the inheritance of these proprietary clauses in CC 4.0′s final release.

After reiterating some of the most common criticisms and objections against the NC and ND restrictions (if you are not familiar with these then they are worth reading up on), the post continues:

Most importantly, though, is that both clauses do not actually contribute to a shared commons. They oppose it.

This is a crucial point and one that I and others at the Open Knowledge Foundation have made time and time again. Simply: the Creative Commons licenses do not make a commons.

As I wrote on my personal blog last year:

Ironically, despite its name, Creative Commons, or more precisely its licenses, do not produce a commons. The CC licenses are not mutually compatible, for example, material with a CC Attribution-Sharealike (by-sa) license cannot be intermixed with material licensed with any of the CC NonCommercial licenses (e.g. Attribution-NonCommercial, Attribution-Sharealike-Noncommercial).

Given that a) the majority of CC licenses in use are ‘non-commercial’ b) there is also large usage of ShareAlike (e.g. Wikipedia), this is an issue affects a large set of ‘Creative Commons’ material.

Unfortunately, the presence of the word ‘Commons’ in CC’s name and the prominence of ‘remix’ in the advocacy around CC tends to make people think, falsely, that all CC licenses as in some way similar or substitutable.

The NC and ND licenses prevent CC licensed works forming a unified open digital commons that everyone is free to use, reuse and redistribute.

Perhaps if Creative Commons were instead called ‘Creative Choice’ and it were clearer that only a subset of the licenses (namely CC0, CC-BY and CC-BY-SA) contribute to the development of a genuine, unified, interoperable commons then this would not be so problematic. But the the fact that CC appears to promote such a commons (which in fact it does not) ultimately has a detrimental effect on the growth and development of the open digital commons.

As the Free Culture blog puts it:

Creative Commons could have moved towards being a highly-flexible modular licensing platform that enabled rightsholders to fine-tune the exact rights they wished to grant on their works, but there’s a reason that didn’t happen. We would be left with a plethora of incompatible puddles of culture. Copyright already gives rightsholdors all of the power. Creative Commons tries to offer a few simple options not merely to make the lives of rightsholders easier, but to do so towards the ends of creating a commons.

Whilst Free Culture are focused on “content” the situation is, if anything, more serious for data where combination and reuse is central and therefore interoperability (and the resulting open commons) are especially important.

We therefore believe this is the time for Creative Commons to either retire the NC and ND license variants, or spin them off into a separate entity which does not purport to promote or advance a digital commons (e.g. ‘Creative Choice’).

Please consider joining us and Students for a Free Culture in the call to Creative Commons to make the necessary changes:

Announcing the Open Definition Licenses Service

February 16, 2012 in Open Content, Open Data, Open Definition, Open Knowledge Definition, Open Standards, Our Work, WG Open Licensing

We’re pleased to announce a simple new service from the Open Knowledge Foundation as part of the Open Definition Project: the (Open) Licenses Service.

open licensing

The service is ultra simple in purpose and function. It provides:

  • Information on licenses for open data, open content, and open-source software in machine readable form (JSON)
  • A simple web API that allows you retrieve this information over the web — including using javascript in a browser via JSONP

In addition to the service there’s also:

What’s Included

There’s data on more than 100 open (and a few closed) licenses including all OSI-approved open source licenses and all Open Definition conformant open data and content licenses. Also included are a few closed licenses as well as ‘generics’ — licensed representing a category (useful where a user does not know the exact license but knows, for example, that the material only requires attribution).

View all the licenses available »

In addition various generic groups are provided that are useful when constructing license choice lists, including non-commercial options, generic Public Domain and more. Pre-packaged groups include:

The source for all this material is a git licenses repo on github. Not only does it provide another way to get the data, but also means that if you spot an error, or have a suggestion for an improvement, you can file an issue on the Github repo or fork, patch and submit a pull request.

Why this Service?

The first reason is the most obvious: having a place to record license data in a machine readable way, especially for open licenses (i.e. for content and data those conforming to the Open Defnition and for Software the Open Source Definition).

The second reason is to make it easier for other people to include license info into their own apps and services. Literally daily, new sites and services are being created that allow users to share or create content and data. But when they do that, if there’s any intention for that data to get used and reused by others it’s essential that the material get licensed — and preferably, openly licensed.

By providing license data in a simple machine-usable, web friendly format we hope to make it easier for people to integrate license choosers — and good license defaults — into their sites. This will provide not only greater clarify, but also, more open content and data — remember, no license usually means defaulting to the most restrictive, all rights reserved, condition.

Open Knowledge Definition translated into Telugu (తెలుగు)

November 29, 2011 in Open Definition, Open Knowledge Definition

The following post is by Theodora Middleton, the OKF blog editor.

We are pleased to announce that the Open Knowledge Definition has now been translated into Telugu (తెలుగు), thanks to the hard work of Sridhar Gutam. You can find this at:

The definition has now been translated into 27 languages. If you’d like to translate the Definition into another language, or if you’ve already done so, please get in touch on our discuss list, or on info at okfn dot org.

We need international open government data principles

July 8, 2011 in OGDCamp, Open Data, Open Definition, Open Government Data, Open/Closed, Policy, WG EU Open Data, WG Open Government Data, Working Groups

The following post is from Jonathan Gray, Community Coordinator at the Open Knowledge Foundation.

We need a set of international open government data principles.

Why? Because as the ‘open data‘ meme travels around the world – unlocking information from local, regional and national public bodies as it goes – we want to make sure that we’re all talking about the same thing. We want to make sure we can easily tell data which is open from data which is not open.

Why do we need to make sure that we agree on what ‘open data’ means? Because we want to minimise friction in the data ecosystem. We want an open data ecosystem without borders, barriers, restrictions, exceptions, checklists, registration forms, clickwrap agreements or micro-payments. We want data users – whether they are developers or designers, scholars or journalists – to be able to say:

Aha! This is open data. That means that I know what I can do with it: anything I like!


So far we’ve been very lucky. The vast majority of initiatives around the world calling themselves ‘open data’ initiatives are, indeed, pretty open. There have been a few initiatives which have launched with restrictive, ambiguous or absent legal terms – but this has often been pointed out, and often been rectified.

But why rely on luck? If we really want open government data to scale, to grow from a powerful idea to a powerful, global reality, then perhaps it is time we started to rely on simple, explicit, well-defined principles instead of serendipity to ensure that everyone reaches the same conclusions about what open data is, and what the defining characteristics of an open data initiative are.

Instead of public bodies deciding to do an open data initiative, then debating and wrangling over the legal details, we should try to build a stronger sense that if you want to do an open data initiative, then this entails you make your data open in such and such a way, according to principles which every major open data initiative around the world has signed on to.

These principles would be much less about drafting original content, and much more about building stronger consensus between existing open data initiatives around the world. There is already a lot of implicit agreement about what open data is, as well as lots of good material to build on.

For example, the UK Government’s Public Data Principles say:

Public data will be published in reusable, machine-readable form [...]

Public data will be released under the same open licence which enables free reuse, including commercial reuse – all data should be under the same easy to understand licence.


The US’s Open Government Directive says:

To the extent practicable and subject to valid restrictions, agencies should publish information online in an open format that can be retrieved, downloaded, indexed, and searched by commonly used web search applications. An open format is one that is platform independent, machine readable, and made available to the public without restrictions that would impede the re-use of that information.


New Zealand’s NZGOAL Framework says:

[...] State Services agencies should make their copyright works which are or may be of interest or use to people available for re-use on the most open of licensing terms available within NZGOAL (the Open Licensing Principle).


Not to mention things such as the 8 Principles of Government led by civic society developers and NGOs in the US, and the Sunlight Foundation’s 10 Open Data Principles.

The basic message is clear: maximise reusability, minimise restrictions. This message is also unpacked on OpenDefinition.org:

A piece of content or data is open if anyone is free to use, reuse, and redistribute it — subject only, at most, to the requirement to attribute and share-alike.


A set of international open government data principles would enshrine some of these ideas into a few clear and simple sentences saying what open data is – and have some mechanism for public bodies around the world to sign on. The key thing is that they would be drafted and adopted by leading open data initiatives around the world – who would also help to encourage others to adopt them.

This need not be an arduous or burdensome process. Signing principles that say open government data is X, Y, and Z, is very different to committing to implement them across government. E.g. most governments will have some open data and some non-open data, some data which is free for everyone to reuse, and some data which isn’t. The main thing is that when governments around the world talk about open data, they agree about what this means, and what it entails.

If you’re interested in this topic you may like to join the open-government mailing list. There will also be further discussion and activity around a set of international open government data principles at Open Government Data Camp 2011 in Warsaw on 20-21st October.

Bulgarian translation of the Open Knowledge Definition (OKD)

June 21, 2011 in OKF Projects, Open Definition, Open Knowledge Definition, WG Open Licensing

The following post is from Jonathan Gray, Community Coordinator at the Open Knowledge Foundation.

We are pleased to now have a Bulgarian translation of the Open Knowledge Definition thanks to Peio Popov. You can find this at:

If you’d like to translate the Definition into another language, or if you’ve already done so, please get in touch on our discuss list, or on info at okfn dot org.

data.gouv.fr to promote free public data

May 31, 2011 in External, Open Data, Open Definition, Open Government Data, Open/Closed, WG EU Open Data, Working Groups

The following guest post is from Regards Citoyens, a French organisation that promotes open data.

Three months ago, the French Prime Minister announced officially the creation of the EtaLab governmental team, dedicated to the future data.gouv.fr. On Friday May 27th, two official texts have been published: a decree (fr) that defines new juridic rules regarding data licencing, and a circular (fr) intended to all administrative services to precise the processes of data release on the future platform.

The good news is that the lobbyists in favor of paying data lost this battle: free of charge data will be the default rule for new datasets as well as for currently paying data which wouldn’t be registered as such by July 2012. The decision to keep fees on some datasets will have to be discussed by an official commission — unfortunately its board offers a disproportionate representation of private data sellers — duly motivated and officialised by an individual official decree from the Prime Minister. Every paying dataset will further have to be listed on an official repository of paying data. It is sadly still unclear whether this repository will be part of data.gouv.fr. Will real open data be intermixed with paying data on the platform?

Every other data should be released freely under the conditions of a new licence that will be drafted and discussed in the next few months and released by August 24th. The circular states that this licence should follow the free and open principles, even though its designers never took part in the writing of such a licence in the past. We will actively follow this process in order to push for a total compatibility with open data principles such as the Open Knowledge Definition.

Much worse news would be the exclusion from these rules of public administrations with commercial objectives (EPIC) and public service delegations to private businesses, such as transport companies. The circular also offers the possibility for administrations to write their own licences in order to meet some specific, unspecified needs: this might lead to an increasing number of slightly incompatible licences, with the risk of having some datasets undermined by non-commercial clauses.

At last, the circular mentions file formats. It is heartwarming to read the recommandation to use a list of various open formats such as CSV, XML, KML or ODS. It is unfortunate though to read ODS mistakingly listed as a text document format, or to see also the discriminatory format XLS proposed for spreadsheet documents.

To conclude, these official texts are a real progress which strongly encourages French administrations to open their data. But some serious risks remain, and Open Data shall not be restricted to free of charge datasets: open formats and free licences have to be the core of EtaLab’s coming work.

What do you think about Norway’s new open data license?

April 12, 2011 in Legal, OKF Projects, Open Data, Open Data Commons, Open Definition, Open Government Data, WG EU Open Data, WG Open Government Data, Working Groups

The following guest post is from Sverre Andreas Lunde-Danbolt who works for the Department for ICT and renewal in the Norwegian Ministry of Government Administration, Reform and Church Affairs, and who is a member of the OKF’s Working Group on Open Government Data

The Norwegian Ministry of Government Administration and Reform have just sent a draft version of a new Norwegian Licence for Open Data (NLOD) on a formal hearing here in Norway (the hearing documents (in Norwegian), and a blog post about the licence and the hearing (also in Norwegian)). After the hearing, we intend to recommend all government agencies in Norway to use this licence when they publish data.

Government agencies publishing data are not always very good at specifying the terms under which the information can be reused. In Norway, at least, the introduction of a new sui generis licence for each new data set has become a predictable exercise. This is confusing to the reuser, adding an uneccessary layer of uncertainty, and, in some cases, even impeding legitimate reuse.

The Ministry has therefore decided to establish one common licence. This will reduce the number of open data licences in Norway (one licence to rule them all). The licence is a rather straightforward attribution licence under Norwegian law. Its main purpose is to enable reuse in Norway, but to make sure data under NLOD can be combined with other data as well as reused internationally, the licence states clearly that it is compatible with Open Government Licence (v1.0), Creative Commons Attribution Licence (generic v1.0, v2.0, v2.5 and unported v3.0), and Open Data Commons Attribution Licence (v 1.0).

The most important details in the licence are the following:

  • Personal data is not covered by the licence. This is the same as in Open Government Licence.
  • The reuser cannot distort the information or use the information to mislead. The NLOD definition of this seems to be less restrictive than the definition used in Open Government Licence.
  • NLOD specifies that the licencor can provide more information on the quality or delivery of data, but that this kind of information is outside the scope of the licence. NLOD only covers the rights to use the information.
  • Information licenced under NLOD will also be licenced under future versions of the licence, provided that the licensor has not explicitly licenced information under v1.0. This gives the Norwegian Government more sway over public sector information, and reduces the chances of data ending up as a kind of orphan works in the future.

What do you think?

Avatar of admin

by admin

Open Biblio Principles Announced

January 24, 2011 in Bibliographic, News, Open Data, Open Definition, WG Open Bibliographic Data

The following post is by Mark McGillivrary, a member of the Open Knowledge Foundation Working Group on Open Bibliographic Data.

Last week the Open Biblio Principles were launched by the Open Knowledge Foundation’s Working Group on Open Bibliographic Data. The principles are the product of six months of development and discussion within the working group and the wider bibliographic community:

Producers of bibliographic data such as libraries, publishers, universities, scholars or social reference management communities have an important role in supporting the advance of humanity’s knowledge. For society to reap the full benefits from bibliographic endeavours, it is imperative that bibliographic data be made open — that is available for anyone to use and re-use freely for any purpose.

As this makes clear, the principles have a simple message: make bibliographic data open data as defined by the http://opendefinition.org/. Specifically, there are 4 core principles:

  1. When publishing bibliographic data make an explicit and robust license statement.
  2. Use a recognized waiver or license that is appropriate for data.
  3. If you want your data to be effectively used and added to by others it should be open as defined by the Open Definition (http://opendefinition.org/) – in particular non-commercial and other restrictive clauses should not be used.
  4. Where possible, explicitly place bibliographic data in the Public Domain via PDDL or CC0.

You can read the full version of the principles at: http://openbiblio.net/principles

And, perhaps even more importantly, you can endorse them: http://openbiblio.net/principles/endorse

Please help us spread the word, and the links, to individuals and organisations across the academic, library and publisher community.

Lastly, we are also working on alternative language versions so if you are interested in doing a translation please leave a comment or email mark [dot] macgillivrary [at] okfn [dot] org.

What “open data” means – and what it doesn’t

December 10, 2010 in External, OKF, OKF Projects, Open Data, Open Definition, Open Standards, Open/Closed, WG Open Licensing

The following post is from Melanie Chernoff, Public Policy Manager for Red Hat. It was originally published on opensource.com.

Last week, an article in the Wall Street Journal talked about the Open Data Partnership, which “will allow consumers to edit the interests, demographics and other profile information collected about them. It also will allow people to choose to not be tracked at all.” The article goes on to discuss data mining and privacy issues, which are hot topics in today’s digital world, where we all wonder just how much of our personal data is out there and how it’s being used. These are valid concerns being talked about in other, more appropriate fora. I, however, would like to address my personal pet peeve about the dilution of the term open data.

The Open Knowledge Definition says it this way, “A piece of content or data is open if you are free to use, reuse, and redistribute it — subject only, at most, to the requirement to attribute and share-alike.” Generally, this means that the data should be released in a format that is free of royalties and other IP restrictions. The problem is that an increasing number of people are using the term open data to mean publicly available data.

In the article, the CEO of the startup directing the Open Data Initiative says the goal is to “be more transparent and give consumers more control” of the data that is collected and shared. Providing a mechanism in which consumers can decide what information can be made available to advertisers is a laudable goal. However, this “open data” initiative focuses on what data is made available, when open data is really about how data is made available. This definitional shift is a problem, particularly for governments that are implementing data policies.

Simply put, all open data is publicly available. But not all publicly available data is open.

Open data does not mean that a government or other entity releases all of its data to the public. It would be unconscionable for the government to give out all of your private, personal data to anyone who asks for it. Rather, open data means that whatever data is released is done so in a specific way to allow the public to access it without having to pay fees or be unfairly restricted in its use.

In a previous article, I wrote about how the Massachusetts Bay Transit Authority (MBTA) opened up their transit data to software developers. Within 2 months, six new trip planning applications for bus and train riders had been built at no cost to the MBTA. That’s the power of open data. It was data produced by the government which was released to the public in an open format (GTFS) for free, under a license that allowed for use and redistribution.

Why does this matter? If open data is misunderstood as releasing any and all data to the public, people will become opposed to the concept due to their concerns about privacy. What we, as policy advocates, want to encourage is that the data that governments do and should publish is done so in a way to ensure equal public access by all citizens. In other words, you shouldn’t have to buy a particular vendor’s product in order to be able to open, use, or repurpose the data. You, as a taxpayer, have already paid for the collection of the data. You shouldn’t have to pay an additional fee to open it.

We’ve all seen, from the recent news about Wikileaks, that there are real privacy and/or security concerns with putting all the government’s data out there, but that is a separate issue and shouldn’t be confused with open data. Whether data should be made publicly available is where privacy concerns come into play. Once it has been determined that government data should be made public, then it should be done so in an open format.

Am I being nitpicky about the term? Maybe. But we’ve seen from other tech policy battles that good definitions are crucial to framing the debate.

Please create an account to get started.

Sign up to the Open Knowledge Newsletter

Get Updates