Moving forward with Open Science in Europe

Last week I went to the Institut d’Estudis Catalans in Barcelona for the Euroscience Open Forum Satellite Event organised by Science Commons. The event brought together together lawyers, scientists, policy makers, stakeholders and representatives from many different groups and organisations interested in open access and open data in scientific research from across Europe. The event […]

Over 200 Packages on CKAN!

Today the number of packages in the Comprehensive Knowledge Archive Network (CKAN) has passed the 200 mark! CKAN is an open registry of (open) knowledge packages – from genes to geodata, sonnets to statistics. CKAN currently includes basic metadata about each package – including title, URL, download URL, tags, license information and notes. Packages include […]

After the first Open Visualisation Workshop

The first Open Visualisation Workshop took place on Saturday as we mentioned last week. Details, notes and links are available on the event’s wiki page. The event took place at Trampoline Systems’ new site in East London. To make sure the event was as informal as it was billed to be – we left the […]

Open Data Going Mainstream?

Bret Taylor’s recent post entitled “We Need a Wikipedia for Data” has been garnering a lot of attention around the blogosphere. While his suggestions are not particularly novel, the post and the attention it has garnered, is, I think, indicative of the growing interests in the issues of (open) data and its importance for the […]

Public Domain Dedication & License (PDDL) v.1.0 released at OKCon!

Jordan Hatcher, of opencontentlawyer.com and chair of the Advisory Council for the Open Knowledge Definition took the Public Domain Dedication & License out of beta on Saturday at OKCon. The PDDL (which we blogged about in December) was initially sponsored by Talis and is specifically aimed at providing a suitable license for open data — […]

Public Domain Dedication & License (PDDL) v.1.0 released at OKCon!

Jordan Hatcher, of opencontentlawyer.com and chair of the Advisory Council for the Open Knowledge Definition took the Public Domain Dedication & License out of beta on Saturday at OKCon. The PDDL (which we blogged about in December) was initially sponsored by Talis and is specifically aimed at providing a suitable license for open data — […]

Open Bibliographic Data: The State of Play

Given the public role of libraries and the fact that bibliographic metadata (i.e. the material in library catalogues) doesn’t seem that exciting from a commercial point of view you might think that, of all the types of data out there, it would be bibliographic data that would be the most open. You might even think, […]

Creative Commons adopts ‘Free Cultural Works’ seal of approval

Yesterday Creative Commons announced that their Attribution and Attribution Sharealike licenses will feature a seal of approval and link to Freedom Defined – the Definition of Free Cultural Works. We’ve been in touch with Freedom Defined since May 2006 (we blogged about the project last year) as their aims are so similar to that of […]

Open Definition Advisory Council launched

We are pleased to announce the launch of an Advisory Council for opendefinition.org. The Council will be formally responsible for maintaining and developing the Definitions and associated material found on the Open Definition site – including the Open Knowledge Definition and the Open Service Definition. As many of you will know, these definitions aim to […]

When is my bus?

Sometimes you find some data whose lack of freedom is totally mysterious from a commercial point of view. At mySociety, we recently released made some travel time maps which help you work out where you should live that would have the quickest commute to your place work. Interactive flash sliders to balance this delicate equation […]

On Getting Raw Data for Cancer Research

Andrew Vickers, a biostatistician at the Memorial Sloan-Kettering Cancer Center, New York, recently published an article in the New York Times about his experiences trying to get hold of raw data for cancer research: Cancer Data? Sorry, Can’t Have It. In it he describes various difficulties he has encountered trying to get hold of the […]

Meeting on UK Public Sector Information Re-use Request Service

On Saturday I attended a ‘BarCamp’ on the Power of Information Review Recommendation 8 – which suggests there should be a re-use request service for UK Public Sector Information (we blogged about this in October). The event was organised by John Sheridan of the Office of Public Sector Information and was attended by representatives from […]

The IPCC Data Distribution Centre – environmental data licensing

We’ve recently started looking into how much environmental data made available on the web is open in accordance with the Open Knowledge Definition. The Intergovernmental Panel on Climate Change (IPCC) has a Data Distribution Centre (DDC) – which is a good start to see what data is available. The DDC “offers access to baseline and […]

Pleiades: Lots of Ancient Geodata Released!

We’ve written about the pleiades project a couple of times before: Organized by the Ancient World Mapping Center at the University of North Carolina at Chapel Hill, U.S.A., Pleiades brings together a global community of scholars, students and enthusiasts to expand and enhance continually the information originally brought together by the Classical Atlas Project (1988-2000) […]

Open social data progress

My last post here, Google vs. Facebook, was about how our own personal knowledge, that you’ve put into social networks like Facebook or MySpace, should also be open. By this I mean that you should have control of it, and it should be encoded in open formats with open protocols. The last week there’s been […]

British History Online: Why the Restrictions?

British History Online is a site created and run by Institute for Historical Research (part of the University of London I believe) and the History of Parliament Trust and located at: (note the ‘ac.uk’ domain name signifying the official academic status though rather unusually they do run ads). Their purpose is clearly stated on the […]

Cofundos – ‘community innovation and funding’

Recently the Agile Knowledge and Semantic Web research group (AKSW) at Universität Leipzig launched Cofundos.org. Confundos aims to help people share, refine, fund and realise new ideas for open software and knowledge projects. It was founded and developed by Sören Auer, who leads the AKSW research group (and is on the OKF’s advisory board). The […]

Google vs Facebook

Facebook has striken fear not only into the hearts of incumbent dot-com billionaires, but also into the hearts of open data freaks. It’s terrifying me – not least, because I use it. I give the most personal, sensitive information that I have to a private US corporation. I have no way of getting the data […]

WorldMapper: Is Its Data Open?

WorldMapper produces a whole variety of illuminating cartograms to show the distribution of various statistics across the world from royalties to the level of military spending. While looking at the site I immediately started to wonder about the openness both of the maps themselves and the underlying data (to my mind while the maps are […]

AMEE – an exemplary open service

The people behind AMEE, the ‘world’s energy meter’ (which we blogged about back in May), have been busy forging ahead into new areas of open service development. As well as ensuring AMEE conforms to the draft Open Service Definition (in short, open data plus open software) they’ve recently published a Memorandum of Understanding with terms […]

Open Database License

Jordan Hatcher (Open Content Lawyer) and Dr. Charlotte Waelde (University of Edinburgh) have just published the first draft of the Open Data Commons, or the Open Database License. The new license was inspired by the Talis Community License (a draft open license for data from 2006) and its development has been sponsored by Talis. The […]

DBpedia 2.0

DBpedia recently released the new version of their dataset. The project aims to extract structured information from Wikipedia so that this can be queried like a database. On their blog they say: The renewed DBpedia dataset describes 1,950,000 “things”, including at least 80,000 persons, 70,000 places, 35,000 music albums, 12,000 films. It contains 657,000 links […]

Articles in CTWatch Quarterly

As some of you many have seen, Open Knowledge Foundation advisory board members Peter Suber and John Wilbanks recently wrote two interesting articles in CTWatch Quarterly. Peter Suber’s Trends Favoring Open Access is a broad-ranging overview of developments in publishing, research, and technology that look to support Open Access. As well as looking at how […]

The Open Library and Versioned Data

The Internet Archive has recently launched a beta version of The Open Library. A demo can be found here and the Open Library book can be read here. It is inspired by the idea of a “library that makes all the published works of humankind available to everyone in the world”. Initially it will consist […]

The Comprehensive Knowledge Archive Network (CKAN) Launched Today

After a year of (off and on) development we are delighted today to announce the official launch of the Comprehensive Knowledge Archive Network (CKAN for short): http://www.ckan.net/. CKAN is a registry of open knowledge packages and projects — be that a set of Shakespeare’s works, a global population density database, the voting records of MPs, […]

Guide to Open Data Licensing

Over the last month we’ve been working to produce a Guide to Open Data Licensing. As the name should make clear this is a guide to licensing data aimed particularly at those who want to make their data open. The guide is currently located on the wiki so that anyone can edit and update it: […]

Giving us our own information

Yesterday, the UK’s Cabinet Office published an important new review of Public Sector Information. It’s a fat document to read through, but it oozes with ideas and stories (many no doubt from readers of this blog) about the importance of open data. The obvious question is, what will the Government do with it at all […]

AMEE: The Open CO2 Emissions Platform

One of the highpoints of XTech last week was the presentation of Gavin Starks about AMEE (Avoiding Mass Extinction Engine). AMEE is a “a platform for collaboration on Climate Change and Energy Efficiency”. It combines together a whole bunch of CO2 emissions data (including data from the UK government) with modelling code and assumptions to […]

What Do We Mean by Componentization (for Knowledge)?

Background Nearly a year ago I wrote a short essay entitled The Four Principles of (Open) Knowledge Development in which I proposed that the four key features features of a successful (open) knowledge development process were that it was: Incremental Decentralized Collaborative Componentized As I emphasized at the time the most important feature — and […]

PlanningAlerts.com — Opening up UK Planning Application Data

Back at the Civic Info forum in November Richard Pope presented his initial work on scraping planning application data from local council websites. This was a classic case where the original providers of the data did not make it available in an open form that was easy to use and reuse (it was often just […]

Storing and Visualizing Open Data: II

Back in June last year we posted about a demo site we’d put together to experiment with storing and visualizing open data. Recently several new sites have appeared doing web-based visualization, the most prominent of which are Many Eyes from IBM’s visualization research department and swivel. In both cases it appears that the code isn’t […]

Open Knowledge 1.0 Nearly Here

Open Knowledge 1.0, which takes place on Saturday March the 17th at Limehouse Town Hall in London, is now just over a week away. While there are still some places left we are nearing capacity so, if you would like to come, we advise you to register as soon as possible via: http://www.okfn.org/okcon/register/ Open Knowledge […]

8.4 Million Grant to University of Manchester to Expand Semi-Open Data Repository

According to a press release yesterday the University of Manchester received a further 8.4 Million GBP of funding from the ESRC to continue and expand its MIMAS service which provides students and researchers with free access to social science data: The billions of data items managed by the School of Social Sciences and Manchester Information […]

Collaborative Development of Data

$ This version: 2007-02-15 (First version 2006-05-24) $ We already have some fairly good working processes for collaborative development of unstructured text: the two most prominent examples being source code of computer programs and wikis for general purpose content (encyclopedias etc). However these tools perform poorly (or not at all) when we come to structured […]

Open Context Officially Launched

The Open Context project, “a free, open access resource for the electronic publication of primary field research from archaeology and related disciplines”, was officially launched back on January the 29th: In April 2006, we proudly launched the Beta version of Open Context, our new open access publication system that enables researchers to distribute their primary […]

Open Knowledge Web Buttons: Get Them Now

Over the last couple of years we’ve done a lot of work to get a clear and clean definition of what open knowledge is in the form of the Open Knowledge Definition. This provides a core set of principles defining openness independent of any particular set of licenses similar to the way the open source […]

Open Data Discussion on SPARC List

I was recently involved in some interesting discussion with John Wilbanks on the SPARC open-data list and thought it worth excerpting some of this here. Email 1: Reply to a message from John Wilbanks Source: https://mx2.arl.org/Lists/SPARC-OpenData/Message/100.html Hi all, chiming in here…just joined the list. The lack of international consensus on data makes use of CC […]

Storing and Visualizing Open Data

The basic purpose of the Open Knowledge Foundation is to ‘promote open knowledge’. In particular we want to: Get data out there — that’s why we’re developing KnowledgeForge Make sure that data is open data (i.e. is properly licensed) — that’s why we’re developing open knowledge definition Make sure that data can be found — […]