Proof-Editing Shakespeare Entry from Encyclopaedia Britannica 11th Edition

Since the previous post we’ve succeeded in using tesseract and we now have a nice plain text version of the EB entry on shakespeare: http://knowledgeforge.net/shakespeare/svn/trunk/shksprdata/ancillary/britannica-11th.txt What we now need to do is ‘proof’ this to correct the OCR errors. This kind of think is perfect for distributed volunteers so if you’d like to help out […]

Talk at Law 2.0: Openness, Web 2.0 and the Ethic of Sharing

Yesterday I was at the SCL’s “Law 2.0? : New Speech, New Property, New Identity” talking on Openness, Web 2.0 and the Ethic of Sharing. The full text of my talk is inline below, there are companion slides up online (more graphics!) and for those who like source here a link to the markdown original. […]

Towards an Open Service Definition

As mentioned previously on this blog recent developments, particularly the increase in ‘Software as a Service’ approaches, have created the need to think hard about what would constitute an `Free/Open Service’ (as opposed to just plain Free/Open Source software or Free/Open Knowledge). Following extensive discussion in the last couple of months on the okfn-discuss mailing […]

KForge v0.13 Released

After another six months of work we’ve released a new version of the KForge software. KForge/KnowledgeForge is one the OKF’s main activities with the KForge software being used to run the KnowledgeForge service. Over two and half years after work first started the project is now fairly mature with a growing number of projects hosted […]

OCRing Shakespeare Entry from Encyclopaedia Britannica 11th Edition

One of next things we want to do for open shakespeare is provide an open introduction for to his works. The obvious idea for this was to use the Shakespeare entry in the 11th ed of the Encyclopaedia Britannica as detailed in this ticket: http://p.knowledgeforge.net/shakespeare/trac/ticket/24 We’ve now written code to grab the relevant tiffs off […]

The Comprehensive Knowledge Archive Network (CKAN) Launched Today

After a year of (off and on) development we are delighted today to announce the official launch of the Comprehensive Knowledge Archive Network (CKAN for short): http://www.ckan.net/. CKAN is a registry of open knowledge packages and projects — be that a set of Shakespeare’s works, a global population density database, the voting records of MPs, […]

Guide to Open Data Licensing

Over the last month we’ve been working to produce a Guide to Open Data Licensing. As the name should make clear this is a guide to licensing data aimed particularly at those who want to make their data open. The guide is currently located on the wiki so that anyone can edit and update it: […]

Open Textbook Project Launched

Today we are pleased to announce the launch of Open Textbook, a place to list and keep track of news about textbooks that are open in accordance with the Open Knowledge Definition — i.e. free to use, reuse, and redistribute. We welcome participation in the project and if anyone has a textbook or notes they’d […]

Open Knowledge Foundation Annual Report 2006-2007

Today we published our 2006-2007 annual report that details some of what we have been up to over the last year. The following is taken from the introduction to that report. Introduction In May this year the Open Knowledge Foundation celebrated its third birthday. Much has changed in that time and the last year is […]

iCommons 2007: Retrospective Reflections

The iCommons conference in Dubrovnik, where I’ve been for the last few days, finished yesterday. This has been a great event (a big well done to Heather Ford and all the other organizers) and I’ve had the chance to talk to a very large number of interesting people — renewing old acquaintances and making new […]

AMEE: The Open CO2 Emissions Platform

One of the highpoints of XTech last week was the presentation of Gavin Starks about AMEE (Avoiding Mass Extinction Engine). AMEE is a “a platform for collaboration on Climate Change and Energy Efficiency”. It combines together a whole bunch of CO2 emissions data (including data from the UK government) with modelling code and assumptions to […]

XTech 2007

Last week I was at the XTech conf along with Jo Walsh in order to present in the Open Data track. We built on our recent discussion to argue for the fundamental importance of componentization in developing the Open Data/Knowledge ecosystem — you can find the slides of our talk (entitled Open Data and Componentization) […]

What Do We Mean by Componentization (for Knowledge)?

Background Nearly a year ago I wrote a short essay entitled The Four Principles of (Open) Knowledge Development in which I proposed that the four key features features of a successful (open) knowledge development process were that it was: Incremental Decentralized Collaborative Componentized As I emphasized at the time the most important feature — and […]

How to Develop Geodata Domain Models

Jo Walsh (who’s also a member of the Open Knowledge Foundation) has written a great post over on the mappinghacks blog about the development a new data model for OpenStreetMap. Though focused on the issue of modelling geodata the points she raises, particularly in relation to ‘Audit’ functionality (change tracking, versioning etc), are applicable to […]

New Version (v0.4) of Open Economics Released

This is the fourth release of the Open Economics project and the first that has been deemed ‘worthy’ of a full release announcement. The Open Economics project provides data storage and visualization for economics data as well as associated web services and assorted modelling code. The project home page is: http://www.okfn.org/econ/ while the open economics […]

v0.4 of Open Shakespeare Released

A new version of open shakespeare is out. Get it via the code page: http://www.openshakespeare.org/code/ Changelog Annotation of texts (js-based in browser) (ticket:20, ticket:21) (http://www.openshakespeare.org/2007/04/10/annotation-is-working/) Switch to unicode for internal string handling (resolves ticket:23: some texts breaking the viewer) Add functional tests for the web interface (ticket:11) Substantial improvements to speed of concordance (ticket:22) (http://www.openshakespeare.org/2007/01/03/improvements-to-the-concordance/) […]

v0.4 of Open Shakespeare Released

We’ve been doing more work on the Open Shakespeare project with the result that a new version (v0.4) is ready for release (full details including the Changelog can be found below). For those unfamiliar with the project, Open Shakespeare has two basic aims. First, to provide a simple but compelling open knowledge exemplar — and […]

PlanningAlerts.com — Opening up UK Planning Application Data

Back at the Civic Info forum in November Richard Pope presented his initial work on scraping planning application data from local council websites. This was a classic case where the original providers of the data did not make it available in an open form that was easy to use and reuse (it was often just […]

Annotation is Working!

After another push over the last few days I’ve got the web annotation system for Open Shakespeare operational (we’ve been hacking on this on and off since back in December). To see the system in action visit: http://demo.openshakespeare.org/view?name=phoenix_and_the_turtle_gut&format=annotate Quite a bit of effort has been made to decouple the annotation system from Open Shakespeare so […]

Copyright and Research in the Humanities and Social Sciences Conference

I’ll be speaking on the Copyright Users panel at the Copyright and Research in the Humanities and Social Sciences conference which takes place this Friday (30th March 2007) in Edinburgh. The event is being jointly organised by The British Academy and the AHRC Research Centre for Studies in Intellectual Property and Technology Law at the […]

Response to OfCom’s Public Service Publisher Consultation

The Open Knowledge Foundation made a joint response in association with the [Open Rights Group](http://www.openrightsgroup.org and Free Culture UK) to OfCom’s Public Service Publisher consultation (officially titled: A new approach to public service content in the digital media age). The Response The founding of a Public Service Publisher (PSP) is an opportunity to make a […]

Some Statistics

We’ve been busy hacking away and as a result we now have some rough statistics: Composers whose works are out of copyright. Of the 1083 composers listed in the data kindly donated to us by Philip Harper we estimate that, as of January 1st 2007: Out of Copyright: 263 In Copyright: 762 Status Unknown: 58 […]

Is Citizendium Not Open?

Following a link from Peter Suber’s ever-valuable Open Access News I read Larry Sanger’s blog entry We aren’t Wikipedia (Citizendium blog, March 21, 2007) which lists the various ways in which Citizendium differs from Wikipedia. This made interesting reading but my eye was especially caught by these two items: To be confirmed: Our license disallows […]

Open Knowledge 1.0 Has Happened

Open Knowledge 1.0 took place last Saturday at Limehouse Town Hall in London. Over 70 people came to hear the panels and participate in the open space. Material (including speaker presentations) and related links from the event are being posted online at http://www.okfn.org/okcon/after/. We’ve had excellent feedback and in my opinion (though of course I […]

The Nature of Information

“We are moving towards a world in which all information is software and all software is information.”

Storing and Visualizing Open Data: II

Back in June last year we posted about a demo site we’d put together to experiment with storing and visualizing open data. Recently several new sites have appeared doing web-based visualization, the most prominent of which are Many Eyes from IBM’s visualization research department and swivel. In both cases it appears that the code isn’t […]

Open Knowledge 1.0 Nearly Here

Open Knowledge 1.0, which takes place on Saturday March the 17th at Limehouse Town Hall in London, is now just over a week away. While there are still some places left we are nearing capacity so, if you would like to come, we advise you to register as soon as possible via: http://www.okfn.org/okcon/register/ Open Knowledge […]

8.4 Million Grant to University of Manchester to Expand Semi-Open Data Repository

According to a press release yesterday the University of Manchester received a further 8.4 Million GBP of funding from the ESRC to continue and expand its MIMAS service which provides students and researchers with free access to social science data: The billions of data items managed by the School of Social Sciences and Manchester Information […]

Collaborative Development of Data

$ This version: 2007-02-15 (First version 2006-05-24) $ We already have some fairly good working processes for collaborative development of unstructured text: the two most prominent examples being source code of computer programs and wikis for general purpose content (encyclopedias etc). However these tools perform poorly (or not at all) when we come to structured […]

Free Cultural Works Definition v1.0 Released

Having been working on the very similar Open Knowledge Defintion since Summer 2005 (with a v1.0 released in September last year) we were very interested when http://freedomdefined.org/ launched last May. Now after ten months of work they’ve released a stable, 1.0, version of what is now termed the “Free Cultural Works Definition”. Though having a […]

Open Context Officially Launched

The Open Context project, “a free, open access resource for the electronic publication of primary field research from archaeology and related disciplines”, was officially launched back on January the 29th: In April 2006, we proudly launched the Beta version of Open Context, our new open access publication system that enables researchers to distribute their primary […]

Copyright and the Digital Age

I authored the following short essay for publication in a pamphlet produced by the RSA entitled Promoting innovation and rewarding creativity: A balanced intellectual property framework for the digital age. The pamphlet was published at the beginning of January and along with my piece included items by Matthew Taylor (Chief Executive of the RSA), Lynne […]

Open Knowledge Web Buttons: Get Them Now

Over the last couple of years we’ve done a lot of work to get a clear and clean definition of what open knowledge is in the form of the Open Knowledge Definition. This provides a core set of principles defining openness independent of any particular set of licenses similar to the way the open source […]

Porting Marginalia Annotation to Python

Adding annotation support to the texts in Open Shakespeare is the main item for the next 0.4 release. This is a rather large undertaking and the last 2 months has seen substantial work on the first stage in the form of porting Geof Glass’ marginalia into a standalone python package named annotater that can then […]

An Open Search Service: Regulating Search the Open Way

The inspiration for writing this, as well as much of the information contained herein, came from the search Roundtable which took place at the IDEI Toulouse ‘Conference on the Software and Internet Industries’ on January 20th 2007. An earlier version of this essay as well as notes from the Roundtable can be found in this […]

Thinking about Annotation

Annotation means the adding of comments/notes/etc to an underlying resource. For the present I’ll focus on the situation where the underlying resource is textual (as opposed to being an image, or a piece of film or some data). Various things to consider when implementing an annotation/comment system: Addressing and atomisation: Are annotations specific to particular […]

Opening Up Ancient Geodata: The Barrington Atlas II

I’ve written previously about the Barrington Atlas of the Ancient World which took 12 years to produce (1988-2000). It’s a wonderful example of interdisciplinary collaboration using, as it did, the talents of a multitude of classical scholars as well as many cartographers. In that earlier post I pointed out that, unfortunately, none of the underlying […]

Open Knowledge 1.0: Registration Now Open

Registration is now open for Open Knowledge 1.0 which takes place at Limehouse Town Hall in London on March 17th 2007. Space at the venue is limited so we encourage you to register. More information about the event available below. Open Knowledge 1.0 When: Saturday 17th March 2007, 11am until 6:30pm (Doors open at 1030) […]

Open Data Discussion on SPARC List

I was recently involved in some interesting discussion with John Wilbanks on the SPARC open-data list and thought it worth excerpting some of this here. Email 1: Reply to a message from John Wilbanks Source: https://mx2.arl.org/Lists/SPARC-OpenData/Message/100.html Hi all, chiming in here…just joined the list. The lack of international consensus on data makes use of CC […]

KForge 0.12 Released

After nearly six months of work we’ve released a new version of the KForge software. KForge/KnowledgeForge is one the OKF’s main activities with the KForge software being used to run the KnowledgeForge service. Two years after work first started on the project, KnowledgeForge is beginning to live up to the original vision, hosting, among other […]

Improvements to the Concordance

One of the main items scheduled for v0.4 of open shakespeare is improvements to the responsiveness of the concordance. Using the v0.3 codebase, using just the sonnets as test material, loading up the list of words for the concordance alone took around 24s on my laptop. This is because even with a single text there […]

Striking confirmation from Google of the problems with ‘open’ APIs

As of December 5th 2006 Google stopped issuing API keys for their SOAP search API. They appear to want to move people to their ajax service which provides much less freedom for the client to process and manipulate the data (in fact it appears it is very hard to get at the data any more […]

Adding Web-Based Annotation Support

We intend to add annotation/commentarysupport to the open shakespeare web demo either in this release or next. As a first step we’ve been looking to see what (open-source) web-based annotation systems are already out there. Below is our list of what we’ve been able to find so far (if you know of more please post […]

Open Knowledge 1.0: London, Saturday 17th March 2007

~ For redistribution to all relevant blogs, lists, forums and individuals ~ Event home page: http://www.okfn.org/okcon/ Event wiki page: http://okfn.org/wiki/okcon/ What On Saturday the 17th March 2007 the first all-day Open Knowledge event is taking place at Limehouse Town Hall in London. Bringing together individuals and groups from across the open knowledge spectrum it will […]

Open Knowledge Forum on Civic Information (No. 2): Post-Event Information

Both those who attended the Open Knowledge Forum on Civic Information, as well as those who couldn’t, will be happy to know that the presentations of several of the speakers along with various additional material is now up on the forum’s home page: http://www.okfn.org/okforums/civicinfo2/ If you have material relevant to the forum you’d like hosted […]

The Trans-Atlantic Slave Trade Database: Is It Going to be Made Open?

Over a four year period in the mid-1990s a team of scholars centred on the Du Bois institute at Harvard compiled a comprehensive database of transatlantic slave-trading voyages. Over 27,000 individual journeys were recorded for the period 1650-1867 covering more than 2/3 of all voyages that took place. The data includes extensive demographic (and mortality) […]

UK National Statistics: Are They Open or Not?

I’ve used data a couple of times from the UK’s national statistics site: The other day I went there to investigate their licensing as part of an effort to do a simple survey of the openness of various UK government agency’s data. To summarize their copyright statement (full details are in 1): National statistics are […]