Milestone for Open Bibliographic Data: British Library Release 3 Million Records

November 23, 2010 in Bibliographica, News, WG Open Bibliographic Data

Last week the JISC-funded OpenBib project, in which OKF is a partner, announced in collaboration with the British Library the release of 3 million open bibliographic records to the community.

This release is a milestone for open bibliography: it is the first substantial corpus of bibliographic data to be released in an open form by a national library.

As reported in the announcement post:

We have initially received a dataset consisting of approximately 3 million records, which is now available as a CKAN package. This dataset consists of the entire British National Bibliography, describing new books published in the UK since 1950; this represents about 20% of the total BL catalogue, and we are working to add further releases. In addition, we are developing sample access methods onto the data, which we will post about later this week.

The data has also been loaded into Bibliographica so that it can be searched. For those who like RDF, there is a SPARQL endpoint, and there is also an ISBN lookup service. More from the announcement post:

The data has been loaded into a Virtuoso store that is queryable through the SPARQL endpoint, and the URIs that we have assigned to each record use the ORDF software to make them dereferenceable, supporting content auto-negotiation as well as embedding RDFa in the HTML representation.
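As a rough illustration of querying such a store, the sketch below builds (but does not send) a standard SPARQL protocol GET request asking for the triples attached to one record. The endpoint address is an assumption, since the post does not state it, and `build_triples_request` is a hypothetical helper name.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Assumption: the post does not give the endpoint address, so this URL is a placeholder.
SPARQL_ENDPOINT = "http://bnb.bibliographica.org/sparql"


def build_triples_request(resource_uri, endpoint=SPARQL_ENDPOINT, limit=50):
    """Build (without sending) a SPARQL GET request for the triples about one resource."""
    query = f"SELECT ?p ?o WHERE {{ <{resource_uri}> ?p ?o }} LIMIT {limit}"
    # The SPARQL protocol passes the query text in the `query` URL parameter.
    params = urlencode({"query": query})
    return Request(
        f"{endpoint}?{params}",
        headers={"Accept": "application/sparql-results+json"},
    )


request = build_triples_request("http://bnb.bibliographica.org/entry/GB8102507")
print(request.full_url)
```

Passing the request to `urllib.request.urlopen` would run the query, provided the endpoint is reachable at the assumed address.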

The data contains some 3 million individual records and some 173 million triples. Indexing the data was a very CPU-intensive process, taking approximately three days. Transforming and loading the source data took about five hours.
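To put those figures in perspective, here is a back-of-the-envelope calculation. It treats the rounded numbers quoted above as exact, so the results are only rough estimates.

```python
# Rounded figures quoted in the announcement; treated as exact for this estimate.
RECORDS = 3_000_000
TRIPLES = 173_000_000
LOAD_HOURS = 5  # transforming and loading the source data

load_seconds = LOAD_HOURS * 3600

triples_per_record = TRIPLES / RECORDS
records_per_second = RECORDS / load_seconds
triples_per_second = TRIPLES / load_seconds

print(f"{triples_per_record:.1f} triples per record on average")
print(f"{records_per_second:.0f} records loaded per second")
print(f"{triples_per_second:.0f} triples loaded per second")
```

Under these assumptions each record carries a little under 58 triples on average, and the load ran at roughly 167 records per second.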

To get an idea of the shape of the data, let us consider a sample resource, http://bnb.bibliographica.org/entry/GB8102507
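Because the record URIs are described as dereferenceable with content negotiation, a client could in principle ask for the same URI in different representations by varying the `Accept` header. The sketch below only illustrates that idea: the media types the server honours are an assumption, and `negotiated_request` and `fetch` are hypothetical helper names.

```python
from urllib.request import Request, urlopen

SAMPLE_URI = "http://bnb.bibliographica.org/entry/GB8102507"


def negotiated_request(uri, media_type):
    """Build a request that asks the server for a specific representation."""
    return Request(uri, headers={"Accept": media_type})


def fetch(uri, media_type, timeout=10):
    """Send the request and return (content type served, response body)."""
    with urlopen(negotiated_request(uri, media_type), timeout=timeout) as response:
        return response.headers.get_content_type(), response.read()


# Assumption: the server offers RDF/XML alongside the HTML page with embedded RDFa.
rdf_request = negotiated_request(SAMPLE_URI, "application/rdf+xml")
html_request = negotiated_request(SAMPLE_URI, "text/html")
print(rdf_request.get_header("Accept"), html_request.get_header("Accept"))
```

Calling `fetch(SAMPLE_URI, "application/rdf+xml")` would perform the actual lookup; it is left out of the top-level code because it needs the service to be reachable.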


1 response to Milestone for Open Bibliographic Data: British Library Release 3 Million Records

  1. 3 million is huge and I am certain that they were able to make big money because of this. In fact, they were able to release the records in just a short span of time. I am certain this has created a huge impact.
