The JISC funded OpenBib project, of which OKF is a partner, announced last week in collaboration with the British Library the release of 3 million open bibliographic records to the community.

This release represents a milestone for open bibliography as it represents the first substantial corpus of bibliographic data to be released in an open form by a national library.

As reported in the announcement post:

We have initially received a dataset consisting of approximately 3 million records, which is now available as a CKAN package. This dataset consists of the entire British National Bibliography, describing new books published in the UK since 1950; this represents about 20% of the total BL catalogue, and we are working to add further releases. In addition, we are developing sample access methods onto the data, which we will post about later this week.

The data has also been loaded into Bibliographica so that it can be searched. For those who like RDF there is a sparql endpoint and there is also an isbn lookup service. More from the announce post:

The data has been loaded into a Virtuoso store that is queriable through the SPARQL Endpoint and the URIs that we have assigned each record use the ORDF software to make them dereferencable, supporting perform content auto-negotiation as well as embedding RDFa in the HTML representation.

The data contains some 3 million individual records and some 173 million triples. Indexing the data was a very CPU intensive process taking approximately three days. Transforming and loading the source data took about five hours.

To get an idea of the shape of the data, let us consider a sample resource, http://bnb.bibliographica.org/entry/GB8102507

Notice: Undefined index: archive in /data-wordpress/www/blog-okfn/wp-content/plugins/molongui-authorship/views/author-box/html-layout.php on line 42
itemscope itemid="" itemtype="https://schema.org/Person" >

Notice: Undefined index: img in /data-wordpress/www/blog-okfn/wp-content/plugins/molongui-authorship/views/author-box/parts/html-avatar.php on line 4

Notice: Undefined index: show_social_web in /data-wordpress/www/blog-okfn/wp-content/plugins/molongui-authorship/views/author-box/parts/html-socialmedia.php on line 6

Notice: Undefined index: show_social_mail in /data-wordpress/www/blog-okfn/wp-content/plugins/molongui-authorship/views/author-box/parts/html-socialmedia.php on line 7

Notice: Undefined index: show_social_phone in /data-wordpress/www/blog-okfn/wp-content/plugins/molongui-authorship/views/author-box/parts/html-socialmedia.php on line 8

Notice: Undefined index: archive in /data-wordpress/www/blog-okfn/wp-content/plugins/molongui-authorship/views/author-box/parts/html-name.php on line 37

Notice: Undefined index: name in /data-wordpress/www/blog-okfn/wp-content/plugins/molongui-authorship/views/author-box/parts/html-name.php on line 41

Notice: Undefined index: job in /data-wordpress/www/blog-okfn/wp-content/plugins/molongui-authorship/views/author-box/parts/html-meta.php on line 10

Notice: Undefined index: job in /data-wordpress/www/blog-okfn/wp-content/plugins/molongui-authorship/views/author-box/parts/html-meta.php on line 15

Notice: Undefined index: company in /data-wordpress/www/blog-okfn/wp-content/plugins/molongui-authorship/views/author-box/parts/html-meta.php on line 17

Notice: Undefined index: phone in /data-wordpress/www/blog-okfn/wp-content/plugins/molongui-authorship/views/author-box/parts/html-meta.php on line 26

Notice: Undefined index: mail in /data-wordpress/www/blog-okfn/wp-content/plugins/molongui-authorship/views/author-box/parts/html-meta.php on line 36

Notice: Undefined index: web in /data-wordpress/www/blog-okfn/wp-content/plugins/molongui-authorship/views/author-box/parts/html-meta.php on line 46
+ posts

Notice: Undefined index: bio in /data-wordpress/www/blog-okfn/wp-content/plugins/molongui-authorship/views/author-box/parts/html-bio.php on line 8

6 thoughts on “Milestone for Open Bibliographic Data: British Library Release 3 Million Records”

  1. 3 million is huge and I am certain that they were able to make big money because of this. In fact, they were able to release the record in juts a short span of time. I am certain this has created a huge impact.

Comments are closed.