Open Bibliographic Data: The State of Play
Given the public role of libraries and the fact that bibliographic metadata (i.e. the material in library catalogues) doesn’t seem that exciting from a commercial point of view you might think that, of all the types of data out there, it would be bibliographic data that would be the most open. You might even think, given the public-spiritedness of librarians, that this is the kind of area where not only could it be openly available but it would be openly available (in nice little bzip or gzipped dumps …).
In fact the situation is quite the opposite. Most libraries appear to implicitly or explicitly exert rights over their data with some libraries licensing access to their catalogue data for substantial sums of money. The following lists some of the examples (both closed and open) that we know of:
Library of congress: public domain in the US (or at least free) but copyrighted outside the US. See [1] and comments in in fred2.0 readme which state:
These data are works of the United States Government and as such are not subject to copyright within the United States. (17 U.S.C §105).
The Library of Congress has copyrighted these data for use outside the United States. Contact the LC for permission prior to use or distribution of this data outside the United States. [http://www.loc.gov/cds/mds.html, which quotes a price of e.g. $21,905 for the 'Complete Service'.]
- fred2.0 (fred2.0 CKAN package): an excellent example of the effort to make material available but unfortunately has same restrictions as Library of Congress (from which the material is sourced).
- British Library: closed (and apparently gets sold for substantial sums).
- OCLC/Worldcat: closed. See the OCLC CKAN page.
- Barton/Simile: semi-open. Sourced from OCLC. Originally taken down but now back under CC non-commercial. See [1] for further discussion.
- OpenLibrary: in theory open (though no formal license or dump as yet and some material may have been sourced from LoC making it suspect outside of the US)
- isbndb.com: not really fully bibliographic data and status uncertain (see isbndb.com CKAN page)
LibraryThing: closed. Does not seem to make data available and source would likely make this problematic (from the about page):
LibraryThing uses Amazon and libraries that provide open access to their collections with the Z39.50 protocol. The protocol is used by a variety of desktop programs, notably bibliographic software like EndNote. LibraryThing appears to be the first mainstream web use.
As we continue to search for open sources of bibliographic data we’d love to hear from anyone who knows of examples not already on this list.
[1] http://www.bookism.org/open/2007/04/02/open-data-what-would-kilgour-think/




Pingback: LibrarySupportStaff.Org » Open Bibliographic Data : The State of Play
Pingback: Freie Katalogdaten und Erschließungsmittel « Jakoblog — Das Weblog von Jakob Voß
Pingback: Sur le front du libre (09/03/08) « pintiniblog
Pingback: Open Knowledge Foundation Blog » Blog Archive » CERN opens up bibliographic metadata!