Open Data in Archaeology
The following guest post is from Stefano Costa at the University of Siena. He is Founder of the IOSA initiative and Coordinator of a new Open Knowledge Foundation Working Group on Open Data in Archaeology.
Archaeological data is often not shared
According to Wikipedia, archaeology is the “science and humanity that studies historical human cultures through the recovery, documentation, analysis, and interpretation of material culture and environmental data, including architecture, artifacts, biofacts, and landscapes”. Phew – that’s a lot of things packed together! Archaeologists work all over the world, studying the remote past of lost civilizations at restricted archaeological sites, the garbage bin behind your home and everything in between. And they do their job using a large variety of techniques, methods and theories.
You’ve probably already guessed where I’m going: archaeological research produces a wealth of information about human cultures, and since the 1970s digital documentation has slowly come into use, drawing from nearby disciplines and encouraging wide methodological debates, too. In 2010, a great deal (albeit not all) of archaeological data is ‘born digital’ in the field, library, or lab. This means literally thousands of databases and millions of pictures of finds, excavation contexts and everything else that gets recorded.
In theory, this could bring a lot of potential not only to archaeological research per se, but to archaeological knowledge in general. Digital material can be reproduced easily and at virtually no cost. But this potential is often not realised, because the vast majority of archaeological information is not shared. Researchers and research groups usually restrict access to their data to a small group of people. In other words, data sharing is not as widespread among archaeologists as one might wish, and dissemination of research is still mostly based on traditional pre-digital means like journal articles, books and the like.
In addition to opening up access to journal articles written for print, new digital technologies allow archaeologists to go beyond traditional ways of disseminating scholarly research. This includes new ways of collaborating and publishing findings in ‘real time’ on the web using blogs, wikis and the like. Researchers can also go beyond PDF files of printed articles, towards machine-readable texts and the raw data and other materials underlying their research.
Open data in archaeology
To better understand these processes and to encourage others to open up archaeological knowledge, a few weeks ago we started a new Open Knowledge Foundation Working Group on Open Data in Archaeology:
We’ve also started a group on CKAN, the Open Knowledge Foundation’s registry of open data:
If you’d like to get involved with any of this, we encourage you to join our open-archaeology mailing list and introduce yourself!
Why open up archaeological data?
For those whose first reaction isn’t “why not share?”, it’s certainly worth exploring some of the possible reasons to share archaeological data.
1. Repeatability, impact factor, and peer review
If you’re on the “hard science” or processual side of the story (e.g. your research involves statistical or spatial analysis, …) you are going to appreciate the repeatability of your process, and formal comparison with other studies. If you’re in academia, no matter what your theoretical background is, the availability of raw data makes all of your research more visible and, much like Open Access to literature, it is likely to increase the impact factor of your work. Building knowledge is a complex process that involves putting together different pieces of information from lots of sources. Also, if your work is based on data collected by others, you will agree that harmonisation of formats will be hard to reach until we have open data.
2. Encouraging unexpected reuse of archaeological data
A radical point of view, and incidentally one that might well stem from current post-processual theories, is that most of the value brought by open data lies exactly in what others can do with our data that we would never have imagined, thus enabling true multivocality. “Others” means not only other archaeologists, but also researchers from related fields, local communities touched by archaeological research, primary school teachers, Wikipedia editors and Google.
3. Public access to publicly funded research
Not all, but a large part of archaeology is done as publicly funded research. These days, the governments of countries like the USA, the UK and France are moving towards Open Government Data. Where archaeological data is just a subset of “public sector information”, the same view can apply, and one might eventually say that all non-sensitive data (that is, everything except sensitive information such as the location of still unexcavated archaeological sites) should be made available to the public. This implies to some degree that catalogues of finds from excavated sites or museums are “safe to share” without posing any threat to the preservation of heritage. Furthermore, it is well known that preservation works not only by physically avoiding damage to artifacts and landscapes, but also by making them available to the community. Each country might have a different legal framework for the collection and archiving of heritage data, but no one should fail to see that sharing digital representations under an open license just adds value to cultural heritage.