Francis Irving

CEO of ScraperWiki. Made several of the world's first civic websites, such as TheyWorkForYou and WhatDoTheyKnow.

More Reading

Post navigation

4 Comments

  • This is a really great and helpful post!

    In order to provide one more example of an open data curation
    project (@disqus_tgUmjrZRJZ:disqus ) I will mention the ENGAGE project http://www.engagedata.eu/.

    The features of ENGAGE position it as a centralized and collaborative PSI e-Infrastructure providing the necessary tools for dataset processing and acquisition and differentiate it from a simple repository of open public datasets. It will be an intelligent social and collaborative space for researchers, data journalists, citizens and other potential end users, who rely on open public data for professional or personal (re-)use. – See more at: http://www.engage-project.eu/engage/wp/?p=904#sthash.gGDSph30.dpuf

  • Dear Francis,

    I slightly disagree with thearguments you pose under the “Ones that haven’t worked” part.

    For instance, (I am a big fan of collaborative curation of data and actually working on such a project), however as you do state, there do exist problems there, such as the verification of the end result and as a consequence the trust of people in these data. However, the same problems do exist also in point 1) where you also mention a community of users. What is so different in such communities and why are their data trusted? Or is this the same case as with collaborative curation? To my point of view, both ideas are derive from the same basis (the fundamental thoughts of communities of users to power up interesting solutions) and thus they share the same benefits and risks. And there is an evident lack of features that could upscale these cases, such as community rating mechanisms, license checking functions, etc…

    Also as practise has shown, keeping the community out of the loop and not allowing people to co-create on existing things is something that will not work on the long term. Paraphrasing a famous man “… don’t think what you can do with your data, but let others play with them and produce things you have not even imagined …” 🙂

    So the questions is not in my opinion is not if something is not working, but WHY it is not working and WHAT can be done in oder to revert the situation.

    • Great questions Sotiris!

      Yes, the challenge in 5) is, how can we build such a collaborative culture?You’re also right that 1) is very strongly related.

      One difference is about clarity of value of output. For data, it is much clearer to show value in a vertical. And you do whatever it takes – OSM for example both enters its own totally new data, and also ingests (“collaboratively curates”) other sources which are suitably licensed.

      The privacy part is about who has money/energy at the same time as need. Right now, there’s few people with both, and those that do aren’t bought into open data – they are still thinking of data silos.

      Hope to have more conversations about this at OKFNCon!

  • Great post. The good folks at http://datakind.org/ are also looking at helping organisations with their capacity to develop open/closed/better data upstream. A part of “creating better data” surely has to include developing better interoperability and initiatives like http://popoloproject.com/ should help in addition to moving data sets up the http://5stardata.info/ scale. https://www.theengineroom.org/ is also looking at responsible use of data, interoperability as well as lowering the barrier to entry for small local non-profits to publish their data sets. This is one of the topics we’ll address at Okcon 2013 in our workshop “Interoperability Standards for Public Good Data”.

    I’m particularly interested in hearing whether there are good models and technology platforms that are emerging around open data curation? I think Freebase is one of them but proprietary. Using git for data is another more technology driven approach and so is work around linked data provenance. Anything else going on there that’s moving things along to allow open and closed data sets, crowd contributions, public and private curation to co-exist?

Leave a Reply

Your email address will not be published. Required fields are marked *

back to top