Francis Irving

CEO of ScraperWiki. Made several of the world's first civic websites, such as TheyWorkForYou and WhatDoTheyKnow.

More Reading

Post navigation

25 Comments

  • The discussion about from cms to dms: c is for content, d is for data raises some really valid points. This perspective is refreshing.

    donking

  • Huge thanks for sharing this with all of us! Your post is packed with useful tips and thoughtful ideas that I’ll definitely apply in my daily life. It’s rare to come across such sincere and well-organized content, and I’m really grateful for your generosity.

  • Fascinating overview of DMS evolution! As we also review different SaaS tools at TheSoftReview, we found that understanding these systems is crucial for selecting the right data management software.

  • Thought-provoking perspective on how data management evolved from traditional CMS models. As we also review different SaaS tools at TheSoftReview, we’ve seen many modern platforms still building toward the DMS vision outlined here.

  • Martin, yeah! WordPress is like the commodity, lowest-common-denominator CMS. Just as in the last couple of decades, Excel has been the go-to for data analysis. In both cases, there are better higher end products!

  • Rasmus – intriguing! Have never thought about Drupal. Does it handle multiple kinds of data sets for its end users? I don’t think we should call everything a DMS just because it has data in it! Reckon dissecting Drupal as a DMS or not would take a proper conversation though not just blog comments…

    Michael – thanks! Hmmm, that’s a shame regarding DMS. I find myself calling them “data hubs” more often, so perhaps that is what we should go for. What do you think?

  • Franics, Rufus,

    Congrats, great post, thanks for establishing the necessary terms here. We really need more discussion and awareness in this area (data management systems). Minor nit: the abbreviation DMS, I think, is somehow attached to Document Management Systems, which, at least in my experience is a rather suboptimal thing to do or further encourage. Can we come up with something better?

    FYI: I’ve linked your post from my recent presentation [1] at the Irish Local Government Management Agency (LGMA) Open Source Forum in Dublin, where I explain the transition from Doc MS over Content MS to Data MS.

    Again, thanks and KUTGW!

    Cheers,
    Michael

    [1] http://bit.ly/consumer-pull-through-open-data

  • This article completely failed to mention Drupal – a DMS that has been around for a long time. In fact, Drupal was never much of a CMS, if you ask me.

    I don’t personally like Drupal, for a number of reasons – I won’t get into those here. But I wonder if part of the reason it doesn’t meet my expectations for neither a CMS or a DMS, is the fact that they think they’re building a CMS – when clearly it’s much closer to a DMS in terms of features.

    In fact, I think those who pick Drupal are those who actually need a DMS more than a CMS. I wonder how many people tried out Drupal as a CMS and were confused and disappointed? Perhaps it’s time they change their description from CMS to DMS.

  • Glenn, some data hubs have transparent loading, yes! But they don’t have to, and when they do it is of its nature limiting. I think actually if something totally automatically gets all its data (like say an RSS reader), then all its datasets are “of one kind”. And that means it isn’t really a data hub, just an app and a database. Hmmm, so maybe my definition of data hub is somewhere that has “a list of datasets, of an indefinite number of kinds (i.e. schemas)”.

    Pieter, I’d never heard of DataTank before. Their website (thedatatank.com) that appsforghent.be links to seems to be down. Is it the same as this DataTank? http://datatank.co.uk/ Anyway, yes Apps for Ghent looks like a Data Hub, in the publishing Government information vertical! Like Socrata and CKAN.

    Kerstin, that sounds to me like ERP running on databases… But now I think about it, perhaps ERP is a data hub vertical, in theory, if it was across the web and more transparent.

  • Finally, I have a better idea of what ckan or the datatank are trying to do. This article was an eye opener for me. That said, with my background, your argumentation is a bit weakened by the fact that you use wordpress as the example of what a dms should try to be, while it is a quite bad cms( but a decent blog platform). Just a minor nitpick.

  • Fariz, it’s always easy when making an innovative product to claim that you have no competition. This isn’t true – people always do something at the moment, get by in some way. The world wouldn’t end without your new product. That current solution – getting by somehow else without is the competion.

    In the case of data hubs, people’s operating systems are the incumbent competition. We use a combination of tools, such as their filesystem, email clients, and applications like Excel, SPSS, Matlab etc.

  • Nice blog post!

    An old auntie, like me, can tell you about some of the “DMS” we had long before 1994 — Systems to manage for example payroll data and accounting data with systems running on so called mainframes. And You may have an old onkel who can tell you about how they for example managed manufacturing data and spare part data with with systems on so called minicomputers. We didn’t call them”DMS” but “ADB system” (Automatisk DataBehandling in Swedish).

    Many things were different from now: no global scale (often local homegrown systems), no crowd (often just a bunch of terminal- users) and no sql (but a lot of Get-Hold-Unique-within-Parent calls in hierarchical database management systems).

    While other things have become even more important given global scale, crowdsourcing, and the recognision of that “anyone can say anything about any topic” on a web of data — That is things such as data integrity, data traceability, and the need for data context and also to cope with the variances in the reality represented in the data.

  • I still don’t quite understand. Why do Windows and OSX serve as a DMS? May I have a brief explanation?

  • DataCouch has great potential for data sharing/crowd-sourced cleanup. Since it uses CouchDB, which automtically versions every change, it provides a method for data set forking & merging. It also lets users create visualization/etc. apps on top of datasets.

Leave a Reply

Your email address will not be published. Required fields are marked *

back to top