Today, we are excited to announce that our work with the US Federal Government (data.gov) has gone live at catalog.data.gov! You can also read the announcement from the data.gov blog with their description of the new catalog.

Catalog.Data.gov

The Open Knowledge Foundation’s Services team, which deploys CKAN, have been working hard on a new unified catalog to replace the numerous previously existing catalogs of data.gov. All geospatial and raw data is federated into a single portal where data from different portals, sources and catalogs is displayed in a beautiful standardized user interface allowing users to search, filter and facet through thousands of datasets.

This is a key part of the U.S. meeting their newly announced Open Data Policy and marks data.gov’s first major step into open source. All the code is available on Github and data.gov plan to make their CKAN / Drupal set-up reusable for others as part of OGPL.

As one of the first major production sites to launch with the shiny new CKAN 2.0, data.gov takes advantage of the much improved information architecture, templating and distributed scalable authorization model. CKAN provides data.gov with a web interface for over 200 publishing organizations to manage their members, harvest sources and datasets – supporting requirements being outlined in Project Open Data. This means that agencies can maintain their data sources individually, schedule regular refreshes of the metadata into the central repository and manage an approval workflow.

There have been many additions to CKAN’s geospatial functionality, most notably a fast and elegant geospatial search:

Geospatial search filter

We have added robust support for harvesting FGDC and ISO 19139 documents from WAFs, single spatial documents, CSW endpoints, ArcGIS portals, Z39:50 sources, ESRI Geoportal Servers as well as other CKAN catalogs. This is available for re-use as part of our harvesting and spatial extensions.

Most importantly, this is a big move towards greater accessibility and engagement with re-users. Not only is metadata displayed through a browsable web interface (instead of XML files), there is now a comprehensive CKAN API with access to all web functionality including search queries and downloads which respects user and publisher permission settings. Users can preview the data in graphic previews as well as exploring Web Map Services, whilst the dataset page provides context, browsable tags, dataset extent, and maintainers.

Web Map Service

As data.gov invites users to get involved and provide feedback, we would also like to say that we are really excited about CKAN’s future. We have a very active mailing list, new documentation for installing CKAN and ways to contribute to the code for anyone wanting to join the CKAN community.

If you’re launching a CKAN portal soon or have one we don’t know about, let us know and we’ll make sure to add you to our wall of awesome!

Website | + posts

Irina is passionate about products and using technology to make things better. She spent many years working on open data at Open Knowledge (as one of the directors and ckan product owner), at web startups, and most recently as a product lead or data consultant for W3C, Open Data Institute and the UK, Dubai and UAE governments. She co-founded redecentralize.org — a project to promote and bring together people working on and interested in decentralised digital technologies.