Support Us

You are browsing the archive for WG Open Data in Science.

The White House Seeks Champions of Open Science

May 8, 2013 in Open Access, Open Science, WG Open Data in Science

Here at the Open Knowledge Foundation, we know Open Science is tough, but ultimately rewarding. It requires courage & leadership to take the open path in science.

Nearly a week ago on the open-science mailing list we started putting together a list of established scientists who have in some way or another made significant contributions to open science or lent their esteemed reputation to calls for increased openness in science. Our open list now has over 130 notable scientists, among whom 88 are Nobel prize winners.

In an interesting parallel development, the White House has just put out a call to help identify “Open Science” Champions of Change — outstanding individuals, organizations, or research projects promoting and using open scientific data for the benefit of society.

whitehouseOPENSCIENCE

Anyone can nominate an Open Science candidate for consideration by May 14, 2013.

What more proof do we need that open science is both good, and valued in society? This marks a tremendous validation of the open science movement. The US government is not seeking to reward any scientist; only open scientists actively working to change the world for the better will win this recognition.

We’re still a long way from Open Science being the norm in science. But perhaps now, we’re a crucial step closer to important widespread recognition that Open Science is good, and could be the norm in the future. We eagerly await the unveiling of the winning Open Science champions at the White House on the 20th June later this year.

Science Europe denounces ‘hybrid’ Open Access

May 2, 2013 in Open Access, Open Science, WG Open Data in Science

Recently Science Europe published a clear and concise position statement titled: Principles on the Transition to Open Access to Research Publications

This is an extremely timely & important document that clarifies what governments and research funders should expect during the transition to open access. Unlike the recent US OSTP public access policy which allows publishers to apply up to a 12 month access embargo (to the disgust of some scientists like Michael Eisen) on publicly-funded research, this new Science Europe statement makes clear that only up to a 6 month embargo at maximum should be accepted for publicly funded STEM research. The recent RCUK (UK research councils) open access policy also requires 6 months embargo at most, with some caveats.

But among the many excellent principles is a particularly bold and welcome proclamation:

the hybrid model, as currently defined and implemented by publishers, is not a working and viable pathway to Open Access. Any model for transition to Open Access supported by Science Europe Member Organisations must prevent ‘double dipping’ and increase cost transparency

Hybrid options are typically far more expensive than ‘pure’ open access journal costs, and they don’t typically aid transparency or the wider transition to open access.

The Open Knowledge Foundation heartily endorses these principles as together with the above they respect, and reinforce the need for free access AND full re-use rights to scientific research.

About Science Europe:

Science Europe is an association of European Research Funding Organisations and Research Performing Organisations, based in Brussels. At present Science Europe comprises 51 Research Funding and Research Performing Organisations from 26 countries, representing around €30 billion per annum.

Panton Fellowship wrap up: Ross Mounce

April 16, 2013 in Featured, Open Science, WG Open Data in Science

 

The Panton Fellowships have come to an end. The work that our two Panton Fellows, Ross Mounce and Sophie Kershaw have done over the past year to promote openness in the sciences has far surpassed what any of us expected. Here Ross details his wide-ranging experiences and achievements over the past year, and you can read Sophie’s report on the last year here.

So… it’s over.

For the past twelve months I was immensely proud to be one of the first Open Knowledge Foundation Panton Fellows, but that has now come to an end. In this post I will try and recap my activities and achievements during the fellowship.

okfhelsinki

The broad goals of the fellowship were to:

  • Promote the concept of open data in all areas of science
  • Explore practical solutions for making data open
  • Facilitate discussions surrounding the role and value of openness
  • Catalyse the open community, and reach out beyond its traditional core

and I’m pleased to say that I think I achieved all four of these goals with varying levels of success.

 

Achievements:

Outreach & Promotion – I went to a lot of conferences, workshops and meetings during my time as a Panton Fellow to help get the message out there. These included:

Conferences

At all of these I made clear my views on open data and open access, and ways in which we could improve scientific communication using these guiding principles. Indeed I was more than just a participant at all of these conferences – I was on stage at some point for all, whether it was arguing for richer PDF metadata, discussing data re-use on a panel or discussing AMI2 and how to liberate open phylogenetic data from PDFs.

One thing I’ve learnt during my fellowship is that just academic-to-academic communication isn’t enough. In order to change the system effectively, we’ve got to convince other stakeholders too, such as librarians, research funders and policy makers. Hence I’ve been very busy lately attending more broader policy-centred events like the Westminster Higher Education Forum on Open Access & the Open Access Royal Society workshop & the Institute of Historical Research Open Access colloquium.

Again, here in the policy-space my influence has been international not just domestic. For example, my trips to Brussels, both for the Narratives as a Communication Tool for Scientists workshop (which may help shape the direction of future FP8 funding), and the ongoing Licences For Europe: Text and Data Mining stakeholder dialogue have had real impact. My presentation about content mining for the latter has garnered nearly 1000 views on slideshare and the debate as a whole has been featured in widely-read news outlets such as Nature News. Indeed I’ve seemingly become a spokesperson for certain issues in open science now. Just this year alone I’ve been asked for comments on ‘open’ matters in three different Nature features; on licencing, text mining, and open access from an early career researcher point-of-view – I don’t see many other UK PhD students being so widely quoted!

Another notable event I was particularly proud of speaking at and contributing to was the Revaluing Science in the Digital Age invite-only workshop organised jointly by the International Council for Science & Royal Society at Chicheley Hall, September 2012. The splendour was not just in the location, but also the attendees too – an exciting, influential bunch of people who can actually make things happen. The only downside of such high-level international policy is the glacial pace of action – I’m told, arising from this meeting and subsequent contributions, a final policy paper for approval by the General Assembly of ICSU will likely only be circulated in 2014 at the earliest!

 

helsinkiTALK

The most exciting outreach I did for the fellowship were the ‘general public’ opportunities that I seized to get the message out to people beyond the ‘ivory towers’ of academia. One such event was the Open Knowledge Festival in Helsinki, September 2012 (pictured above). Another was my participation in a radio show broadcast on Voice of Russia UK radio with Timothy Gowers, Bjorn Brembs, and Rita Gardner explaining the benefits and motivation behind the recent policy shift to open access in the UK. This radio show gave me the confidence & experience I needed for the even bigger opportunity that was to come next – at very short notice I was invited to speak on a live radio debate show on open access for BBC Radio 3 with other panellists including Dame Janet Finch & David Willetts MP! An interesting sidenote is that this opportunity may not have arisen if I hadn’t given my talk about the Open Knowledge Foundation at a relatively small conference; Progressive Palaeontology in Cambridge earlier that year – it pays to network when given the opportunity!

 

Outputs

The fellowship may be over, but the work has only just begun!

I have gained significant momentum and contacts in many areas thanks to this Panton Fellowship. Workshop and speaking invites continue to roll in, e.g. next week I shall be in Berlin at the Making Data Count workshop, then later on in the month I’ll be speaking at the London Information & Knowledge Exchange monthly meet and the ‘Open Data – Better Society’ meeting (Edinburgh).

Even completely independent of my activism, the new generation of researchers in my field are discovering for themselves the need for Open Data in science. The seeds for change have definitely been sown. Attitudes, policies, positions and ‘defaults’ in academia are changing. For my part I will continue to try and do my bit to help this in the right direction; towards intelligent openness in all its forms.

What Next?

I’m going to continue working closely with the Open Knowledge Foundation as and when I can. Indeed for 6 months starting this January I agreed to be the OKF Community Coordinator, Open Science before my postdoc starts. Then when I’ve submitted my thesis (hopefully that’ll go okay), I’ll continue on in full-time academic research with funding from a BBSRC grant I co-wrote partially out in Helsinki(!) at the Open Knowledge Festival with Peter Murray-Rust & Matthew Wills, that has subsequently been approved for funding. This grant proposal which I’ll blog further about at a later date, comes as a very direct result of the content mining work I’ve been doing with Peter Murray-Rust for this fellowship using AMI2 tools to liberate open data. Needless to say I’m very excited about this future work… but first things first I must complete and submit my doctoral thesis!

“We are entering an era of open science” says EU Vice President Neelie Kroes at launch of new global Research Data Alliance

March 21, 2013 in Open Access, Open Data, Open Science, Policy, WG Open Data in Science

Neelie Kroes, Vice-President of the European Commission responsible for the Digital Agenda, gave a talk earlier this week renewing the EU’s strong, principled support for open science.

Speaking at the launch of a new global Research Data Alliance, she said that we are entering a new “era of open science”, which will be “good for citizens, good for scientists and good for society”.

She explicitly highlighted the transformative potential of open access, open data, open software and open educational resources – mentioning the EU’s policy requiring open access to all publications and data resulting from EU funded research.

She also alluded to the EU’s work encouraging national funding bodies to adopt similar approach to publicly funded research, and recent policy developments in the US and Australia.

The Research Data Alliance says it “aims to accelerate and facilitate research data sharing and exchange” and currently lists a number of working areas such as metadata harmonisation and legal interoperability.

While there does not yet appear to be an explicit focus on open data per se, we hope that the new organisation will take a principled, ‘open by default’ approach to data sharing, in line with the Panton Principles, and commensurate with Commissioner Kroes’s speech.

As always, our Open Science Working Group will continue to monitor and engage with relevant initiatives and policy developments in this area as they unfold. If you’d like to help us you can join our open-science discussion list, by signing up below:



Expanded Access to the Results of Federally Funded Research

February 25, 2013 in Open Access, Open Science, WG Open Data in Science

On Friday 22nd February, 2013 the U.S. Office of Science and Technology Policy (OSTP) released a statement to say that the “Obama Administration is committed to the proposition that citizens deserve easy access to the results of scientific research their tax dollars have paid for”.  This was accompanied by a new policy memorandum and a long-awaited response by OSTP Director John Holdren to the ‘We The People’ petition that was signed by over 65,000 people calling for expanded public access to research.

OSTP_logo

 

Advocates of green open access were pleased to see this new directive and Peter Suber in particular gives a nice clear summary of it in a Google+ post. With up to 12-month embargoes allowed before research can be self-archived even the Association of American Publishers wrote a statement of support for this new policy.

This policy certainly represents a step in the right direction, but it’s not as strong as some would have liked — prominent OA advocate & scientist Michael Eisen writes on his blog: No celebrations here: why the White House public access policy sucks.

A comparison with the United Kingdom’s RCUK policy, clearly shows the OSTP to be the weaker of the two:

Breadth: OSTP applies only to scientific research, whereas RCUK’s applies to Arts, Humanities, and Social Sciences research too.

Immediacy: OSTP allows 12 month embargoes, whilst RCUK accepts a maximum embargo of only 6 months for STM research

Coverage: OSTP policy applies only to Federal agencies with more than $100M in R&D expenditures, whilst RCUK’s applies to all RCUK funded research – no exceptions.

Some would say this is no bad thing. The OSTP policy is certainly more lenient on publishers and thus is likely to be uncontroversially implemented. Hopes for stronger OA policy in the USA are emboldened by the recent Fair Access to Science and Technology Research (FASTR) Act which proposes to shorten the maximum embargo time allowed to just 6-months, in-line with RCUK policy.

Finally, the pleasant surprise for everyone with this new OSTP policy is the specific and explicit inclusion of access to data not just publications, in section 4 titled Objectives for Public Access to Scientific Data in Digital Formats aiming to:

“Maximize access, by the general public and without charge, to digitally formatted scientific data created with Federal funds”

The United States of America has now clearly joined the global movement towards open access to taxpayer-funded research. We think the world will certainly benefit from this new policy.

BioMed Central Open Data Awards

January 29, 2013 in Open Science, Open Standards, WG Open Data in Science

The deadline for nominations in the annual BioMed Central Open Data Awards is on January 31st. Get your votes in now!

It’s been a big year for open science in general, and for BioMed Central in particular. We’ve been hard at work promoting the value of data-sharing, developing standards to make it more attractive to researchers, and offering tools to make open publishing even easier. Recognising the importance of citation and reuse tracking in motivating researchers to publish openly, we’ve been promoting standards for data-citation including data DOIs (Digital Object Identifiers) as link-able URLs. Through our technology partnership with LabArchives, all our authors can now get 100Mb of free storage and the ability to assign DOIs to their datasets, and thereby create permanent data citations.

This standard is exemplified by various datasets deposited in GigaDB, the database underpinning BioMed Central and BGI’s revolutionary “big data” journal GigaScience, which published its first articles in July 2012.

Critically, GigaDB uses the Creative Commons CC0 waiver for published datasets making data open in compliance with Open Knowledge Foundation principles. We’ve been pushing open licensing since 2010, but in September 2012 we launched a formal public consultation on changing the copyright system in science publishing to better support open data. Look out for the full, public report on the outcomes in early 2013.

The Open Data Award

But the Open Data Award is a celebration of the work of our authors. Authors who have gone the extra mile to make their science reproducible; their analyses re-computable; and so, fundamentally, their data as open as possible. We look for scientists who have published in BioMed Central journals and have demonstrated leadership in the sharing, standardization, publication, or re-use of biomedical research data.

Last year’s winner exemplified the award’s criteria superbly. The International Stroke Trial group, led by Peter Sandercock published the anonymised 19,000 individual patient data from one the largest trials in acute stroke ever conducted. This kind of transparency is sadly still uncommon in this field, and the data are already being used by at least two other groups since its release.

I’m hoping we will unearth an equally important example of “data sharing done well” from our 2012 publications. As in previous years the authors of Panton Principles – Peter Murray-Rust, John Wilbanks, Rufus Pollock and Cameron Neylon – will be assisting with the judging process, along with Earl Buetler, CEO of LabArchives. With less than a week to go before we begin the judging process, please nominate soon!

The House of Lords on Open Access

January 16, 2013 in News, Open Access, WG Open Data in Science

This month, the House of Lords science committee is hearing evidence on the likely effects of the new Research Councils UK (RCUK) Open Access policy that will come into force in April this year for all RCUK-funded researchers.

The UK, and many other countries around the world are making this bold move to allow everyone open access to academic research because, in the long term, it will save money AND increase access to research for everyone. Currently the predominant mode of paying for the publication of research is via the subscription-access model – an inefficient system which benefits commercial publishers to the detriment of taxpayers.

Under the subscription model, taxpayer-funded research is given to publishers (for free) and the publishers sell access to this research to everyone, usually via annual subscription(s). The UK and all other countries thus have to rent access to research papers (many of which they produced themselves!) at an estimated global cost of $8 billion per year.

Under the ‘gold’ Open Access model, research is instead published either for an upfront fee or for no charge with institutional support, and made permanently free to access for everyone on the internet. This mode ensures more people have easy access to research and that the costs of publishing are met both upfront and transparently – a marked contrast from the Non-Disclosure Agreements that shroud many of the ‘Big Deal’ subscription-access agreements in secrecy. The ‘green’ route to Open Access whereby copies of research can be posted to research repositories after a short embargo period is also allowed under this policy.

There are concerns for some of the smaller details of this new policy and how it might affect particular groups like British Learned Societies, and thus the House of Lords are calling for written evidence submissions concerning these specific issues within the proposed plan:

  • support for universities through funds to cover article processing charges;
  • embargo periods for articles published under open access;
  • engagement with publishers, universities learned societies and other stakeholders in developing the new open access policies; and
  • how the Government should address the concerns raised by the scientific and publishing communities about the policy.

Any party interested in submitting written evidence should contact the Clerk to the Committee on hlscience@parliament.uk by the end of this week.

Example submissions include: Mike Taylor’s, 12 January 2013

I shall also be submitting some evidence ASAP.

The Committee will hold its final day of interview evidence on Tuesday 29 January when they will hear in-person from a number of witnesses including David Willetts MP, Minister of State for Science Universities, and Professor Rick Rylance, Chair of Research Councils UK.

Open Research Data Handbook Sprint – 15-16 February

January 16, 2013 in Events, Featured, Open Data Handbook, Open Economics, Open Science, Open Standards, Sprint / Hackday, WG Development, WG Economics, WG Open Bibliographic Data, WG Open Data in Science

On February 15-16, the Open Research Data Handbook Sprint will happen at the Open Data Institute, 65 Clifton Street, London EC2A 4JE.

The Open Research Data Handbook aims to provide an introduction to the processes, tools and other areas that researchers need to consider to make their research data openly available.

Join us for a book sprint to develop the current draft, and explore ways to remix it for different disciplines and contexts.

Who it is for:

  • Researchers interested in carrying out their work in more open ways
  • Experts on sharing research and research data
  • Writers and copy editors
  • Web developers and designers to help present the handbook online
  • Anyone else interested in taking part in an intense and collaborative weekend of action

Register at Eventbrite

What will happen:

The main sprint will take place on Friday and Saturday. After initial discussions we’ll divide into open space groups to focus on research, writing and editing for different chapters of the handbook, developing a range of content including How To guidance, stories of impact, collections of links and decision tools.

A group will also look at digital tools for presenting the handbook online, including ways to easily tag content for different audiences and remix the guide for different contexts.

Agenda:

Week before & after:

  • Calling for online contributions and reviews

Friday:

  • Seminar or bring your own lunch on open research data.
  • From 2pm: planning and initial work in the handbook in small teams (optional)

Saturday:

  • 10.00 – 10:30: Arrive and coffee
  • 10.30 – 11.30: Introducing open research – lightning talks
  • 11.30 – 13:30: Forming teams and starting sprint. Groups on:
    • Writing chapters
    • Decision tools
    • Building website & framework for book
    • Remixing guide for particular contexts
  • 13.30 – 14:30: Lunch
  • 14.30 – 16:30: Working in teams
  • 17.30 – 18:30: Report back
  • 18:30 – …… : Pub

Partners:

OKF Open Science Working Group – creators of the current Open Research Data Handbook
OKF Open Economic Working Group – exploring economics aspects of open research
Open Data Research Network - exploring a remix of the handbook to support open social science
research in a new global research network, focussed on research in the Global South.
Open Data Institute – hosting the event

Open Data and Privacy Concerns in Biomedical Research

November 26, 2012 in Ideas and musings, Open Data, Open Science, WG Open Data in Science

Privacy has long been the focus of debates about how to use and disseminate data taken from human subjects during clinical research. The increasing push to share data freely and openly within biomedicine poses a challenge to the idea of private individual information, whose dissemination patients and researchers can control and monitor.

In order to address this challenge, however, it is not enough to think about (or simply re-think) the meaning of ‘informed consent’ procedures. Rather, addressing privacy concerns in biomedical research today, and the ways in which the Open Data movement might transform how we think about the privacy of patients, involves understanding the ways in which data are disseminated and used to generate new results. In other words, one needs to study how biomedical researchers confront the challenges of making data intelligible and useful for future research.

Efficient data re-use comes from what the Royal Society calls ‘intelligent openness’ – the development of standards for data dissemination which make data both intelligible and assessable. Data are intelligible when they can be used as evidence for one or more claims, thus helping scientists to advance existing knowledge. Data are assessable when scientists can evaluate their quality and reliability as evidence, usually on the basis of their format, visualisation and extra information (metadata) also available in databases.

Yet the resources and regulatory apparatus for securing proper curation of data, and so their adequate dissemination and re-use, are far from being in place. Making data intelligible and assessable requires labour, infrastructures and funding, as well as substantial changes to the institutional structures surrounding scientific research. While the funding to build reliable and stable biomedical databases and Open Data Repositories is increasing, there is no appropriate business model to support the long-term sustainability of these structures, with national funders, industry, universities and publishing houses struggling to agree on their respective responsibilities in supporting data sharing.

Several other factors are important. For instance, the free dissemination of data is not yet welcomed by the majority of researchers, who do not have the time or resources for sharing their data, are not rewarded for doing so and who often fear that premature data-sharing will damage their competitive advantage over other research groups. There are intellectual property concerns too, especially when funding for research comes from industry or specific parts of government such as defence. Further, there are few clear standards for what counts as evidence in different research contexts and across different geographical locations. And more work needs to be done on how to relate datasets collected at different times and with different technologies.

The social sciences and humanities have an important role to help scientific institutions and funders develop policies and infrastructures for the evaluation of data-sharing practices, particularly the collaborative activities that fuel data-intensive research methods. An improved understanding of how data can be made available so as to maximise their usefulness for future research can also help tackle privacy concerns relating to sensitive data about individuals.

When it comes to sharing medical records, it is now generally agreed that providing ‘informed consent’ to individual patients is simply not possible, as neither patients not researchers themselves can predict how the data could be used in the future. Even the promise of anonymity is failing, as new statistical and computational methods make it possible to retrieve the identity of individuals from large, aggregated datasets, as shown by genome-wide association studies.

A more effective approach is the development of ‘safe havens’: data repositories which would give access to data only to researchers with appropriate credentials. This could potentially safeguard data from misuse, without hampering researchers’ ability to extract new knowledge from them. Whether this solution succeeds ultimately depends on the ability of researchers to work with data providers, including patients, to establish how data travel online, how they are best re-used and how data sharing is likely to affect, and hopefully improve, future medicine. This work is very important, and should be supported and rewarded by universities, research councils and other science funders as an integral part of the research process.

To learn more, read the report ‘Making Data Accessible to All’

Members of the public asked to help tend Feynman’s Flowers

November 12, 2012 in Featured, Open Science, PyBossa, WG Open Data in Science

A project at the London Centre for Nanotechnology (LCN) is making fantastic use of the Pybossa tool (a project of the Open Knowledge Foundation and the Citizen Cyberscience Centre) in a citizen science project called ‘Feynman’s Flowers’, which launched this weekend.

The project asks members of the public to help unlock the secrets of magnetism at the molecular scale, and is powered by our free, open-source, platform for creating and running crowd-sourcing applications that utilise online assistance in performing tasks that require human cognition.

From their press release:

The project’s website invites volunteers from across the world to analyse microscope images of individual molecules, which have characteristic flower shapes. Anyone can take part, and only a few clicks of the computer mouse are required to collect valuable information.

The Feynman’s Flowers project will allow volunteers to measure the position of a molecule in relation to a metal surface to help scientists understand how this can affect the molecule’s properties. Data that volunteers produce will contribute to a research project run by the group of Dr. Cyrus Hirjibehedin at the LCN, in collaboration with Tsinghua University in Beijing and the Citizen Cyberscience Centre.

Currently, the research project is focused on exploring the behaviour of phthalocyanine molecules. In the past, these were used as dyes for fabrics, but scientists now realise that they also have interesting electronic and magnetic properties that make them potentially useful for creating nanoscale devices that can manipulate or store information.

This website is the first project of its kind in this area of physics, applying the power of crowd-sourcing to help understand images created by a scanning tunnelling microscope (STM). Operating at temperatures close to absolute zero (-273˚C), the STM allows scientists to image individual atoms and molecules on surfaces and to explore their fascinating magnetic and electronic properties. Public participation will allow for the analysis of data in ways that previously would not have been possible.

PyBossa Logo

Find out more on the UCL website – and get involved here!

Please create an account to get started.

Sign up to the Open Knowledge Newsletter

Get Updates