Support Us

You are browsing the archive for Data Journalism.

The Data Journalism Handbook is Go!

May 1, 2012 in Data Journalism, Featured

On Saturday 29th April, at the International Journalism Festival in Perugia – 6 months of work on the Data Journalism Handbook was unveiled: the Data Journalism Handbook was launched.

The Handbook contains contributions from over 70 of the world’s leading data journalists.

The book’s contributors are a “who’s who of data journalism”, says Simon Rogers from the Guardian. There are pieces by data journalists and data wranglers from the Australian Broadcasting Corporation, the BBC, the Chicago Tribune, Deutsche Welle, the Guardian, the Financial Times, Helsingin Sanomat, La Nacion, the New York Times, ProPublica, the Washington Post, the Texas Tribune, Verdens Gang, Wales Online, Zeit Online and many many more from across the globe.

It is now freely available online at datajournalismhandbook.org and a print version is from O’Reilly can be pre-ordered and will be available as an e- and print book within the month.

The book was launched in the session ‘You too can be a Data Journalist’ and we hope it will inspire budding data journalists to try data journalism out for size. If you missed it, or any of the other sessions at the festival, check out this Storify (abridged below) for highlights, videos and the crib sheets from the sessions.

Data Journalism Handbook: Why is data journalism important?

April 25, 2012 in Data Journalism

The Data Journalism Handbook is a free, open source reference book for anyone interested in the emerging field of data journalism. It is the result of an international, collaborative effort involving dozens of data journalism's leading advocates and best practitioners – including from the BBC, the Chicago Tribune, the Guardian, the Financial Times, the New York Times, the Washington Post and many others.

The book will be made freely available online under a Creative Commons Attribution-ShareAlike license so anyone can read, copy, share, redistribute and reuse it. Additionally a printed version and an e-book will be published by O’Reilly Media. If you want to be notified when the book is released, you can sign up on the website

The handbook will be released this Saturday at the International Journalism Festival in Perugia. Here is an excerpt from the book where leading data journalism practitioners, advocates, and enthusiasts tell us… 

Why is Data Journalism Important?

book_oreilly_good_3d_mockup1.png

Filtering the Flow of Data

When information was scarce, most of our efforts were devoted to hunting and gathering. Now that information is abundant, processing is more important. We process at two levels: (1) analysis to bring sense and structure out of the never-ending flow of data and (2) presentation to get what's important and relevant into the consumer's head. Like science, data journalism discloses its methods and presents its findings in a way that can be verified by replication.

Philip Meyer (Professor Emeritus: University of North Carolina at Chapel Hill)

New Approaches to Storytelling

Data journalism is an umbrella term that, to my mind, encompasses an ever-growing set of tools, techniques and approaches to storytelling. It can include everything from traditional computer-assisted reporting (using data as a 'source') to the most cutting edge data visualisation and news applications. The unifying goal is a journalistic one: providing information and analysis to help inform us all about important issues of the day.

Aron Pilhofer (New York Times)

Like Photo Journalism with a Laptop

'Data journalism' only differs from 'words journalism' in that we use a different kit. We all sniff out, report, and relate stories for a living. It's like 'photo journalism' – just swap the camera for a laptop.

Brian Boyer (Chicago Tribune)

Data Journalism is the Future

Data-driven journalism is the future. Journalists need to be data-savvy. It used to be that you would get stories by chatting to people in bars, and it still might be that you'll do it that way some times. But now it's also going to be about poring over data and equipping yourself with the tools to analyse it and picking out what's interesting. And keeping it in perspective, helping people out by really seeing where it all fits together, and what's going on in the country.

Tim Berners-Lee (Founder of the World Wide Web)

Number-Crunching Meets Word-Smithing

Data journalism is bridging the gap between stat technicians and wordsmiths. Locating outliers and identifying trends that are not just statistically significant, but relevant to de-compiling the inherently complex world of today.

David Anderton (Freelance Journalist)

Updating Your Skills Set

Data journalism is a new set of skills for searching, understanding and visualising digital sources in a time that basic skills from traditional journalism just aren't enough. It's not a replacement of traditional journalism, but an addition to it.

In a time where sources go digital, journalists can and have to be closer to those sources. The Internet opened up possibilities beyond our current understanding. Data journalism is just the beginning of evolving our past practices to adapt to the online.

Data journalism serves two important purposes for news organisations: finding unique stories (not from news wires) and executing the watchdog function. Especially in times of financial peril, these are important goals for newspapers to achieve.

From the standpoint of a regional newspaper, data journalism is crucial. We have the saying 'a loose tile in front of your door is considered more important than a riot in a far-away country'. It's hits you in the face and impacts your life more directly. At the same time, digitisation is everywhere. Because local newspapers have this direct impact in their neighbourhood and sources become digitised, a journalist must know how to find, analyse and visualise a story from data.

Jerry Vermanen (NU.nl)

A Remedy for Information Asymmetry

Information asymmetry – not the lack of information, but the inability to take in and process it with the speed and volume that it comes to us – is one of the most significant problems that citizens face in making choices about how to live their lives. Information taken in from print, visual and audio media influence citizens' choices and actions. Good data journalism helps to combat information asymmetry.

Tom Fries (Bertelsmann Foundation)

An Answer to Data-driven PR

The availability of measurement tools and their decreasing prices, in a self-sustaining combination with a focus on performance and efficiency in all aspects of society, have led decision-makers to quantify the progresses of their policies, monitor trends and identify opportunities.

Companies keep coming up with new metrics showing how well they perform. Politicians love to brag about reductions in unemployment numbers and increases in GDP. The lack of journalistic insight in the Enron, Worldcom, Madoff or Solyndra affairs is proof of many a journalist's inability to clearly see through numbers. Figures are more likely to be taken at face value than other facts as they carry an aura of seriousness, even when they are entirely fabricated.

Fluency with data will help journalists sharpen their critical sense when faced with numbers and will hopefully help them gain back some terrain in their exchanges with PR departments.

Nicolas Kayser-Bril (Journalism ++)

To Provide Independent Interpretations of Official Information

After the devastating earthquake and subsequent Fukushima nuclear plants disaster in 2011, the importance of data journalism has been driven home to media people in Japan, a country which is generally lagging behind in digital journalism.

We were at a loss when the government and experts had no credible data about the damage. When officials hid SPEEDI data (predicted diffusion of radioactive materials) from the public, we were not prepared to decode it even if it were leaked. Volunteers began to collect radioactive data by using their own devices but we were not armed with the knowledge of statistics, interpolation, visualisation and so on. Journalists need to have access to raw data, and to learn not to rely on official interpretations of it.

Isao Matsunami (Tokyo Shimbun)

Dealing with the Data Deluge

The challenges and opportunities presented by the digital revolution continue to disrupt journalism. In an age of information abundance, journalists and citizens alike all need better tools, whether we're curating the samizdat of the 21st century in the Middle East, processing a late night data dump, or looking for the best way to visualise water quality for a nation of consumers. As we grapple with the consumption challenges presented by this deluge of data, new publishing platforms are also empowering everyone to gather and share data digitally, turning it into information.  While reporters and editors have been the traditional vectors for information gathering and dissemination, the flattened information environment of 2012 now has news breaking first online, not on the news-desk.

Around the globe, in fact, the bond between data and journalism is growing stronger. In an age of big data, the growing importance of data journalism lies in the ability of its practitioners to provide context, clarity and, perhaps most important, find truth in the expanding amount of digital content in the world. That doesn't mean that the integrated media organisations of today don't play a crucial role. Far from it. In the information age, journalists are needed more than ever to curate, verify, analyse and synthesise the wash of data.  In that context, data journalism has profound importance for society.

Today, making sense of big data, particularly unstructured data, will be a central goal for data scientists around the world, whether they work in newsrooms, Wall Street or Silicon Valley. Notably, that goal will be substantially enabled by a growing set of common tools, whether they're employed by government technologists opening Chicago, healthcare technologists or newsroom developers.

Alex Howard (O'Reilly Media)

Our Lives are Data

Good data journalism is hard, because good journalism is hard. It means figuring out how to get the data, how to understand it, and how to find the story. Sometimes there are dead ends, and sometimes there's no great story. After all if it were just a matter of pressing the right button, it wouldn't be journalism. But that's what makes it worthwhile, and – in a world where our lives are increasingly data – essential for a free and fair society.

Chris Taggart (OpenCorporates)

A Way to Save Time

Journalists don't have time to waste transcribing things by hand and messing around trying to get data out of PDFs, so learning a little bit of code, or knowing where to look for people who can help, is incredibly valuable.

One reporter from Folha de São Paulo was working with the local budget and called me to thank us for putting up the accounts of the municipality of São Paolo online (two days work from a single hacker!). He said he had been transcribing them by hand for the past three months, trying to build up a story. I also remember solving a 'PDF issue' for 'Contas Abertas', a parliamentary monitoring news organisation: 15 minutes and 15 lines of code solved a months worth of work.

Pedro Markun (Transparência Hacker)

An Essential Part of the Journalists' Toolkit

I think it's important to stress the 'journalism' or reporting aspect of 'data journalism'. The exercise should not be about just analysing data or visualising data for the sake of it, but to use it as a tool to get closer to the truth of what is going on in the world. I see the ability to be able to analyse and interpret data as an essential part of today's journalists' toolkit, rather than a separate discipline. Ultimately, it is all about good reporting, and telling stories in the most appropriate way.

Data journalism is another way to scrutinise the world and hold the powers that be to account. With an increasing amount of data available, now more than ever it is important that journalists are aware of data journalism techniques. This should be a tool in the toolkit of any journalist: whether learning how to work with data directly, or collaborating with someone who can.

Its real power is in helping you to obtain information that would otherwise be very difficult to find or to prove. A good example of this is Steve Doig's story that analysed damage patterns from Hurricane Andrew. He joined two different datasets: one mapping the level of destruction caused by the hurricane and one showing wind speeds. This allowed him to pinpoint areas where weakened building codes and poor construction practices contributed to the impact of the disaster. He won a Pulitzer Prize for the story in 1993 and it's great inspiration of what is possible.

Ideally you use the data to pinpoint outliers, areas of interest, or things which are surprising. In this sense data can act as a lead or a tip off. While numbers can be interesting, just writing about the data is not enough. You still need to do the reporting to explain what it means.

Cynthia O'Murchu (Financial Times)

Adapting to Changes in Our Information Environment

New digital technologies bring new ways of producing and disseminating knowledge in society. Data journalism can be understood as the media's attempt to adapt and respond to the changes in our information environment – including more interactive, multi-dimensional story-telling, enabling readers to explore the sources underlying the news and encouraging them to participate in the process of creating and evaluating stories.

César Viana (University of Goiás)

A Way to See Things You Might Not Otherwise See

Some stories can only be understood and explained through analysing – and sometimes visualising – the data. Connections between powerful people or entities would go unrevealed, deaths caused by drug policies that would remain hidden, environmental policies that hurt our landscape would continue unabated. But each of the above was changed because of data that journalists have obtained, analysed and provided to readers. The data can be as simple as a basic spreadsheet or a log of cell phone calls, or complex as school test scores or hospital infection data, but inside it all are stories worth telling.

Cheryl Phillips (The Seattle Times)

A Way To Tell Richer Stories

We can paint pictures of our entire lives with our digital trails. From what we consume and browse, to where and when we travel, to our musical preferences, our first loves, our children’s milestones, even our last wishes – it all can be tracked, digitised, stored in the cloud and disseminated. This universe of data can be surfaced to tell stories, answer questions and impart an understanding of life in ways that is currently surpassing even the most rigorous and careful reconstruction of anecdotes.

Sarah Slobin (Wall Street Journal)

Sneak Peek Inside the Data Journalism Handbook

April 21, 2012 in Data Journalism, Featured

After several months of hard work, the Data Journalism Handbook is almost ready to be released. The handbook will be launched during the School of Data Journalism at the International Journalism Festival in Perugia, Italy, next week.

The handbook is a free, open source reference book for anyone interested in the emerging field of data journalism. The book will be made freely available online under a CC BY-SA license so anyone can read and share it. Additionally a printed version and an e-book will be published by O’Reilly Media. If you want to be notified when the book is released, you can sign up on the website.

book_oreilly_good_3d_mockup1.png

Here is a preview of what will be in the book:

INTRODUCTION

DJH_1.png

What is data journalism? What potential does it have? What are its limits? Where does it come from? In this section we look at what data journalism is and what it might mean for news organisations. Paul Bradshaw (Birmingham City University) and Mirko Lorenz (Deutsche Welle) say a bit about what is distinctive about data journalism. Leading data journalists tell us why they think it is important and what their favourite examples are. Finally Liliana Bounegru (European Journalism Centre) puts data journalism into its broader historical context.

  • What Is Data Journalism? (Paul Bradshaw, Birmingham City University)
  • Why Journalists Should Use Data (Mirko Lorenz, Deutsche Welle)
  • Why Is Data Journalism Important? (Various Contributors)
  • Favourite Examples (Various Contributors)
  • Data Journalism in Perspective (Liliana Bounegru, European Journalism Centre)

IN THE NEWSROOM

DJH_2.png

How does data journalism sit within newsrooms around the world? How did leading data journalists convince their colleagues that it is a good idea to publish datasets or launch data-driven news apps? Should journalists learn how to code, or work in tandem with talented developers? In this section we look at the role of data and data journalism at the Australian Broadcasting Corporation, the BBC, the Chicago Tribune, the Guardian, the Texas Tribune, and the Zeit Online. We learn about how to spot and hire good developers, how to engage people around a topic through hackathons and other events, how to collaborate across borders, and business models for data journalism.

  • The ABC’s Data Journalism Play (Wendy Carlisle, Australian Broadcasting Corporation)
  • Data Journalism at the BBC (Bella Hurrel, BBC News)
  • How the News Apps Team at Chicago Tribune Works (Brian Boyer, Chicago Tribune)
  • Behind the Scenes at the Guardian Datablog (Simon Rogers, Guardian)
  • Measuring the Impact of Data Journalism (Texas Tribune)
  • Data Journalism at the Zeit Online (Sascha Venohr, Zeit Online)
  • How to Hire a Hacker (Lucy Chambers, Open Knowledge Foundation)
  • Harnessing External Expertise Through Hackthons (Jerry Vermanen, NU.nl)
  • Following the Money: Data Journalism and Cross-Border Collaboration (Paul Radu, Organized Crime and Corruption Reporting Project)
  • Our Stories Come As Code by Lorenz Matzat (OpenDataCity.de)
  • Kaas & Mulvad: Semi-finished Content for Stakeholder Groups (Mark Lee Hunter and Luk N. Van Wassenhove, INSEAD)
  • Business Models for Data Journalism (Mirko Lorenz, Deutsche Welle)

CASE STUDIES

DJH_3.png

In this section we take a more in depth, behind-the-scenes look at several data journalism projects – from apps developed in a day to nine-month investigations. We learn about how data sources have been used to augment and improve coverage of everything from elections to spending, riots to corruption, the performance of schools to the price of water. As well as larger media organisations such as the BBC, the Chicago Tribune, the Guardian, the Financial Times, Helsingin Sanomat, La Nacion, Wall Street Journal and the Zeit Online, we learn from smaller initiatives such as California Watch, Hacks/Hackers Buenos Aires, ProPublica, and a group of local Brazilian citizen journalists called Friends of Januária.

  • The Opportunity Gap (Scott Klein, ProPublica)
  • A 9 Month Investigation into European Structural Funds (Cynthia O'Murchu, Financial Times)
  • The Eurozone Meltdown (Sarah Slobin, Wall Street Journal)
  • Covering the Public Purse with OpenSpending.org (Lucy Chambers and Jonathan Gray, Open Knowledge Foundation)
  • Finnish Parliamentary Elections and Campaign Funding (Esa Mäkinen, Helsingin Sanomat)
  • Electoral Hack in Realtime (Hacks/Hackers Buenos Aires)
  • Data in the News: Wikileaks (Simon Rogers, The Guardian)
  • Mapa76 Hackathon (Mariano Blejman, Hacks/Hackers Buenos Aires)
  • The Guardian Datablog’s Coverage of the UK Riots (Farida Vis, University of Leicester)
  • Illinois School Report Cards (Brian Boyer, Chicago Tribune)
  • Hospital Billing (Steve Doig, Walter Cronkite School of Journalism of Arizona State University)
  • Care Home Crisis (Cynthia O'Murchu, Financial Times)
  • The Tell-All Telephone (Sascha Venohr, Zeit Online)
  • Which Car Model? MOT Failure Rates (Martin Rosenbaum, BBC)
  • Where Do the Subsidies for the Public Bus Transportation System in Argentina Go? (Angélica Peralta Ramos, La Nacion (Argentina))
  • Citizen Data Reporters (Amanda Rossi, Friends of Januária)
  • The "Big Board" for Election Results (Aron Pilhofer, New York Times)
  • The Price of Water (Nicolas Kayser Bril, Journalism++)

GETTING DATA

DJH_4.png

So, you're all ready to get started on your first data journalism project. What now? First of all you need some data. This section looks at where you can get it from. We learn how to find data on the web, how to request it using freedom of information laws, how to use `screen-scraping' to gather data from unstructured sources and how to use `crowd-sourcing' to collect your own datasets from your readers. We look at what the law says about republishing datasets, and how to use simple legal tools to let others reuse your data. Finally the section closes with some anecdotes and war stories about what our contributors have gone through to get hold of the data they were looking for.

  • A Five Minute Field Guide (Various Contributors)
  • Your Right to Data (Various Contributors)
  • Wobbing works. Use it! (Brigitte Alfter, Journalismfund.eu)
  • Getting Data from the Web (Friedrich Lindenberg, Open Knowledge Foundation)
  • The Web as a Data Source (Pete Warden, Independent Data Analyst and Developer)
  • Crowdsourcing Data at the Guardian Datablog (Marianne Bouchart, Bloomberg News)
  • Using and Sharing Data: the Black Letter, Fine Print, and Reality (Mike Linksvayer, Creative Commons)
  • Anecdotes and War Stories (Various Contributors)

UNDERSTANDING DATA

DJH_5.png

Once you've got your data, what do you do with it? What should you look for? What tools should you use? This section opens with some ideas on improving your data literacy, tips for working with numbers and statistics, and things to bear in mind while working with messy, imperfect and often undocumented datasets. We go on to learn about how to get stories from data, data journalists' tools of choice, and how to use data visualisation to give you insights into the topic you're looking at.

  • Become Data Literate in 3 Simple Steps (Nicolas Kayser-Bril, Journalism++)
  • Tips for Working with Numbers in the News (Michael Blastland, Freelance Journalist)
  • Basic Steps in Working with Data (Steve Doig, Walter Cronkite School of Journalism of Arizona State University)
  • The £32 Loaf of Bread (Claire Miller, WalesOnline)
  • Start With the Data, Finish With a Story (Caelainn Barr, Citywire)
  • Data Stories (Martin Rosenbaum, BBC)
  • Data Journalists Discuss Their Tools of Choice (Various Contributors)
  • Using Data Visualisation to Find Insights in Data (Gregor Aisch, Open Knowledge Foundation)

DELIVERING DATA

DJH_6.png

Once you've had a good look at your data and decided that there's something interesting to write about, how can you deliver it to the public? This section opens with short anecdotes about how leading data journalists have served their data up to their readers – from infographics, to open data platforms, to download links. Then we take a more extended look at how to build news apps, and the ins and outs of data visualisation. Finally we take a look at what you can do to engage your audience around your project.

  • Presenting Data to the Public (Various Contributors)
  • How to Build a News App (Chase Davis, Center for Investigative Reporting)
  • News Apps at ProPublica (Scott Klein, ProPublica)
  • Data Visualisation in Journalism: An Introduction (Sarah Cohen, Duke University)
  • Using Visualisations to Tell Stories (Geoff McGhee, Stanford University)
  • Different Charts Tell Different Tales (Brian Suda, (optional.is))
  • Data Visualisation DIY: our Top Tools (Simon Rogers, The Guardian)
  • How Data Visualisation is Used at the Most Read Daily Newspaper in Norway (John Bones, Verdens Gang)
  • Public Data Goes Social (Oluseun Onigbinde, BudgIT Nigeria)
  • Engaging People Around Your Data (Duncan Geere, Wired UK)

Poster about the Data Journalism Handbook for the Information Design Conference 2012

April 4, 2012 in Data Journalism, Open Data

Freelance infographic designer Lulu Pinney has kindly designed a wonderful poster which illustrates some of the topics covered in the Data Journalism Handbook, a free, open source reference book which shows how journalists can use data to improve the news.

She uses a process diagram to represent different aspects of the data journalist’s workflow – from getting data to delivering it to their readers in stories, visualisations and apps. The poster will be presented at the Information Design Conference 2012 in London. The design is based on Kate Hudson’s illustrations for the book.

The first edition of the book is nearly completed, ready for its launch towards the end of April at the International Journalism Festival in Perugia. If you want to be notified when the book is released, you can sign up on the website.

This blog post is cross-posted from jonathangray.org.

Illustrations for the Data Journalism Handbook

April 2, 2012 in Data Journalism, Open Data

Here is a preview of some illustrations for the Data Journalism Handbook, a free, open source reference book which shows how journalists can use data to improve the news. They were created by the talented Kate Hudson, based on the original designs she did for the book at MozFest 2011.

If you want to be notified when the book is released, you can sign up on the website.

CHAPTER 1: INTRODUCTION

What data journalism is and what it might mean for news organisations. Leading data journalists tell us why they think it is important and what their favourite examples are. Finally data journalism is examined in its broader historical context.

CHAPTER 2: IN THE NEWSROOM

How data journalism sits within newsrooms at the Australian Broadcasting Corporation, the Chicago Tribune, the Guardian, the New York Times, the Zeit Online, and elsewhere. We learn about how to hire developers, how to engage people around a topic through hackathons and other events, cross-border collaboration, and business models for data journalism.

CHAPTER 3: CASE STUDIES

Data journalists tell us about projects that they have worked on – from election monitoring to looking into how public funds are spent, from covering corruption and riots to in depth investigations into education and healthcare.

CHAPTER 4: GETTING DATA

Where to find data on the web, how to request it using freedom of information laws, how to screen scrape and crowdsource it, and how you can republish it and give others permission to reuse it.

CHAPTER 5: UNDERSTANDING DATA

How to make sense of your data – including tips on working with numbers and statistics, how to get stories from data, data journalists’ tools of choice, and how to use data visualisation to find insights in data.

CHAPTER 6: DELIVERING DATA

How to deliver your data to the public – from news apps, to data visualisations, to engaging audiences around your project.

This blog post is cross-posted from jonathangray.org.

Europe’s Energy wins a Silver Award at Malofiej 20

March 24, 2012 in Data Journalism, OKF Projects

This post is cross-posted from jonathangray.org.

Europe’s Energy, a project I helped to create to put EU energy targets into context, has just won a Silver Award at Malofiej 20. The Malofiej Awards recognise innovative infographics from around the world:

The Malofiej Awards have since 1993 given recognition to the best infographics published in print and on-line across the globe. Every March an international jury meets for nearly four days at the School of Communication at the University of Navarra in Pamplona (Spain). They select those works worthy of gold, silver or bronze medals from among dozens of entries submitted to the competition.

The awards are named after Alexander Malofiej, “an Argentinian cartographer considered to be a pioneer in infographics, and who died in 1987″. This year there were 1,356 entries from 151 media organisations in 29 countries.

The Europe’s Energy project was born out of a series of hackathons I organised in December 2010, which sought to encourage developers, designers and data journalists in several European cities to do useful and interesting things using information from Eurostat. Having a personal interest in climate change, energy and carbon emissions, I was very keen to look into datasets related to these areas. This became the focus of our event in London – and we looked into how energy consumption and production changed in different EU member states over time. The Guardian covered this and I did a writeup with a wishlist of other things we wanted to explore.

In early 2011, I spoke to a press officer at the European Commission who was looking for visual material that would help to put EU energy policies into context for a big meeting at the European Council. I spoke to Friedrich Lindenberg and Gregor Aisch and we decided to try and put something together to enable more people to understand what the EU energy targets meant, how ambitious they were, and how they looked in the context of energy production and consumption in different countries. Soon we were on regular calls with Dirk Heine, Guo Xu, Nathaniel Scheer, and doing late nights to try and get hold of the data we needed, to figure out what it meant, and how best to present it to the public. In just under two weeks the Europe’s Energy was born. In the last 24 hours we crowdsourced translations of the project into 16 languages and I wrote an article for the Guardian Datablog.

The project would not have been possible without the willingness of our team to drop everything and wade knee deep into EU energy policy, data wrangling and data visualisation design. It was supported by the LOD2 project, part of which aims to encourage more people to use open data to provide value to society.

There is a lot more work to be done to continue to gather, combine, interpret and present data related to climate change, energy and carbon emissions. Hopefully Europe’s Energy and projects like it will make some small contribution towards increasing public understanding of complex, but very important, issues. One hopes that Gregor’s beautiful graphics just might help to encourage a few more people to find out more about energy policy, and perhaps even to act to encourage decision makers to increase the share of green energy in the mix.

Finnish data journalism app contest

February 21, 2012 in Data Journalism, External, OKF Finland, Visualization

Helsingin Sanomat, Finland’s leading national paper, is organizing an article app contest to find data visualizations.

For many journalists today, it’s not a lack of open data that’s the problem, but a lack of the skills and off-the-shelf visualizations needed to make that open data useful to them.

A year ago, the Finnish government decided that in principle all data generated with taxpayer money should be free.

This has been leading to tonnes of great releases. At the beginning of May, the National Land Survey of Finland will release all its maps as open data. The National Audit Office of Finland has already released campaign funding data as a kind of API. The City of Helsinki has the Helsinki Region Infoshare project, that collects city-level data into one place. Statistics Finland are also publishing all their data openly.

Furthermore, Open data activists such as Antti Poikola and Petri Kola have been doing great work in lobbying the Government and creating a data ecosystem. An Open Knowledge Foundation chapter is about to be formed and Open Data activists are crowdsourcing Freedom of Information Act-related data requests on Tietopyynto.fi.

So we have plenty of data, but using and publishing it is still lagging behind. This is especially true with the major media outlets. Journalists are still publishing static charts with their articles online or using Google Fusion Tables to make very basic visualizations. Not very innovative.

To tackle this problem, Helsingin Sanomat is organizing a contest to find article apps.

By article apps we mean applications that can be embedded into any web site in 560×400 pixel Iframe. An article app should visualize some interesting data, with the possibility of user interaction or of displaying data inputted by the users.

There are 3000 euros worth of prizes. Developers will not lose any rights to the works they submit to the contest. The contest is open to everyone, and the deadline for submissions is the 8th April 2012. More info can be found here.

There are few limitations for the article apps, but we hope that the apps use open data. If the article app crowdsources data from the users, it would be great if the data could be exported openly.

One part of this process has been to think about the business models of open data journalism. The idea behind the article app format is to standardize at least one format in data journalism. When we have some kind of standard, it will be easier to buy and to sell data journalism.

Our suggestion is that outlets buy the license to publish an article app once with each article – regardless whether it’s published at the HS.fi site, in our iPad application or some other channel. The next time we use the same graph with different data, we would pay the license fee again. For one article the compensation would be quite low, but if the app is used hundred times, it would be higher.

This business model is still theoretical, as we have not published anything using this model. Also, the amount we would pay for one article is still unclear, as we have not had any discussions with developers. We’d love to hear your thoughts on

Data Journalism Awards – Call for Entries!

February 20, 2012 in Data Journalism, External

Showcase your work and win a chance to €45,000 in prizes by applying for the first ever Data Journalism Awards

In an age of overwhelming abundance of data, journalists and media organisations are learning to separate signal from noise in order to provide valuable insights to society. From the Guardian to the New York Times, La Stampa to Die Zeit, journalists and media organisations are experimenting with new ways of using data to improve reportage of complex issues and to give readers direct access to the sources behind the headlines. As Tim Berners-Lee says, “data-driven journalism is the future.”

To recognize and showcase outstanding work, as well as highlight best practices in this fast-growing field, the first international Data Journalism Awards (DJA) has been established this year. The DJA is organised by the Global Editors and is sponsored by Google. The competition is run by the European Journalism Centre.

A jury of data journalism experts and editors from all over the world, including from prestigious organisations like New York Times, Reuters, and Les Echos will award a total of €45,000 (over $55,000) to six winners. The jury is headed by Paul Steiger, founder of ProPublica.

There are three award categories awarded at both (i) national and international and (ii) local and regional levels to give a total of six prizes. The three categories are:

  1. Data-driven investigative journalism
  2. Data visualisation & storytelling
  3. Data-driven applications

How to apply

The competition is open to media companies, non-profit organisations, freelancers and individuals. Applicants are welcome to submit their best data journalism projects before 10 April 2012 at http://datajournalismawards.org/ submit-your-work/.

Find out more about the competition and how to apply at datajournalismawards.org. If you have any questions about the competition get in touch with Liliana Bounegru, DJA Coordinator (bounegru [at] ejc [dot] net).


Relevant links:
Website: www.datajournalismawards.org
Facebook page: http://www.facebook.com/pages/Data-Journalism-Awards/305250662849826
Twitter: @ddjournalism, @EditorsNet
Twitter hashtag #dja

The Data Journalism Handbook: Final call for contributions

February 6, 2012 in Data Journalism, Events, Our Work

This post is by Liliana Bounegru, Project Coordinator at the European Journalism Centre, and Lucy Chambers, Community Coordinator at the Open Knowledge Foundation. It is cross posted on DataDrivenJournalism.net

Where can I find data and how can I request access to it? What tools are available to me? How can I find useful stories within datasets? And, most importantly, how can I make a living through the practise of data journalism?

Do these questions sound familiar? They should, because these are all questions that any data journalist who is just getting started should be asking.

Leading data journalists from the New York Times, the Guardian, the BBC, and other top media organisations from around the world, are working together to answer all of these questions (and more) in the Data Journalism Handbook. The handbook is the first comprehensive practical guide to data journalism.

As Aron Pilhofer, editor of Interactive News at the New York Times, noted at the beginning of the project:

> “A project like this is quite necessary. It’s kind of surprising that someone hasn’t tried to do this until now.”

Interested in contributing? We want YOU!

Work on the handbook was kick-started in November of last year at the Mozilla Festival in London. In just two days, 55 contributors drafted 60 pages (20,000 words) for six chapters. The handbook is a community project; therefore, anyone who has experience in data journalism can help draft the book. The work, in turn, goes back to the community as anyone is allowed to freely use, modify, adapt, and reuse the handbook.

Contributions have been flooding in and since Mozilla we have been editing, updating, filling in gaps, and restructuring. We are very close to a first complete draft of the book and we need your help to get there. We are looking for authors, editors and peer reviewers to draft chapters, review content, style and accuracy of the book.

In the table of contents below you can see the progress that has been made and where input is needed:

  • 0. Preface (in progress)
    • 0.1 The purpose of this book
    • 0.2 Add to this book
    • 0.3 Share this book
  • 1. Introduction (done)
    • 1.1 What is data journalism?
    • 1.2 Why is it important?
  • 2. Introducing data journalism in the newsroom
    • 2.1 Changes in the newsroom (contributors needed)
    • 2.2 How is it done: journo-developers vs. coders for hire (in progress)
  • 3. Types of outcomes/projects and case studies
    • 3.1 Data powered stories (in progress)
    • 3.2 Data served with stories (in progress)
    • 3.3 Data driven applications (contributors needed)
  • 4. Working on the data story
    • 4.1. Step 1: Getting your data
      • 4.1.1 Where does data live? (in progress)
      • 4.1.2 Asking for data (in progress)
      • 4.1.3 Getting your own data (in progress)
      • 4.1.4 Crowdsourcing data (contributors needed)
    • 4.2 Step 2: Understanding your data
      • 4.2.1 Data literacy (done)
      • 4.2.2 Working with data tips (in progress)
      • 4.2.3 Tools and techniques for analysing data (contributors needed)
      • 4.2.4 Harnessing expert opinions: Annotating datasets (contributors needed)
    • 4.3 Step 3: Finding a story in your data (in progress)
      • 4.3.1  From datasets to stories – approaches 
    • 4.4 Step 4: Delivering your data project (contributors needed)
      • 4.4.1 Serving data with stories 
      • 4.4.2 Visualising data 
      • 4.4.3 Data driven applications 
  • 5. Engagement, outreach and community (contributors needed)
  • 6. How to make data journalism sustainable
    • 6.1 Measuring impact (contributors needed)
    • 6.2 Business models (contributors needed)
  • 7. Appendix (in progress)
    • 7.1 Further resources 
    • 7.2 Glossary

Whether you are a budding data journalist putting the manual through its paces or someone with years of experience, we’d love your help to fine-tune the book and get it ready for the press (no pun intended).

If you’re interested to contribute to the first Data Journalism Handbook please fill in this form as soon as possible indicating your level and area of expertise and the chapter that you would like to contribute to. We will get in touch with you as soon as we can once you’ve submitted the form.

Deadline

A first complete draft is planned to be ready by the end of February.

The official launch of the The Data Journalism Handbook will be at the International Journalism Festival in Perugia from 25-29 April. The book will be available online as an e-book. Participants at the festival will have the opportunity to buy a printed copy of the book and enjoy a meet and greet with the authors where they may exchange knowledge and learn helpful tips about how to successfully become data journalists.

Questions?

For questions get in touch with the Data Journalism Handbook coordinators: Liliana Bounegru (bounegru [at] ejc.net) or Lucy Chambers (lucy.chambers [at] okfn.org).

Diving into Data: The School of Data Journalism at the International Journalism Festival in Perugia

February 2, 2012 in Data Journalism

This post is by Liliana Bounegru, Project Coordinator at the European Journalism Centre, and Lucy Chambers, Community Coordinator at the Open Knowledge Foundation. It is cross posted on DataDrivenJournalism.net and journalismfestival.com.

In the past investigative reporters would suffer from a scarcity of information relating to questions they were trying to answer. While this is still the case, today journalists are also faced with an overwhelming abundance of data. In an age of information overload, to stay relevant to society journalists need to learn to separate signal from noise in order to provide valuable insights. Journalists need to be equipped with knowledge of the tools, techniques and tactics of working with data in order to derive maximum value from for their readers.

The European Journalism Centre and the Open Knowledge Foundation are pleased to invite you to the School of Data Journalism hosted at the sixth edition of Italy’s leading journalism event, the International Journalism Festival. The 2012 edition takes place in the beautiful city of Perugia between 25-29 April. Entry to the School of Data Journalism panels and workshops is free. Each workshop has a limited number of places and therefore registration will be necessary. Please note that not all requests to participate in the workshops will be accepted.

What is the School of Data Journalism and who is it for?

The School consists of three panel discussions and five workshops.

The panels attempt to provide answers to crucial questions for aspiring data journalists, editors and decision-makers in newsrooms:

  • What can aspiring data journalists learn from the successes of the past?
  • How can data journalism save your newsroom?
  • How do you start a data journalism operation?
  • How can you become a data journalist and what do you need to do?

In the workshops journalists who are interested to get started with reporting with data and budding data journalists will learn from experienced data journalists and open data experts essential skills related to how to get the data you need, how to analyse it, how to get stories from data and how to present your stories.

Panels

Panel 1: News and numbers: from CAR to data journalism (Thursday, 26 April)

Journalists have always used data and numbers to produce stories…and win Pulitzers. From Philip Meyer’s coverage of the Detroit riots in 1967 to Steve Doig’s ‘What Went Wrong’ analysis of the damage patterns from Hurricane Andrew, data-driven reporting has brought valuable public service and won journalists recognition and prizes.

Whereas there may be distinguishing aspects about the data journalism of today and the computer-assisted reporting of the past, it is crucial to learn from successful examples, techniques and approaches of the past.

  • How can we bring the data journalism community and the CAR community closer together?
  • What can aspiring data journalists learn from the successes of the past?
  • What is the future of data journalism?

Panel 2: How can data journalism save your newsroom? (Friday, 27 April)

Journalism is under siege. Traditional models are collapsing. Developing the know-how to use the available data more effectively, to understand it, communicate and generate stories based on it, could be a huge opportunity to breathe new life into journalism.

  • What is the potential of data journalism?
  • How do you start a data journalism operation?
  • How can you start thinking about making money with data journalism?

Panel 3: You Too Can Be A Data Journalist! (Saturday, 28 April)

Where can I find data? How can I request data? What tools can I use? How can I find stories in data? How can I make money with data journalism?

Several leading data journalists, CAR specialists and journalism professors from the Guardian, the New York Times, Financial Times, Chicago Tribune, Medill School of Journalism and Cronkite School of Journalism worked together to answer these questions in the Data Journalism Handbook.

The handbook, the first comprehensive practical guide to data journalism, will be officially launched in this session. The session will provide the opportunity to meet and greet authors of the book, exchange knowledge and learn from them what you need to know to be a data journalist, as well as get a printed copy of the book.

Speakers

  • Caelainn Barr, EU data journalist, formerly with the Bureau of Investigative Journalism in London
  • Sarah Cohen, Knight Professor of the Practice of Journalism and Public Policy, DeWitt Wallace Center for Media and Democracy, Duke University (Pulitzer prize winner)
  • Steve Doig, Knight Chair in Journalism, Walter Cronkite School of Journalism, Arizona State University (Pulitzer prize winner)
  • Mirko Lorenz, data journalism trainer, Deutsche Welle
  • Aron Pilhofer, editor of Interactive News at The New York Times and co-founder ofDocumentCloud.org
  • Dan Nguyen, news application developer, ProPublica
  • Simon Rogers, editor of the Guardian Data Blog

Workshops

Workshop 1: Scraping data & cracking PDFs (Thursday, 26 April)

Hands up who knows what machine-readable data is? You will soon, and more importantly, how to get it and what you can do with it once you have it!

A workshop targeted at scraping from scratch, including: What PDFs and webpages look like to your computer – An introduction to machine-readable / non-machine readable data The Scraper Cookbook – an overview of the key things you need to know to write a scraper Hands on session – learning to screen-scrape. Main focus: hands on session using tools such as ScraperWiki. If sufficient interest & time, we will also touch on some of the tools & skills needed to extract data from PDFs. Error checking – how to check what you have makes sense, spotting the types of errors sometimes introduced if you don’t get it quite right!

Workshop 2: Information wants to be free – Freedom of information requests and how to use them (Friday, 27 April)

Freedom of information requests are constantly evolving. Law changes and technological advancements make it increasingly easier to file and systematise FOI requests, and importantly track their progress through the system. This workshop includes demonstrations and case studies examining the current state-of-play with FOI requests in Europe and beyond and looking into what’s next for the freedom of information movement.

Workshop 3: Making data pretty (Friday, 27 April)

Journalism is no longer just a block of prose on a page. The modern reader often demands maps, infographics and visualisations to make the story jump out at them, particularly in digital environments.

There are a vast array of free tools available on the web to allow data-journalists to quickly and easily digest, process and display the data powering their stories. This workshop aims to give a good overview of what is currently available and delve into depth on one of the most powerful: Google Fusion Tables.

Workshop 4: Getting Stories from Data (Saturday, 28 April)

Enormous datasets can often prove extremely daunting to the unfamiliar. Mistakes and crimes have historically benefited from, and triumphs and good decisions been obscured by, a mask of bewildering numbers and statistics and gone unreported.

Large datasets often hold a wealth of undiscovered stories for those willing to invest the time into exploring them. This workshop is a ‘spotters’-guide’ for things to look out for and where to look for datasets.

Workshop 5: Spending Stories (Sunday, 29 April)

To get to the bottom of a story, you need only to ‘follow the money’. The same is true of government: budgeting is where policies and priorities are broken down into figures. Financial programming has a direct influence on all political areas: while other data on health or social help us understand what challenges society faces, looking at spending data allows us to see how government reacts to all of these.

There are many spending databases available on the web, some impenetrable, some accessible for analysis. We’ll show how we enable journalists and researchers to make sense of the data and what strategies can be used to investigate stories and policies.

  • Overview: What datasets are out there? Where can you look for more?
  • Interrogating databases, how to extract the maximum amount of data out of tricky databases
  • Tools for spending analysis, how to slice and dice once you have your data

Workshop leaders

  • Caelainn Barr, formerly with the Bureau of Investigative Journalism, working on the award-winning Structural Funds investigation
  • Heather Brooke, journalist and FOI campaigner, uncovered the UK MP expenses scandal
  • Lucy Chambers, Open Knowledge Foundation, Community Coordinator for OpenSpending and the ‘Spending Stories’ project.
  • Steve Doig, Knight Chair in Journalism, Walter Cronkite School of Journalism, Arizona State University (Pulitzer prize winner)
  • Friedrich Lindenberg, Open Knowledge Foundation Deutschland, Developer on OpenSpending
  • Dan Nguyen, news application developer, ProPublica
  • Simon Rogers, editor of the Guardian Data Blog

When and where?

The Data Journalism School takes place at the International Journalism Festival in Perugia between 25 and 29 April 2012. The schedule of the Data Journalism School, with confirmed speakers for each panel and workshop, will be posted on the festival website in early February.

How to register?

Entry to the festival and the School are free. There is no registration process to attend the festival. For the workshops there is a limited number of available seats. To secure a seat in the workshops please register via this form. The deadline for workshop registration is 20 March 2012. You will be notified by email by 25 March at the latest if we were able to confirm you a seat. The workshops are entry-level. Consideration will be given to your experience, skills and motivation to attend the workshop when making the selection.

What do you need to bring?

A lot of enthusiasm and a laptop for the workshop sessions are required. Please note for hands-on workshops tablet PC’s will not be appropriate.

Questions?

If you have questions about the School of Data Journalism get in touch with the coordinators: Liliana Bounegru (bounegru [at] ejc.net) or Lucy Chambers (lucy.chambers [at] okfn.org).

Please create an account to get started.

Sign up to the Open Knowledge Newsletter

Get Updates