I’m going to throw a bunch of data at you now. I want to pass along some great information I discovered tonight, but my brain is having trouble getting into a decent writing mode. Regard this as a jumbled bag of nice quality chocolate instead of a nicely laid out tray of nice quality chocolates. OK?
10 Things Paul Bradshaw Learned about Data Journalism
My favorite online communication and online journalism group in Denmark invited Paul Bradshaw to speak at a Dona.dk meeting tonight. The talk was entitled “10 things I learned about data journalism”, but I am not going to list all ten items (because I didn’t write them all down). I was trying to listen, learn, absorb, and think.
I took home some great resources and a few very nice quotes.
The Usual Suspects
Let’s start with the resources – people actually. A lineup of Paul Bradshaw’s People to Watch (hmmm, POW seems like a good acronym in light of the topic) are
- Tom Steinberg with My Society, “one of UK’s best known democracy websites”
- Charles Arthur, Technology Editor at The Guardian
- Heather Brooke, “journalist, writer, and freedom of information activist” according to her description on Wikipedia
- Tim Berners-Lee (without his invention, would you be reading something called a blog?)
- Chris Taggart, who launched Open Charities and Open Corporates (rather impressive sites)
Yeah, you could say that some of the work being done in this list does have a “pow” effect. Anyway, five people, and reading about them and what they are working on should keep you busy for the next many days, months, years, etc.
Compile. Clean. Connect.
Add the word data to that and you get:
Compiling data.
Cleaning data.
Connecting data.
This is the job of a data journalist. There is a ton of news hidden in data. The data journalist has to dig out the information and be able to present it to the public in an accurate fashion. That means digging up, or compiling, the data, and then examining it closely and carefully. Finally, those bits and pieces have to be put together in some coherent fashion that is objective and relevant and so on. These three C’s are one of the lessons learned.
The data journalism continuum, as Bradshaw describes it, is similiar: finding, interrogating, visualizing, mashing. There are double-headed arrows between those stages so you can move along the continuum in both directions. And you will.
I began to think about recent comments on how reporting from the Japan earthquake and tsunami was inaccurate, and how news about the nuclear reactors was especially poor. It’s this type of situation Paul Bradshaw is discussing – gathering the data, going through it, questioning and checking it all, and possibly starting over if something is incorrect. It’s a demanding job and the flaws will show if it is not done properly. However, an unsuspecting public might now realize the weaknesses right away, and then we can get skewed or completely wrong impressions of the news. (I see parallels to the job of many technical communcators, too.)
When he showed the data journalism continuum, Paul Bradshaw was talking about his first lesson: you can never be an expert. There is so very much to do. One person may not necessarily be able to do some coding to work over the data and make super sharp data visualizations and so on. Data journalism is too all-encompassing to master.
Another person mentioned was Adrian Holovaty from Everyblock. He was apparently doing mashups similar to Google mashups – before they released the API! This was to plot in crime reports for Chicago. Now he has moved on to Everyblock. Who doesn’t want an Everyblock to stayed tuned to local neighborhood news? (I’m betting that this was discussed at the Cognitive Cities conference last month. It’s also the type of news that would be popular at Reboot conferences.)
Seeing Data
“Visualization is about taking things away.” Visualizations are great for instant communication, but they should always link to the full data set and give proper credit to sources, Bradshaw said. The reference to this point was this Tree Hugger article about last year’s volcano story and its infographic. We also had a slide with the image in this article by David McCandless. If you don’t know about David McCandless, check out that site. His material is drool-worthy. He is a must-know resource for visualization.
There was another great visualization with a great story in a tale from Mexico. It was from a post that also involves Benford’s Law. Read the post for an explanation of Benford’s Law and for some great links. Your mind will boggle.
Data journalism has commercial potential in that it is “sticky, engaged, targeted, and valuable”. A visualization might be here today and gone tomorrow, from one perspective. The in-depth analysis can be stickier – sticking in people’s minds. I feel more engaged as a reader when I see attribution and references and well-articulated arguments about some topic. OK, maybe someone can fool me, but I think when you are reading the work of a good data journalist, you’ll sense the craftsmanship.
I was quite intrigued by The Guardian’s Datastore on Flickr.com. They state: Take our data, mash it up and create great visualisations with it – then post them on here. Whoa. The point here? “Data is a social object.”
That could be a closing comment to this post – it was Bradshaw’s last slide, and it’s a good one. However, I was captivated by another lesson, which I tweeted out after the talk.
Numeracy should be as valued by news orgs as literacy.
And in parentheses: Every newsroom should have a statistician. See? That’s back to not being an expert at everything. Maybe you do not know stats that well. Get someone who does!
Epilogue
There is a Delicious list of references for this talk.
The most delicious link is Paul Bradshaw’s website. I signed up to get his posts by email because my RSS feeder suffers from neglect at the moment. Signing up for news there means I’ll know when his book, “The Online Journalism Handbook”, comes out. I know someone who will be interested in that! If this chapter on UGC is in that handbook, I think some technical communication hearts will go pitter-patter!
This post was written to PInk Floyd’s “Dark Side of the Moon”. Many references in the talk rang bells in my mind. Most references were from the UK and through @stcaccess, I happen to follow a number of people in the UK who happen to be involved in open data and local government. Of course, they are passionate about what they are doing, but despite only knowing a handful of people doing this, I feel like there is a big movement of ordinary citizens doing extraordinary things with open data in their community. They are the crowd that can be sourced by the data journalist. One of those people is the very nice @pigsonthewing, who happens to be connected to Pink Floyd and who happens to come from Birmingham, just like Paul Bradshaw. For some silly reason, writing about the topic of data journalism and thinking about the local/open government tales I’ve heard over the past two years seemed most appropriate with Pink Floyd lending a hand.
Hi Karen
Thanks for the nice recap of the Paul Bradshaw-talk. I really don’t have that much to add besides the fact that Paul, as he also told yesterday, strongly believes that datajournalism can be profitable – an investment worth exploring for media outlets.
His points about datajournalism giving value to advertisers is interresting. The fact that placing ads along with interactive content – and targeting ads depending on what content the user looks at – might prove part of how the media can make money from doing this kind of journalism. And it will also become a strong argument for those of us who consider data to be free.
Thanks for stopping by, Martin. I meant to reply earlier. 🙂 However, being delayed means I can share this link from the Guardian:
http://www.guardian.co.uk/news/datablog/2011/apr/07/data-journalism-workflow
It’s “Data journalism broken down: what we do to the data before you see it. What happens before our data journalism produces results? See how the process works.”
I’m flattered that so many people liked this post. I regarded it merely as a collection of thoughts and nothing substantial.
I’m glad you commented on the advertising part. I got a bit distracted at that point and didn’t get good notes. As a consumer of news, I love finding someone I can trust to curate the ton of news that is happening. I know it costs money. How do we strike the balance between what I can afford to stay in touch with a special curator (as opposed to the firehose of social networks) and what the industry can afford to stay afloat. By afford, I mean both money and mental/emotional engagement. We live in interesting times – isn’t that the usual answer? 😉
[…] to think that attending a lecture by Paul Bradshaw last year could lead to such delight in reading these valuable blog […]