This is a list of both useful and eclectic articles and guides to data journalism, in no particular order, though I’ve sorted them into rough categories for now. This list is auto-generated from a Google Spreadsheet
Title | Author | Published | Category | Type | Added |
---|---|---|---|---|---|
An international collaboration of the best of data journalism worldwide.
|
The European Journalism Center and the Open Knowledge Foundation
datajournalismhandbook.org |
2012 | Reporting | Book | 9/4/2013 |
Daniel Gilbert had spreadsheets with thousands of rows of data. "There was a story there, he was certain. But control-f would not find it." After he took a class on how to use Microsoft Access, he could finally ask and answer the questions he knew were in the data. Gilbert's meticulously detailed series on landowners being cheated by oil and gas companies ended up winning a Pulitzer.
|
Janet Paskin with Daniel Gilbert
www.cjr.org |
11/9/2010 | Reporting | Article | 9/4/2013 |
Anthony DeBarros
www.anthonydebarros.com |
Reporting | Article | 9/4/2013 | ||
How the structure of data can create new kinds of storytelling.
|
Matt Waite
source.mozillaopennews.org |
Reporting | Article | 9/4/2013 | |
How exactly do you define "worst"? Here's a high-level description of the factors reporters used to rank the performance of charity, and the amount of data they had to sift and analyze to do so.
|
Kendall Taggart and Kris Hundley
www.tampabay.com |
6/6/2013 | Reporting | Article | 9/4/2013 |
Derek Willis
dwillis.github.io |
Reporting | Guide | 9/4/2013 | ||
Richard Parker
www.cjr.org |
Reporting | Article | 9/4/2013 | ||
Nicholas Diakopoulos
towcenter.org |
Reporting | Article | 9/4/2013 | ||
A video report on data visualization as a storytelling medium
|
Geoff McGhee
datajournalism.stanford.edu |
9/28/2010 | Reporting | Article | 9/4/2013 |
Using the MTA turnstile data to see what kind of crowds Jay-Z drew at Jay-Z's inaugural concert with Barclays Center.
|
John Keefe
johnkeefe.net |
2012-09 | Reporting | Article | 9/12/2013 |
WNYC's political team wanted to know: how much money was being raised by candidates for a state legislative district from within the district itself? John Keefe walks through his use of Fusion Tables and additional database techniques for the analysis and visualization.
|
John Keefe
johnkeefe.net |
2011-09 | Reporting | Guide | 9/12/2013 |
One day John Keefe was just browsing the official NYC data site and tucked away some code that involved the city's flood zones. When Hurricane Irene stuck, NYC.gov was flooded (with web traffic) and WNYC became an invaluable resource.
|
John Keefe
johnkeefe.net |
Reporting | Article | 9/12/2013 | |
Why aren't more Congressional reporters taking advantage of the copious generated by our legislators?
|
Derek Willis
thescoop.org |
Reporting | Article | 9/12/2013 | |
Chrys Wu
blog.chryswu.com |
Reporting | Article | 9/14/2013 | ||
A brilliant walkthrough of benign data sets and the code that could potentially launch investigative projects.
|
Kate Martin, Grant Smith, Megan Luther
docs.google.com |
Reporting | Guide | 9/14/2013 | |
Adam Hochberg
www.poynter.org |
Reporting | Article | 9/14/2013 | ||
This 2013 Pulitzer Public Service winning story epitomizes the best of watchdog journalism, as its expose of a deadly public injustice sparked swift reform. But it deserves an award for its ingenious do-it-yourself way of collecting the kind of data that, in theory, was non-existent.
|
Sally Kestin and John Maines
www.ire.org |
Reporting | Article | 9/14/2013 | |
Sarah Harkins
ire.org |
Reporting | Article | 9/15/2013 | ||
OK, this is not a traditional data journalism book, though it's one of the best true crime and/or journalism books ever written. Simon does keep and review a tally of the bodies and where they marticulate in the Balitmore justice system throughout the year. Also, there's a memorable passage about the "murder board", a simple visualization that has unintended consequences on the efficency of the homicide squad.
|
David Simon
www.amazon.com |
Reporting | Book | 9/18/2013 | |
The modern computing age may have irreperably damaged the business of journalism, but it may also streghten the act of journalism.
|
Sarah Cohen, James T. Hamilton, Fred Turner
cacm.acm.org |
Reporting | Article | 9/21/2013 | |
Paul Bradshaw
www.theguardian.com |
10/1/2010 | Reporting | Article | 9/23/2013 | |
This landmark study that studied whether financial disclosure laws were actually being followed was the result of old-fashioned data collection: travelling to Minnesota and manually copying documents, before analyzing them in Access.
|
Joseph S. Ross, Josh E. Lackner, et al.
jama.jamanetwork.com |
Reporting | Article | 9/27/2013 | |
"That intellectual curiosity. That bullshit detector for lack of a better term, where you see a data set and you have at least a first approach on how much signal there is there...That stuff is kind of hard to teach through book learning. So it’s by experience. I would be an advocate if you’re going to have an education, then have it be a pretty diverse education so you’re flexing lots of different muscles...You can learn the technical skills later on"
|
Walter Frick
blogs.hbr.org |
Reporting | Article | 10/1/2013 | |
Is it possible to run a mugshot-aggregating website and still follow journalistic ethics?
|
Matt Waite
source.mozillaopennews.org |
3/11/2013 | Reporting | Article | 10/6/2013 |
When there is no data, build your own. After discovering that no one was keeping track of Washington D.C.'s numerous homicide victims, Laura and Chris Amico simply started counting. The hand-built data source became a community resource as well as a source for data analysis.
|
Erin Kissane, with Laura and Chris Amico
contentsmagazine.com |
Reporting | Article | 11/17/2013 | |
"But when you open up the Excel spreadsheets from the VBA, you’ll find lots of figures with little explanation. So we’ve put together this guide to get you started in your pursuit of good local, state, regional and national stories that are to be had in the data. A grounding in Excel basics is needed to follow along."
|
Shane Shifflett
nationalsecurityzone.org |
12/6/2013 | Reporting | Article | 12/6/2013 |
Scott Klein
www.niemanlab.org |
12/16/2013 | Reporting | Article | 12/16/2013 | |
A great example of making use of limited data through plain deductive reasoning, cleverness, and good reporting. Comes with pretty pictures, too.
|
Al Shaw and Justin Elliott
www.propublica.org |
Reporting | Guide | 1/13/2014 | |
Harry Jaffe
www.washingtonian.com |
Reporting | Article | 1/24/2014 | ||
Brad Racino and Joe Yerardi
inewsource.org |
Reporting | Guide | 1/31/2014 | ||
www.propublica.org |
Reporting | Article | 3/6/2014 | ||
Judy Bayer and Marie Taillard
blogs.hbr.org |
9/27/2013 | Reporting | Article | 3/6/2014 | |
Computers have made mapping almost too easy, to the point where we overlook clearer, more efficient ways to display data that isn't particularly geospatially interesting.
|
Matthew Ericson
www.ericson.net |
10/14/2011 | Visualization | Article | 9/4/2013 |
A well-lillustrated list of design concepts to bring to visualizations and news interactives.
|
Lena Groeger
www.propublica.org |
5/30/2013 | Visualization | Article | 9/4/2013 |
Are modern web interactives becoming too complicate for users to interpret? Probably.
|
Shazna Nessa
source.mozillaopennews.org |
6/13/2013 | Visualization | Article | 9/4/2013 |
How ugly formatting, including improperly bolded text, can obscure the important information in technical presentations.
|
Edward Tufte
www.edwardtufte.com |
Visualization | Article | 9/4/2013 | |
A great discussion between Edward Tufte and his readers on why pie charts aren't ideal.
|
Edward Tufte discussion board
www.edwardtufte.com |
Visualization | Article | 9/4/2013 | |
Chart types, from simple to exotic, in tabular form.
|
Angela Zoss
guides.library.duke.edu |
2010 | Visualization | Guide | 9/4/2013 |
Applying Tufte's principle of small multiples to online video thumbnails.
|
Matt Linderman
37signals.com |
2/13/2007 | Visualization | Article | 9/4/2013 |
Charts can be too beautiful for their own good.
|
Kaiser Fung
junkcharts.typepad.com |
Visualization | Article | 9/4/2013 | |
An awe-inspiring visual essay from the creator of D3, its purpose "to share my love of examples with you."
|
Mike Bostock
bost.ocks.org |
Visualization | Article | 9/4/2013 | |
Drawn sketches can hint at what's visually compelling to the designer.
|
Jason Fried
37signals.com |
8/27/2009 | Visualization | Article | 9/4/2013 |
This is not really about data, but about the details of the creation process, and the importance of streamlining your workflow.
|
Jason Fried
37signals.com |
Visualization | Article | 9/4/2013 | |
Another popular web visualization that is more junk than data.
|
Jacob Harris
www.niemanlab.org |
Visualization | Article | 9/4/2013 | |
When you want something done beautifully, sometimes you have to do it yourself.
|
Alastair Coote
blogging.alastair.is |
Visualization | Guide | 9/4/2013 | |
Bret Victor
worrydream.com |
Visualization | Article | 9/4/2013 | ||
Bret Victor
worrydream.com |
Visualization | Article | 9/4/2013 | ||
A critique of a well-liked New York Times interactive.
|
Bret Victor
worrydream.com |
Visualization | Article | 9/4/2013 | |
Daniel Brownstein
dabrownstein.wordpress.com |
8/3/2013 | Visualization | Article | 9/4/2013 | |
A critique of map design and thresholds, in the contest of the NYPD's stop-and-frisk data
|
Steven Romalewski
spatialityblog.com |
7/27/2012 | Visualization | Article | 9/4/2013 |
Ian Storm Taylor
ianstormtaylor.com |
Visualization | Article | 9/4/2013 | ||
A case for where hand-editing has advantages over algorithmic rendering.
|
Seth Stevenson
www.slate.com |
Visualization | Article | 9/4/2013 | |
Susan Schulten
opinionator.blogs.nytimes.com |
Visualization | Article | 9/4/2013 | ||
John Mackenzie
www.udel.edu |
Visualization | Guide | 9/4/2013 | ||
Andy Boyle
blog.apps.chicagotribune.com |
Visualization | Guide | 9/4/2013 | ||
The beach ball chart is about as dumb as it sounds.
|
Joe Germuska
blog.apps.chicagotribune.com |
Visualization | Article | 9/4/2013 | |
Christopher Groskopf
blog.apps.chicagotribune.com |
Visualization | Guide | 9/4/2013 | ||
John Keefe
johnkeefe.net |
Visualization | Guide | 9/4/2013 | ||
Al Globus
www2.cs.uregina.ca |
Visualization | Article | 9/4/2013 | ||
Jim Vallandingham
vallandingham.me |
Visualization | Guide | 9/4/2013 | ||
Millions and millions of beautiful dots, without crashing your computer
|
Eric Fischer
www.mapbox.com |
Visualization | Article | 9/4/2013 | |
Alex Krusz
blog.velir.com |
Visualization | Article | 9/4/2013 | ||
Zachary Forest Johnson
indiemaps.com |
10/18/2011 | Visualization | Article | 9/4/2013 | |
Christopher Groskopf
blog.apps.npr.org |
Visualization | Guide | 9/4/2013 | ||
An behind-the-scenes explanation of how one of the best web visualizations ever was built.
|
Mike Bostock
source.mozillaopennews.org |
Visualization | Article | 9/4/2013 | |
How to use Google-provided election data to build a Fusion Table-powered map.
|
John Keefe
johnkeefe.net |
2012 | Visualization | Guide | 9/12/2013 |
With Fusion Tables and a little Ruby magic, John Keefe describes how he mapped contributions to presumptive NYC mayoral candidates by zip code by lunchtime.
|
John Keefe
johnkeefe.net |
Visualization | Guide | 9/12/2013 | |
“When we understand that slide, we’ll have won the war.”
|
Elisabeth Bumiller
www.nytimes.com |
Visualization | Article | 9/12/2013 | |
A visualization of what makes great visualizations at the New York Times
|
Erin Kissane
source.mozillaopennews.org |
Visualization | Article | 9/14/2013 | |
History's most memorable 3D-column chart.
|
Peter Norvig
www.norvig.com |
Visualization | Article | 9/14/2013 | |
A modern critique of Florence Nightingale's, "Diagram of the Causes of Mortality"
|
Henry Woodbury
dd.dynamicdiagrams.com |
Visualization | Article | 9/14/2013 | |
A (personal) blog of data sketches from the New York Times Graphics Department. Maintained by @KevinQ.
|
Kevin Quealy
chartsnthings.tumblr.com |
Visualization | Blog | 9/14/2013 | |
"Most of what we can do to make the world a better place involves, not doing the unprecedented, but doing what matters and what works, whether unprecedented or not. This might not be as exciting as the unprecedented, but it’s desperately needed."
|
Stephen Few
www.perceptualedge.com |
4/12/2010 | Visualization | Article | 9/18/2013 |
An elegant and compact way to encode a metric with comparative data.
|
Stephen Few
www.perceptualedge.com |
Visualization | Article | 9/18/2013 | |
This book was revolutionary when Tufte published it in 1982 and it remains just as relevant in our current state of PowerPoint and tacky CSS transitions.
|
Edward Tufte
www.amazon.com |
1982 | Visualization | Book | 9/18/2013 |
After you laugh, remember: he's a Nobel Prize winner.
|
Paul Krugman
krugman.blogs.nytimes.com |
Visualization | Article | 9/18/2013 | |
There's no shortage of ways to make data just a little bit easier to work with, and usually the effort is worth it.
|
Jeroen Janssens
jeroenjanssens.com |
9/16/2013 | Visualization | Guide | 9/19/2013 |
The importance of limiting colors in a visualization is one of the most un-obvious principles to grasp. And yet, once it's explained to you, you'll immediately look down on everyone else who failing to see the tackiness.
|
Robert Kosara
eagereyes.org |
Visualization | Article | 9/20/2013 | |
How to turn a PDF into a Leaflet-powered crime map.
|
Chris Essig
csessig.wordpress.com |
Visualization | Guide | 9/21/2013 | |
WTF indeed
|
wtfviz.net |
Visualization | Blog | 9/23/2013 | |
Is the Tufte-school of aesthetic too boring for our Jerry-Bruckheimer-corrupted brains?
|
The Interaction Lab at the University of Saskatchewan
hci.usask.ca |
Visualization | Study | 9/27/2013 | |
A rebuttal to the University of Saskatchewan paper on whether "chartjunk" is more effective in conveying data.
|
junkcharts.typepad.com |
Visualization | Article | 9/27/2013 | |
"Mapping is not value-free, dangers occur when professional cartographers and geographers attempt to map data from fields in which they are ignorant. "
|
Tom Koch
www.ph.ucla.edu |
Visualization | Article | 10/21/2013 | |
Edward Tufte
www.edwardtufte.com |
2007-07 | Visualization | Article | 10/25/2013 | |
A nice guide on to turn stock Excel charts into something Tufte-approved.
|
Cyrille Vincey
insights.qunb.com |
Visualization | Article | 10/25/2013 | |
Edward Tufte
www.edwardtufte.com |
Visualization | Article | 10/25/2013 | ||
Asaf Degani, NASA
ti.arc.nasa.gov |
Visualization | Reference | 10/27/2013 | ||
Duke University LibGuide
guides.library.duke.edu |
Visualization | Article | 10/27/2013 | ||
A nice list of visualization blunders, though the main reason I included this was for its funny title.
|
Eronarn
www.smashingmagazine.com |
Visualization | Article | 10/27/2013 | |
Wikipedia
en.wikipedia.org |
Visualization | Article | 10/27/2013 | ||
Even scientists have trouble with data visualizations
|
www.biostat.wisc.edu |
Visualization | Article | 10/27/2013 | |
Rick Ord
cseweb.ucsd.edu |
Visualization | Article | 10/27/2013 | ||
Julia Turner
www.slate.com |
Visualization | Article | 10/28/2013 | ||
Edward Tufte's influence arrived well before the Web (and yet still persists). Here's one of his relatively few reflections on web-based visualizations.
|
Edward Tufte
www.edwardtufte.com |
Visualization | Article | 11/18/2013 | |
Ranjitha Kumar, Arvind Satyanarayan, Cesar Torres, Maxine Lim, Salman Ahmad, Scott R. Klemmer, Jerry O. Talton
idl.cs.washington.edu |
2013 | Visualization | Paper | 11/27/2013 | |
Sharon Lin, Julie Fortuna, Chinmay Kulkarni, Maureen Stone, Jeffrey Heer
idl.cs.washington.edu |
Visualization | Paper | 11/27/2013 | ||
Steven Romalewski
spatialityblog.com |
9/29/2011 | Visualization | Article | 12/14/2013 | |
dabrownstein.wordpress.com |
Visualization | Guide | 1/30/2014 | ||
untappedcities.com |
Visualization | Article | 2/26/2014 | ||
source.opennews.org |
Visualization | Article | 2/26/2014 | ||
Albert Cairo
www.peachpit.com |
11/25/2013 | Visualization | Article | 3/10/2014 | |
Edward Tufte
www.edwardtufte.com |
2013-11 | Visualization | Article | 4/8/2014 | |
A hallmark essay on how data can be used in the very core of journalsitic operations, by the journalist who created the popular Django framework. Too bad his advice hasn't been much heeded.
|
Adrian Holovaty
www.holovaty.com |
9/6/2006 | Curation | Article | 9/4/2013 |
An influential economic analysis by two Harvard economists, based on a spreadsheet snafu? This paper and the articles that followed are a bag of laughs.
|
Thomas Herndon, Michael Ash, and Robert Pollin
www.peri.umass.edu |
4/15/2013 | Curation | Paper | 9/4/2013 |
Google Refine is now Open Refine and its interface and feature sets have improved quite a bit since I wrote this essay. But it's still a good overview of how a single digital tool can make a data-intensive investigation possible.
|
Daniel Nguyen
www.propublica.org |
12/30/2010 | Curation | Guide | 9/4/2013 |
The horror of scraping government data.
|
Jacob Harris
source.mozillaopennews.org |
Curation | Article | 9/4/2013 | |
A brilliant example of a (legal) hack to get public data.
|
Derek Willis
source.mozillaopennews.org |
Curation | Article | 9/4/2013 | |
A three-part series to just how complicated just doing data updates can be.
|
Adrian Holovaty
source.mozillaopennews.org |
3/11/2013 | Curation | Article | 9/4/2013 |
Etsy found interesting and practical ways to mash up the (non-HR-private) data of its staff.
|
Ian Malpass
codeascraft.com |
Curation | Article | 9/4/2013 | |
If you're new to SQL, or even if you're skilled at it, this is one of the best collections of virtually every half-way complicated query you can dream of.
|
Peter Brawley and Arthur Fuller
www.artfulsoftware.com |
Curation | Guide | 9/4/2013 | |
A clever example of how to use code and available technology to track variables in the "meatspace"
|
Sau Sheong Chang
blog.airbrake.io |
Curation | Guide | 9/4/2013 | |
Troy Thibodeaux
github.com |
Curation | Guide | 9/4/2013 | ||
Matt Might
matt.might.net |
Curation | Guide | 9/4/2013 | ||
A high-level description of the engineering used to mine and index 3.4 billion web pages in about 14 hours.
|
blog.luckyoyster.com |
Curation | Article | 9/4/2013 | |
A guide to practical data mining, collective intelligence, and building recommendation systems
|
Ron Zacharski
guidetodatamining.com |
Curation | Book | 9/4/2013 | |
Ricky Ho
horicky.blogspot.com.au |
Curation | Guide | 9/4/2013 | ||
Neal Caren
nealcaren.web.unc.edu |
Curation | Guide | 9/4/2013 | ||
If there's one data-related skill I wish I had learned years ago, it would be regular expressions.
|
Daniel Nguyen
regex.bastardsbook.com |
Curation | Book | 9/4/2013 | |
Taylor Dungjen
www.toledoblade.com |
Curation | Article | 9/4/2013 | ||
John Keefe
source.mozillaopennews.org |
Curation | Article | 9/4/2013 | ||
Easily one of the most esoteric and brilliant crowdsourcing data projects yet conceived.
|
WNYC and Radiolab
project.wnyc.org |
Curation | Guide | 9/4/2013 | |
Irene Ros and Nathan Matias
source.mozillaopennews.org |
Curation | Article | 9/4/2013 | ||
This says everything you need to know about the complexity and annoyance of real-life data: "For example, in the Senate it is possible for the Majority Leader and Minority Leader to alter the rules of math when it comes to how many senators constitute a three-fifths majority."
|
Derek Willis
thescoop.org |
2/4/2012 | Curation | Article | 9/12/2013 |
Citizen data journalism at its best, from John Krauss: "I already knew that they were finally releasing -- after the Council forced them to -- crash data as idiotically obfuscated PDFs, but reading that they justified this out of concern for "the integrity of the data," was so galling that it goaded me into action. I would make the data accessible as friendly, parseable CSVs."
|
John Krauss
blog.accursedware.com |
2/16/2012 | Curation | Article | 9/12/2013 |
When it comes to large data-projects involving messy data, don't always get hung up on the technology to use. Coming up with an efficient way to divide and distribute the data collection and cleaning, even within your own newsroom, can often do the job
|
John Bones
ire.org |
Curation | Article | 9/18/2013 | |
If you've ever tried to scrape a non-trivial website, you'll understand why this book is 80,000+ words long.
|
Paul Bradshaw
leanpub.com |
Curation | Book | 9/18/2013 | |
Quick, light-weight ways to reshape and visualize data from the command-line.
|
Seth Brown
www.drbunsen.org |
12/3/2012 | Curation | Article | 9/19/2013 |
SQL joins illustrated with Venn diagrams
|
Jeff Atwood
www.codinghorror.com |
10/11/2007 | Curation | Article | 9/19/2013 |
Mmmm...munging
|
Trevor Muñoz
trevormunoz.com |
8/19/2013 | Curation | Guide | 9/20/2013 |
Noah Veltman
github.com |
Curation | Guide | 9/27/2013 | ||
A short readable guide to one of the most vital and fundamental data journalism tools.
|
MaryJo Webster
extra.twincities.com |
2010 | Collection | Article | 9/4/2013 |
A wonderfully clever way to use computer-vision and a webcam for in-the-wild text recognition
|
Arnab Nandi
arnab.org |
Collection | Guide | 11/5/2013 | |
A comprehensive guide to properly using indices in SQL databases: a must for any developer dealing with more than a few million rows of data.
|
Markus Winand
use-the-index-luke.com |
Collection | Book | 11/11/2013 | |
A free book written by one of the best R programmers around, aimed at experienced programmers wanting to use R as a programming language.
|
Hadley Wickham
adv-r.had.co.nz |
Collection | Book | 11/18/2013 | |
A long list of slides about statistical data mining and foundational principles of statistics.
|
Andrew Moore
www.autonlab.org |
Collection | Guide | 11/19/2013 | |
Dan Kois
www.slate.com |
12/12/2013 | Collection | Article | 12/13/2013 | |
Jeremy B. Merrill
www.propublica.org |
3/25/2013 | Collection | Article | 12/16/2013 | |
This document is aimed towards game developers, but it is the most thorough explanation of how to best incorporate creative writing into a dynamic code-generated experience, while giving writers the most degree of freedom possible. It's a great examination of the core facets of good writing and goes deep into the technical implementation.
|
Elan Ruskin
www.valvesoftware.com |
2012 | Collection | Article | 12/27/2013 |
Besides data-munging, taxonomy is probably the most underappreciated data skill.
|
Stjin Debrouwere
stdout.be |
4/7/2010 | Collection | Article | 1/8/2014 |
Even nuclear historians write web scrapers.
|
Alex Wellerstein
blog.nuclearsecrecy.com |
12/27/2013 | Collection | Article | 1/13/2014 |
Thiago Valverde
blog.valverde.me |
Collection | Guide | 1/23/2014 | ||
Predictably, the answer is very complicated and involves calculus.
|
Michael Nielsen
www.michaelnielsen.org |
1/23/2012 | Analysis | Article | 9/4/2013 |
A short list of statistical snafus as applied to data journalism.
|
Dave Stanton
source.mozillaopennews.org |
7/25/2013 | Analysis | Article | 9/4/2013 |
A nice explanation of selection bias, in the context of Twitter and other social media.
|
Jacob Harris
source.mozillaopennews.org |
7/18/2013 | Analysis | Article | 9/4/2013 |
Ricky Ho
horicky.blogspot.com |
Analysis | Guide | 9/4/2013 | ||
Alexandru Nedelcu
www.bionicspirit.com |
Analysis | Guide | 9/4/2013 | ||
An introduction to Probability and Statistics for Python programmers.
|
Allen B. Downey
www.greenteapress.com |
Analysis | Book | 9/4/2013 | |
An exploration of Bayesian statistics, using the Python programming language
|
Allen B. Downey
www.greenteapress.com |
Analysis | Book | 9/4/2013 | |
Edward Tufte
www.edwardtufte.com |
Analysis | Book | 9/4/2013 | ||
Sonya Song
sonya2song.blogspot.com |
Analysis | Article | 9/4/2013 | ||
A great explainer of how to implement real metrics to the field of news applications.
|
Brian Abelson
brianabelson.com |
Analysis | Article | 9/4/2013 | |
Elizabeth A. Stuart
biostat.jhsph.edu |
Analysis | Guide | 9/4/2013 | ||
Gary King
dash.harvard.edu |
Analysis | Guide | 9/4/2013 | ||
Harlan Harris, Sean Murphy, Marck Vaisman
oreilly.com |
Analysis | Book | 9/4/2013 | ||
Jonathan Stray
source.mozillaopennews.org |
Analysis | Article | 9/4/2013 | ||
The master of presidential predictions explains why he's not so great at predicting Oscar winners.
|
Nate Silver
fivethirtyeight.blogs.nytimes.com |
Analysis | Article | 9/14/2013 | |
Simulations don't yet have much place in journalism, but I like this analysis of laundry room behavior because it acknowledges the limitation of analysis and the impact of assumptions, as well as intuitively walking through the design of the simulation (with Julia code snippets). Also, it tackles a question near to all of us.
|
Iain Dunning
iaindunning.com |
Analysis | Guide | 9/15/2013 | |
Ryan Holiday
www.forbes.com |
Analysis | Article | 9/15/2013 | ||
"Merely knowing who people are, paradoxically, isn’t very predictive of their movie tastes."
|
Clive Thompson
www.nytimes.com |
Analysis | Article | 9/16/2013 | |
The final product of this fashion interactive is of course a visualization. But Hinton's writeup is more about the analysis used to "fingerprint" each fashion pohoto.
|
Erik Hinton
source.mozillaopennews.org |
Analysis | Article | 9/18/2013 | |
Saunders's blog is fascinating to me, as he describes his struggles (and triumphs) as a hobbyist programmer in a field - bioinformatics - that you'd think would embrace his type of coding skills. Here, he shows the code needed to analyze the writing tics found in 47 gigabytes of PubMed abstracts.
|
Neil Saunders
nsaunders.wordpress.com |
Analysis | Guide | 9/18/2013 | |
Jeff Leek
simplystatistics.org |
Analysis | Article | 9/18/2013 | ||
This is both an interesting read about sex and relationships as well as something that should pique your interest about how online services measure your behavior.
|
Christian Rudder
blog.okcupid.com |
Analysis | Article | 9/18/2013 | |
Truman Collins
www.tkcs-collins.com |
Analysis | Guide | 9/18/2013 | ||
Because even his elite engineering and mathematics friends didn't understand how easy it is to make a spell-checker, Norvig explains the theory and shows the less-than-a-page-worth of code needed to achieve 80 to 90% accuracy.
|
Peter Norvig
norvig.com |
Analysis | Guide | 9/18/2013 | |
Here's a way to look at data science as five distinct steps. Also, it's the first I've heard of the awesome acronym, OSEMN.
|
Hilary Mason and Chris Wiggins
www.dataists.com |
9/25/2010 | Analysis | Article | 9/19/2013 |
After a triumphant night of presidential election predictions, Nate Silver rubs it in by examining how the vairous pollsters did in their predictions.
|
Nate Silver
fivethirtyeight.blogs.nytimes.com |
Analysis | Article | 9/19/2013 | |
"This textbook is intended to provide a comprehensive introduction to forecasting methods and present enough information about each method for readers to use them sensibly. We don’t attempt to give a thorough discussion of the theoretical details behind each method, although the references at the end of each chapter will fill in many of those details."
|
Rob J. Hyndman and George Athanasopoulos
otexts.com |
Analysis | Book | 9/19/2013 | |
Andrey Fradkin, Riley Newman, Rebecca Rosenfelt
nerds.airbnb.com |
7/22/2013 | Analysis | Article | 9/20/2013 | |
A brief overview geared toward engineers with no prior knowledge of machine learning.
|
Hilary Mason
www.hilarymason.com |
Analysis | Video | 9/20/2013 | |
A health-care themed example of how data analysis can discover counterintuitive solutions.
|
Atul Gawande
www.newyorker.com |
Analysis | Article | 9/20/2013 | |
One of the classics of scientific papers, influential and controversial to this day.
|
John P. A. Ioannidis
www.plosmedicine.org |
Analysis | Article | 9/20/2013 | |
"What’s actually going on is that the division between red and blue America is mostly about a split among richer voters."
|
Andrew Gelman and Avi Feller
campaignstops.blogs.nytimes.com |
Analysis | Article | 9/20/2013 | |
Andrew Gelman, Boris Shor, Joseph Bafumi, David Park
www.stat.columbia.edu |
Analysis | Article | 9/20/2013 | ||
Kieran Healy
kieranhealy.org |
6/9/2013 | Analysis | Article | 9/23/2013 | |
Ali Almossawi
bookofbadarguments.com |
Analysis | Book | 10/1/2013 | ||
Jason Liszka
jliszka.github.io |
10/1/2013 | Analysis | Article | 10/1/2013 | |
"Rap’s history has been traced many ways -- through books, documentaries, official compilations, DJ mixes, university archives, even parties. But until now you haven’t been able to look at the development of the genre through its building blocks: the actual words used by emcees."
|
ATodd & Rap Genius Engineering Team
news.rapgenius.com |
Analysis | Article | 10/3/2013 | |
Chase Davis
source.mozillaopennews.org |
10/4/2013 | Analysis | Article | 10/4/2013 | |
This essay's theme encompasses the frustration of engineers, the wild imaginations of product managers, and the proper use of statistics.
|
Dan McKinley
mcfunley.com |
Analysis | Article | 10/8/2013 | |
Jeanne Pi
www.appsblogger.com |
Analysis | Article | 10/8/2013 | ||
Dan Remenyi, George Onofrei, Joe English
academic-publishing.org |
Analysis | Article | 10/25/2013 | ||
A free online guide to the most popular statistical errors committed by working scientists and analysts.
|
Alex Reinhart
www.refsmmat.com |
Analysis | Book | 11/5/2013 | |
Another generously-made-free book on data science.
|
R.H. Shumway & D.S. Stoffer
www.stat.pitt.edu |
Analysis | Book | 11/11/2013 | |
Another detailed write-up about web analytics and modern news organizations by OpenNews Fellow, Brian Abelson
|
Brian Abelson
brianabelson.com |
11/14/2013 | Analysis | Article | 11/14/2013 |
"The states where a single vote was most likely to matter are New Mexico, Virginia, New Hampshire, and Colorado, where your vote had an approximate 1 in 10 million chance of determining the national election outcome. On average, a voter in America had a 1 in 60 million chance of being decisive in the presidential election."
|
Andrew Gelman, Nate Silver, Aaron Edlin
www.nber.org |
2009-08 | Analysis | Article | 11/18/2013 |
A primer to the latest buzzword in big data and artificial intelligence.
|
Markus Beissinger
markus.com |
11/14/2013 | Analysis | Article | 11/18/2013 |
One of a series of posts tackling the eternal, "If correlation isn't causation, than what is?"
|
Robin Evans
itsastatlife.blogspot.com |
Analysis | Article | 11/18/2013 | |
It's not what you say but who you like.
|
Carter Jernigan and Behram F.T. Mistree.
firstmonday.org |
7/26/2009 | Analysis | Paper | 11/18/2013 |
"As far as I'm concerned, Gould's The Median Isn't the Message is the wisest, most humane thing ever written about cancer and statistics. It is the antidote both to those who say that, "the statistics don't matter," and to those who have the unfortunate habit of pronouncing death sentences on patients who face a difficult prognosis"
|
Stephen Jay Gould
www.edwardtufte.com |
Analysis | Article | 11/18/2013 | |
Dan Birken
danbirken.com |
11/19/2013 | Analysis | Article | 11/20/2013 | |
Correlation doesn't equal causation, but the strategy behind this analysis does a good job to isolate the hidden factors here.
|
Ken Shirriff
www.righto.com |
11/18/2013 | Analysis | Article | 11/27/2013 |
Brian Burke and Kevin Quealy
www.nytimes.com |
12/3/2013 | Analysis | Article | 12/3/2013 | |
David Romer
elsa.berkeley.edu |
2006-04 | Analysis | Paper | 12/3/2013 | |
Megan McArdle
www.bloomberg.com |
12/4/2013 | Analysis | Article | 12/13/2013 | |
Hilary Parker
hilaryparker.com |
1/30/2013 | Analysis | Article | 12/16/2013 | |
In a tournament of freestyle chess, the winners are not a human grandmaster, nor the best supercomputer, or even a grandmaster with a supercomputer, but amateurs who best know how to use their mediocre computers.
|
Garry Kasparov
www.nybooks.com |
2/11/2010 | Analysis | Article | 12/22/2013 |
This paper discusses how the current standards in psychology research make it more likely for false positives to be found.
|
Joseph P. Simmons, Leif D. Nelson, and Uri Simonsohn
www.socio.mta.hu |
3/7/2011 | Analysis | Paper | 1/2/2014 |
I don't know what's more astounding: that 30 years ago, before computers were in our homes, Arthur Luerhmann could so clearly describe the difference between being taught by a computer and learning to teach the computer, or that 30 years later, so few people even realize there's a difference.
|
Arthur Luehrmann
www.citejournal.org |
1972 | Analysis | Article | 1/8/2014 |
Arthur Luehrmann
www.citejournal.org |
2002 | Analysis | Article | 1/8/2014 | |
Felix Salmon
www.wired.com |
1/7/2014 | Analysis | Article | 1/8/2014 | |
It turns out that the work of data journalism and startup founders wanting to appease Google are very much alike. Excellent write up and source code to check out here.
|
Rap Genius Founders
news.rapgenius.com |
1/2/2013 | Analysis | Article | 1/8/2014 |
Mark Zachry and Charlotte Thralls with Edward Tufte
www.edwardtufte.com |
2004 | Analysis | Article | 1/8/2014 | |
I think the metaphor of "dry lab" and "wet lab" work applies to journalism too.
|
Roberta Kwok
www.nature.com |
12/11/2013 | Analysis | Article | 1/13/2014 |
www.slate.com |
Analysis | Article | 1/24/2014 | ||
Dr. Michael C. Labossiere
www.nizkor.org |
Analysis | Guide | 1/31/2014 | ||
"An astonishing act of statistical chutzpah in the Indiana schools’ grade-changing scandal."
|
Jordan Ellenberg
www.slate.com |
Analysis | Article | 2/1/2014 | |
"P values, the 'gold standard' of statistical validity, are not as reliable as many scientists assume."
|
Regina Nuzzo
www.nature.com |
2/12/2014 | Analysis | Article | 2/12/2014 |
andrewgelman.com |
Analysis | Article | 3/6/2014 | ||
www.stat.columbia.edu |
Analysis | Paper | 3/6/2014 | ||
Kevin Quealy
kpq.github.io |
3/4/2014 | Analysis | Article | 3/7/2014 | |
Eddie Bell
developers.lyst.com |
2/22/2014 | Analysis | Article | 3/10/2014 |