The Art and Science of Data-Driven Journalism

Понравилась презентация – покажи это...

Слайд 1

The Art and Science of Data-Driven Journalism Alexander B. Howard Tow Fellow, Columbia University May 30, 2014

Слайд 2

You know something, John Snow.

Слайд 3

This John Snow knew something.

Слайд 4

Newspapers have used data for centuries Source: The Guardian

Слайд 5

1960s: computer-assisted reporting (CAR) Bob Woodward, via Cliff1066

Слайд 6

Traditional tools applying tech to journalism… Calculators and Graphs Mainframe and PCs Spreadsheets Databases Text and code editors Statistics Programming

Слайд 7

In the 1990s, government and civil society spread the Internet globally

Слайд 8

In the 2000s, mobile phones and social networking connected us ever more

Слайд 9

In the 2010s, data creation exploded. Image Credit: Real Time Rome from Senseable.MIT.edu

Слайд 10

“Data-driven journalism is the future” Source: Tim Berners-Lee in the Guardian

Слайд 11

…combined with new tools & context… Online spreadsheets and wikis Data visualization tools Open source frameworks Code sharing Agile development Cloud storage and processing (EC2 & Heroku) More data and more access Privacy and security riskss

Слайд 12

2014: data journalism is the present Gathering, cleaning, organizing, analyzing, visualizing and publishing data to support the creation of acts of journalism

Слайд 13

Слайд 14

Trendy but not new The collection, protection and interrogation of data as a source, complementing traditional “shoe leather” investigative reporting relying on witnesses, experts and authorities

Слайд 15

Слайд 16

Dollars for Docs

Слайд 17

The Guardian

Слайд 18

Chicago Tribune Flame retardants

Слайд 19

Слайд 20

A tangled web

Слайд 21

Слайд 22

Los Angeles Times

Слайд 23

Слайд 24

La Nacion

Слайд 25

Reuters: Connected China

Слайд 26

Слайд 27

Слайд 28

Слайд 29

Best practices?

Слайд 30

Report it out

Слайд 31

Слайд 32

Show people something new about the world

Слайд 33

Слайд 34

Tell a story

Слайд 35

Center for Public Integrity

Слайд 36

Storytelling still matters. “We use these tools to find and tell stories. We use them like we use a telephone. The story is still the thing.” - Anthony DeBarros USA Today Source: Data Journalism and the Big Picture

Слайд 37

Make it personal

Слайд 38

Слайд 39

Understand the context for the data

Слайд 40

Слайд 41

Show your data

Слайд 42

Слайд 43

Show your work

Слайд 44

Слайд 45

Share your code

Слайд 46

Слайд 47

Consider ethics

Слайд 48

Questions Is the data clean? Is the data representative? What biases might be hidden in the data? Was the data legally obtained? Does the data contain personally identifiable information (PII)?

Слайд 49

Collection Who gathered the data? How? Was it clear how data would be used? Can people opt-out of collection or usage? “Notice and consent” is not enough “Privacy by design” applies to news apps

Слайд 50

Слайд 51

Data Analysis & Numeracy N = ? Average vs Median Statistical significance? Correlation != causation Regression to the mean

Слайд 52

Слайд 53


Слайд 54

Bad Data Viz wtfviz.net

Слайд 55

Present data with context, in context

Слайд 56

Be aware of de-anonymization risks

Слайд 57

Emerging trends

Слайд 58


Слайд 59

Networked reporting of corruption ICIJ: Offshore Leaks

Слайд 60

International Consortium of Investigative Journalists Offshoring $ 80 journalists 40 countries 260 gigabytes 2.5 million files

Слайд 61

Create your data “If Stage 1 of data journalism was “find and scrape data,” then… Stage 2 was “ask government agencies to release data” in easy to use formats. Stage 3 is going to be “make your own data”, and those sources of data are going to be automated and updated in real-time.” -Javaun Moradi, Mozilla

Слайд 62

Safecast open source Geiger counter

Слайд 63

Networked accountability

Слайд 64

Bus route in Nairobi, Kenya

Слайд 65

Sensor Journalism

Слайд 66

Слайд 67

Слайд 68

Citizens as Sensors: Andhra Pradesh

Слайд 69

Drones + data collection

Слайд 70

Privacy challenges

Слайд 71

Слайд 72

Open Data, FOIA & Press Freedom

Слайд 73

An expanding number of data sources

Слайд 74

Слайд 75

Слайд 76

Social data and crisis data

Слайд 77

Open government data platforms

Слайд 78

Слайд 79

Слайд 80

Fauxpen Data In an age of “openwashing”… We need to: Evaluate licenses. Peruse the Terms of Service. Review the governance. Look at community. Check the format.

Слайд 81

Слайд 82

Слайд 83

Center for Public Integrity

Слайд 84

Accountability for “personalized redlining” Gun map graphic

Слайд 85

Transparency for geographic profiling Gun map graphic WSJ: Websites vary prices, based upon user information

Слайд 86

Monitoring predictive policing Gun map graphic Verge: Chicago crime and profiling Geekwire: Predictive Policing

Слайд 87

Investigating human tissue trafficking Gun map graphic ICIJ: The data behind skin and bone

Слайд 88

Data + journalism + activism + responsive institutions = social change

Слайд 89

The fun part: predictions, prognostications and recommendations!

Слайд 90

1) Data will become even more of a strategic resource for media.

Слайд 91

2) Better tools will emerge that democratize data skills.

Слайд 92

3) News apps will explode as a primary way people consume data journalism.

Слайд 93

4) Being digital first means being data-centric and mobile-friendly.

Слайд 94

5. Expect more robo-journalism. Human relationships and storytelling still matter.

Слайд 95

6) More journalists will need to study the social sciences and statistics. Source: Ed Yong

Слайд 96

7) There will be higher standards for accuracy and corrections. Source: Jake Harris

Слайд 97

8) Competency in security and data protection will become more important. Source: Jake Harris

Слайд 98

9) Demand for more transparency on reader data collection and use. Source: eConsultancy

Слайд 99

10) More conflicts over public records, data scraping, and ethics will arise. Gun map graphic

Слайд 100

12) Data-driven personalization and predictive news in wearables.

Слайд 101

13) More diverse newsrooms will produce better (data) journalism. SOURCE: The Atlantic A 2013 ASNE survey of 68 online news organizations found that 63% of them had no minorities.

Слайд 102

14) Be mindful of data-ism and bad data. Embrace skepticism.