| Introduction |
|
xxvii | |
|
|
|
1 | (166) |
|
|
|
3 | (14) |
|
Basic Concepts in Data Analysis |
|
|
3 | (6) |
|
What Is a Random Variable? |
|
|
9 | (4) |
|
|
|
13 | (4) |
|
Chapter 2 Was the 1969 Draft Lottery Fair? |
|
|
17 | (1) |
|
|
|
17 | (1) |
|
|
|
18 | (2) |
|
|
|
20 | |
|
Chapter 3 Who Won the 2000 Election: Bush or Gore? |
|
|
23 | (1) |
|
Projecting the Undervotes |
|
|
24 | (1) |
|
What Happened with the Overvotes? |
|
|
25 | (1) |
|
|
|
25 | (3) |
|
|
|
28 | (3) |
|
Chapter 4 Was Liverpool Over Barcelona the Greatest Upset in Sports History? |
|
|
31 | (4) |
|
How Should We Rank Upsets? |
|
|
31 | (1) |
|
Leicester Wins the 2015-2016 Premier League |
|
|
32 | (1) |
|
#16 Seed UMBC Beats #1 Seed Virginia |
|
|
33 | (1) |
|
The Jets Win Super Bowl III |
|
|
33 | (1) |
|
|
|
34 | (1) |
|
Chapter 5 How Did Bernie Madoff Keep His Fund Going? |
|
|
35 | (10) |
|
The Mathematics of Ponzi Schemes |
|
|
36 | (1) |
|
Madoff's Purported Strategy |
|
|
37 | (2) |
|
The Sharpe Ratio Proves Madoff Was a Fraud |
|
|
39 | (1) |
|
Benford's Law and Madoff's Fraud |
|
|
40 | (1) |
|
|
|
41 | (4) |
|
Chapter 6 Is the Lot of the American Worker Improving? |
|
|
45 | (8) |
|
Is U.S. Family Income Skewed? |
|
|
45 | (1) |
|
Median Income and Politics |
|
|
46 | (2) |
|
Causes of Increasing U.S. Income Inequality |
|
|
48 | (2) |
|
Money Isn't Everything: The Human Development Index |
|
|
50 | (1) |
|
Create Your Own Ranking of Well-Being |
|
|
50 | (1) |
|
Are Other Countries Catching Up to the U.S.? |
|
|
51 | (1) |
|
|
|
52 | (1) |
|
Chapter 7 Measuring Income Inequality with the Gini, Palma, and Atkinson Indices |
|
|
53 | (8) |
|
|
|
53 | (3) |
|
|
|
56 | (1) |
|
|
|
57 | (2) |
|
|
|
59 | (2) |
|
Chapter 8 Modeling Relationships Between Two Variables |
|
|
61 | (12) |
|
Examples of Relationships Between Two Variables |
|
|
61 | (1) |
|
Finding the Best-Fitting (Least Squares) Line |
|
|
62 | (1) |
|
Computing the Beta of a Stock |
|
|
63 | (1) |
|
|
|
64 | (1) |
|
|
|
65 | (2) |
|
We Are Not Living in a Linear World |
|
|
67 | (2) |
|
|
|
69 | (4) |
|
Chapter 9 Intergenerational Mobility |
|
|
73 | (12) |
|
Absolute Intergenerational Mobility |
|
|
74 | (1) |
|
Intergenerational Elasticity |
|
|
74 | (1) |
|
|
|
75 | (1) |
|
Comparing IGE and Rank-Rank Mobility |
|
|
75 | (3) |
|
Measuring Mobility with Quintiles |
|
|
78 | (2) |
|
|
|
80 | (2) |
|
|
|
82 | (3) |
|
Chapter 10 Is Anderson Elementary School a Bad School? |
|
|
85 | (4) |
|
How Can We Adjust for Family Income? |
|
|
86 | (1) |
|
Estimating the Least Squares Line |
|
|
86 | (1) |
|
Can We Compare Standardized Test Performance for Students in Different States? |
|
|
86 | (1) |
|
|
|
87 | (2) |
|
Chapter 11 Value-Added Assessments of Teacher Effectiveness |
|
|
89 | (8) |
|
Simple Gain Score Assessment |
|
|
90 | (1) |
|
Covariate Adjustment Assessment |
|
|
91 | (1) |
|
|
|
91 | (1) |
|
Cross-Classified Constant Growth Assessment |
|
|
91 | (2) |
|
|
|
93 | (1) |
|
How Much Is a Good Teacher Worth? |
|
|
94 | (1) |
|
|
|
95 | (2) |
|
Chapter 12 Berkeley, Buses, Cars, and Planes |
|
|
97 | (6) |
|
Simpson's Paradox and College Admissions |
|
|
98 | (2) |
|
|
|
100 | (1) |
|
When Is the Average of 40 and 80 Not 60? |
|
|
100 | (1) |
|
Why Pre-COVID Were There Never Empty Seats on My Flight? |
|
|
101 | (1) |
|
|
|
101 | (2) |
|
Chapter 13 Is Carmelo Anthony a Hall of Famer? |
|
|
103 | (6) |
|
What Metric Defines Basketball Ability? |
|
|
104 | (1) |
|
Wins Above Replacement Player (WARP) |
|
|
105 | (1) |
|
Manu, Melo, Dirk, and Dwayne |
|
|
106 | (1) |
|
How Do 25,000 Points Lead to So Few Wins? |
|
|
106 | (3) |
|
Chapter 14 Was Derek Jeter a Great Fielder? |
|
|
109 | (6) |
|
Fielding Statistics: The First Hundred Years |
|
|
109 | (1) |
|
|
|
110 | (1) |
|
The Fielding Bible: A Great Leap Forward |
|
|
111 | (1) |
|
|
|
112 | (3) |
|
Chapter 15 "Drive for Show and Putt for Dough?" |
|
|
115 | (2) |
|
|
|
115 | (1) |
|
|
|
116 | (1) |
|
Chapter 16 What's Wrong with the NFL QB Rating? |
|
|
117 | (10) |
|
|
|
117 | (7) |
|
ESPN's Total Quarterback Rating |
|
|
124 | (1) |
|
|
|
125 | (2) |
|
Chapter 17 Some Sports Have All the Luck |
|
|
127 | (4) |
|
Skill vs. Luck: The Key Idea |
|
|
127 | (2) |
|
|
|
129 | (2) |
|
Chapter 18 Gerrymandering |
|
|
131 | (12) |
|
|
|
132 | (4) |
|
The Mathematics of Gerrymandering |
|
|
136 | (7) |
|
Chapter 19 Evidence-Based Medicine |
|
|
143 | (8) |
|
James Lind and Scurvy: The Birth of Evidence-Based Medicine |
|
|
143 | (2) |
|
The Randomized Streptomycin Tuberculosis Trial |
|
|
145 | (1) |
|
|
|
146 | (2) |
|
Hormone Replacement: Good or Bad? |
|
|
148 | (3) |
|
Chapter 20 How Do We Compare Hospitals? |
|
|
151 | (8) |
|
|
|
152 | (4) |
|
|
|
156 | (1) |
|
|
|
157 | (2) |
|
Chapter 21 What Is the Worst Health Care Problem in My Country? |
|
|
159 | (8) |
|
Disability-Adjusted Life Years |
|
|
159 | (1) |
|
Determination of Disability Weights |
|
|
160 | (2) |
|
To Age Weight or Discount, That Is the Question |
|
|
162 | (1) |
|
Key Facts About World Health |
|
|
163 | (4) |
|
Part II What Will Happen? |
|
|
167 | (116) |
|
Chapter 22 Does a Mutual Fund's Past Performance Predict Future Performance? |
|
|
169 | (12) |
|
|
|
170 | (1) |
|
|
|
170 | (1) |
|
Risk-Adjusting Fund Returns |
|
|
171 | (4) |
|
How Well Do Morningstar Star Ratings Predict a Fund's Future Performance? |
|
|
175 | (2) |
|
The Effect of Expense Ratio on Long-Term Performance |
|
|
177 | (1) |
|
|
|
178 | (3) |
|
Chapter 23 Is Vegas Good at Picking NFL Games? |
|
|
181 | (10) |
|
|
|
181 | (3) |
|
|
|
184 | (1) |
|
Vegas Forecasts Are Unbiased |
|
|
185 | (3) |
|
Totals Predictions and Money Line Predictions Are Unbiased |
|
|
188 | (1) |
|
NFL Accuracy: The Line vs. the Computers |
|
|
188 | (1) |
|
A System Works Until It Doesn't |
|
|
189 | (2) |
|
Chapter 24 Will My New Hires Be Good Employees? |
|
|
191 | (8) |
|
What Data Do We Need to Determine Attributes That Best Predict Employee Performance? |
|
|
192 | (4) |
|
Besides GMA, Not Much Affects Job Performance |
|
|
196 | (1) |
|
|
|
197 | (2) |
|
Chapter 25 Should I Go to State U or Princeton? |
|
|
199 | (4) |
|
Analyzing Princeton vs. Penn State |
|
|
200 | (2) |
|
|
|
202 | (1) |
|
Chapter 26 Will My Favorite Sports Team Be Great Next Year? |
|
|
203 | (6) |
|
Francis Galton and Regression to the Mean |
|
|
203 | (1) |
|
Regression to the Mean in the NFL and the NBA |
|
|
204 | (3) |
|
|
|
207 | (2) |
|
Chapter 27 How Did Central Bankers Fail to Predict the 2008 Recession? |
|
|
209 | (10) |
|
|
|
210 | (1) |
|
The Sahm Rule: Early Warning Signal for Recession |
|
|
211 | (1) |
|
Control Charts and the Housing Price/Rent Ratio |
|
|
211 | (4) |
|
|
|
215 | (4) |
|
Chapter 28 How Does Target Know If You're Pregnant? |
|
|
219 | (6) |
|
What Available Data Can Be Used to Identify Pregnant Women? |
|
|
220 | (1) |
|
|
|
220 | (1) |
|
An Example of a Pregnancy Prediction Score |
|
|
221 | (4) |
|
Chapter 29 How Does Netflix Recommend Movies and TV Shows? |
|
|
225 | (8) |
|
User-Based Collaborative Filtering |
|
|
226 | (3) |
|
|
|
229 | (4) |
|
Chapter 30 Can We Predict Heart Attacks in Real Time? |
|
|
233 | (8) |
|
|
|
234 | (1) |
|
Sensitivity and Specificity |
|
|
235 | (1) |
|
|
|
235 | (2) |
|
Back to the Apple Heart Study |
|
|
237 | (2) |
|
|
|
239 | (2) |
|
Chapter 31 Is Proactive Policing Effective? |
|
|
241 | (8) |
|
|
|
242 | (1) |
|
|
|
243 | (1) |
|
|
|
244 | (1) |
|
|
|
244 | (2) |
|
|
|
246 | (1) |
|
|
|
247 | (2) |
|
Chapter 32 Guess How Many Are Coming to Dinner? |
|
|
249 | (10) |
|
Which Parameters Must Be Estimated? |
|
|
250 | (2) |
|
|
|
252 | (1) |
|
|
|
253 | (1) |
|
Which Factor Really Matters? |
|
|
254 | (1) |
|
|
|
254 | (5) |
|
Chapter 33 Can Prediction Markets Predict the Future? |
|
|
259 | (6) |
|
Examples of Trade Contracts |
|
|
260 | (1) |
|
Prediction Market Trading Mechanisms |
|
|
261 | (1) |
|
Accuracy of Prediction Markets and Wisdom of Crowds |
|
|
262 | (3) |
|
Chapter 34 The ABCs of Polling |
|
|
265 | (8) |
|
Why Are 1,112 People Enough to Represent U.S. Voters? |
|
|
265 | (2) |
|
Why Doesn't a Larger Population Require a Larger Sample Size? |
|
|
267 | (1) |
|
|
|
268 | (3) |
|
|
|
271 | (2) |
|
Chapter 35 How Did Buzzfeed Make the Dress Go Viral? |
|
|
273 | (4) |
|
Measuring Instagram Engagement |
|
|
274 | (1) |
|
Tweets Do Not Always Go Viral Immediately |
|
|
274 | (1) |
|
Do the First Few Days Predict the Future of a Meme? |
|
|
275 | (2) |
|
Chapter 36 Predicting Game of Thrones TV Ratings |
|
|
277 | (6) |
|
What Does Google Trends Tell Us? |
|
|
277 | (1) |
|
Predicting the Present with Google Trends |
|
|
278 | (1) |
|
Using Google Trends to Forecast GOT Ratings |
|
|
279 | (2) |
|
|
|
281 | (2) |
|
Part III Why Did It Happen? |
|
|
283 | (68) |
|
Chapter 37 Does Smoking Cause Lung Cancer? |
|
|
285 | (6) |
|
Correlation and Causation Redux |
|
|
285 | (1) |
|
|
|
286 | (1) |
|
Could Air Pollution Have Caused Lung Cancer? |
|
|
287 | (1) |
|
The Cigarette Companies Hit Back |
|
|
287 | (1) |
|
|
|
288 | (3) |
|
Chapter 38 Why Are the Houston Rockets a Good Basketball Team? |
|
|
291 | (6) |
|
|
|
292 | (3) |
|
Zach LaVine Battles the Bulls' Analytics Department |
|
|
295 | (1) |
|
|
|
296 | (1) |
|
|
|
296 | (1) |
|
Chapter 39 Why Have Sacrifice Bunts and Intentional Walks Nearly Disappeared? |
|
|
297 | (4) |
|
|
|
298 | (1) |
|
Bunting Against the Shift |
|
|
299 | (1) |
|
Why Are Intentional Walks on the Decline? |
|
|
300 | (1) |
|
Chapter 40 Do NFL Teams Pass Too Much and Go for It Often Enough on Fourth Down? |
|
|
301 | (6) |
|
|
|
301 | (2) |
|
|
|
303 | (1) |
|
New Data Partially Vindicates the Coaches |
|
|
304 | (2) |
|
Teams Should Go for Two More Often |
|
|
306 | (1) |
|
Chapter 41 What Caused the 1854 London Cholera Outbreak? |
|
|
307 | (6) |
|
|
|
307 | (1) |
|
Snow and the Broad Street Pump |
|
|
308 | (2) |
|
Snow's Randomized Controlled Trial |
|
|
310 | (1) |
|
|
|
311 | (1) |
|
|
|
312 | (1) |
|
Chapter 42 What Affects the Sales of a Retail Product? |
|
|
313 | (6) |
|
|
|
313 | (2) |
|
Estimating the Model Parameters |
|
|
315 | (1) |
|
|
|
316 | (3) |
|
Chapter 43 Why Does the Pareto Principle Explain So Many Things? |
|
|
319 | (8) |
|
|
|
320 | (2) |
|
Why Do Incomes Follow the Pareto Principle? |
|
|
322 | (1) |
|
Why Do a Few Websites Get Most of the Hits? |
|
|
323 | (1) |
|
|
|
324 | (3) |
|
Chapter 44 Does Where You Grow Up Matter? |
|
|
327 | (6) |
|
Quasi-Experimental Design vs. Randomized Controlled Trials |
|
|
328 | (1) |
|
What Drives Neighborhood Differences in Upward Mobility? |
|
|
329 | (1) |
|
How Can We Make Things Better? |
|
|
330 | (3) |
|
Chapter 45 The Waiting Is the Hardest Part |
|
|
333 | (6) |
|
Which Factors Influence the Performance of a Queueing System? |
|
|
334 | (1) |
|
Operating Characteristics of a Queueing System |
|
|
334 | (1) |
|
How Does Variability Degrade the Performance of a Queueing System? |
|
|
335 | (1) |
|
Calculating the Operating Characteristics of a Queueing System |
|
|
336 | (2) |
|
|
|
338 | (1) |
|
Chapter 46 Are Roundabouts a Good Idea? |
|
|
339 | (8) |
|
|
|
340 | (1) |
|
|
|
340 | (1) |
|
|
|
341 | (2) |
|
Disadvantages of Roundabouts |
|
|
343 | (1) |
|
|
|
344 | (1) |
|
Roundabouts and Revolutions |
|
|
345 | (2) |
|
Chapter 47 Red Light, Green Light, or No Light? |
|
|
347 | (4) |
|
What Causes Traffic Jams? |
|
|
347 | (1) |
|
How Should We Set the Lights? |
|
|
348 | (1) |
|
|
|
349 | (1) |
|
Measuring the Impact of Ramp Meters |
|
|
350 | (1) |
|
The Twin Cities Metering Holiday |
|
|
350 | (1) |
|
Part IV How Do I Make Good Things Happen? |
|
|
351 | (118) |
|
Chapter 48 How Can We Improve K-12 Education? |
|
|
353 | (10) |
|
Tennessee's STAR Study on K-2 Class Size |
|
|
355 | (1) |
|
|
|
356 | (4) |
|
Can Predictive Analytics Increase Enrollment and Performance in Eighth-Grade Algebra I? |
|
|
360 | (1) |
|
|
|
360 | (3) |
|
Chapter 49 Can A/B Testing Improve My Website's Performance? |
|
|
363 | (6) |
|
Improving Obama's Fundraising in 2008 |
|
|
364 | (1) |
|
The Mechanics of Resampling |
|
|
365 | (1) |
|
|
|
366 | (3) |
|
Chapter 50 How Should I Allocate My Retirement Portfolio? |
|
|
369 | (8) |
|
The Basic Portfolio Optimization Model |
|
|
369 | (3) |
|
|
|
372 | (2) |
|
Difficulties in Implementing the Markowitz Model |
|
|
374 | (1) |
|
|
|
374 | (3) |
|
Chapter 51 How Do Hedge Funds Work? |
|
|
377 | (12) |
|
Growth in Hedge Funds and Hedge Fund Fee Structure |
|
|
378 | (1) |
|
|
|
378 | (1) |
|
Long/Short and Market-Neutral Strategies |
|
|
378 | (2) |
|
|
|
380 | (2) |
|
|
|
382 | (1) |
|
|
|
383 | (1) |
|
|
|
384 | (1) |
|
The George Costanza Portfolio |
|
|
384 | (1) |
|
|
|
385 | (4) |
|
Chapter 52 How Much Should We Order and When Should We Order? |
|
|
389 | (8) |
|
The Economic Order Quantity Model |
|
|
389 | (3) |
|
Reorder Points, Service Levels, and Safety Stock |
|
|
392 | (1) |
|
|
|
393 | (4) |
|
Chapter 53 How Does the UPS Driver Know the Order to Deliver Packages? |
|
|
397 | (8) |
|
Why Is the Traveling Salesperson Problem So Hard? |
|
|
398 | (1) |
|
Solving the Traveling Salesperson Problem |
|
|
399 | (1) |
|
The Traveling Salesperson Problem in the Real World |
|
|
400 | (1) |
|
|
|
401 | (4) |
|
Chapter 54 Can Data Win a Presidential Election? |
|
|
405 | (10) |
|
Democratic Presidential Analytics |
|
|
405 | (4) |
|
|
|
409 | (3) |
|
Cambridge Analytica and the 2016 Election |
|
|
411 | (1) |
|
|
|
412 | (3) |
|
Chapter 55 Can Analytics Save Our Republic? |
|
|
415 | (8) |
|
Arrow's Impossibility Theorem |
|
|
416 | (1) |
|
It's Not Easy to Pick a Winner! |
|
|
417 | (2) |
|
|
|
419 | (1) |
|
|
|
420 | (1) |
|
|
|
420 | (1) |
|
|
|
421 | (2) |
|
Chapter 56 Why Do I Pay Too Much on eBay? |
|
|
423 | (6) |
|
How Many Pennies in the Jar? |
|
|
423 | (1) |
|
The Importance of Asymmetric Information |
|
|
424 | (1) |
|
The Winner's Curse and Offshore Oil Leases |
|
|
424 | (1) |
|
Sports Free Agents and the Winner's Curse |
|
|
425 | (1) |
|
Can You Avoid the Winner's Curse? |
|
|
425 | (2) |
|
|
|
427 | (2) |
|
Chapter 57 Can Analytics Recognize, Predict, or Write a Hit Song? |
|
|
429 | (8) |
|
How Does Shazam Know What Song You Are Listening To? |
|
|
430 | (1) |
|
How Did Hit Song Science Know Norah Jones's Album Would Be a Smash? |
|
|
431 | (2) |
|
Can Artificial Intelligence Write a Good Song? |
|
|
433 | (4) |
|
Chapter 58 Can an Algorithm Improve Parole Decisions? |
|
|
437 | (12) |
|
An Example of Risk Scores |
|
|
438 | (3) |
|
ProPublica Criticizes Risk Scores |
|
|
441 | (2) |
|
Skeem and Lowenkamp and PCRA |
|
|
443 | (1) |
|
Machine Learning and Parole Decisions |
|
|
444 | (5) |
|
Chapter 59 How Do Baseball Teams Decide Where to Shift Fielders? |
|
|
449 | (8) |
|
|
|
449 | (1) |
|
|
|
450 | (2) |
|
Empirical Evidence on the Shift |
|
|
452 | (1) |
|
Why Not Just Beat the Shift? |
|
|
452 | (1) |
|
|
|
453 | (4) |
|
Chapter 60 Did Analytics Help the Mavericks Win the 2011 NBA Title? |
|
|
457 | (6) |
|
How Can You Evaluate a Basketball Player? |
|
|
457 | (2) |
|
From Player Ratings to Lineup Ratings |
|
|
459 | (4) |
|
Chapter 61 Who Gets the House in the Hamptons? |
|
|
463 | (6) |
|
|
|
464 | (1) |
|
What Asset Division Is Best? |
|
|
465 | (1) |
|
|
|
466 | (3) |
| Index |
|
469 | |