Preface |
|
xvii | |
Acknowledgments |
|
xxi | |
About the Author |
|
xxiii | |
|
1 A Conceptual Introduction To Bivariate Logistic Regression |
|
|
1 | (18) |
|
What Is Ordinary Least Squares Regression and How Is Logistic Regression Different? |
|
|
3 | (7) |
|
OLS Regression---A Deeper Conceptual Look |
|
|
7 | (1) |
|
Maximum Likelihood Estimation---A Gentle but Deeper Look |
|
|
8 | (2) |
|
Differences and Similarities in Assumptions Between OLS and Logistic Regression |
|
|
10 | (5) |
|
Distributional Assumptions |
|
|
10 | (1) |
|
Linearity of the Relationship |
|
|
10 | (4) |
|
|
14 | (1) |
|
Homoscedasticity (or Constant Variance of the Residuals) |
|
|
14 | (1) |
|
Independence of Observations |
|
|
15 | (1) |
|
Similarities Between OLS Regression and Logistic Regression |
|
|
15 | (1) |
|
Summarizing the Overall A Model |
|
|
15 | (1) |
|
What Is Discriminant Function Analysis and How Is Logistic Regression Superior/Different? |
|
|
16 | (1) |
|
|
17 | (1) |
|
|
17 | (2) |
|
2 How Does Logistic Regression Handle A Binary Dependent Variable? |
|
|
19 | (26) |
|
Probabilities, Conditional Probabilities, and Odds |
|
|
21 | (8) |
|
A Brief Thought Experiment on the Logistic Curve |
|
|
24 | (2) |
|
The Benefits of Odds Over Simple OLS Regression for Binary Outcomes |
|
|
26 | (1) |
|
|
26 | (1) |
|
|
27 | (2) |
|
|
29 | (1) |
|
Still More Fun With Logits, Odds, and Probabilities |
|
|
30 | (2) |
|
|
31 | (1) |
|
Where Is the Logit of the Other Group? |
|
|
32 | (1) |
|
Converting From Logits Directly Back to Conditional Probabilities |
|
|
33 | (4) |
|
Some Benefits of Conditional Probabilities |
|
|
34 | (3) |
|
Confidence Intervals for Logits, Odds Ratios, and Predicted Probabilities |
|
|
37 | (2) |
|
One Concrete Reason Why You Should Care About This Stuff---Clarity of Communication! |
|
|
39 | (1) |
|
|
39 | (2) |
|
|
41 | (2) |
|
|
43 | (1) |
|
|
44 | (1) |
|
3 Performing Simple Logistic Regression |
|
|
45 | (40) |
|
Simple Binary Logistic Regression With One Independent Variable |
|
|
45 | (11) |
|
Example 1 The Relationship Between Obesity and Diagnosis of Diabetes |
|
|
46 | (10) |
|
An Example Summary of These Results |
|
|
56 | (1) |
|
Logistic Regression With a Continuous Independent Variable |
|
|
56 | (11) |
|
Example Summary of This Analysis |
|
|
64 | (1) |
|
Example 2 The Relationship Between Family Poverty and Dropping Out of School Prior to Graduation |
|
|
64 | (2) |
|
Example Summary of This Analysis |
|
|
66 | (1) |
|
Predicting Dropout From a Continuous Family SES Variable |
|
|
67 | (4) |
|
Example Summary of This Analysis |
|
|
70 | (1) |
|
Are Classification Tables Ever Useful? |
|
|
71 | (4) |
|
How Should We Interpret Odds Ratios That Are Less Than 1.0? |
|
|
75 | (3) |
|
|
78 | (1) |
|
|
78 | (2) |
|
|
80 | (2) |
|
|
82 | (1) |
|
|
83 | (2) |
|
4 A Practical Guide To Testing-Assumptions And Cleaning Data For Logistic Regression |
|
|
85 | (46) |
|
Independence of Observations |
|
|
86 | (1) |
|
Collinearity of Independent Variables |
|
|
87 | (1) |
|
Fully Represented Data Matrix |
|
|
88 | (1) |
|
|
89 | (2) |
|
Model Is Correctly Specified---No Important Variables Omitted, No Extraneous Variables Included |
|
|
91 | (9) |
|
The Logit Link Function Is Appropriate |
|
|
91 | (1) |
|
The Relationships Are "Linear on the Logit" |
|
|
92 | (3) |
|
Effects of Independent Variables Are Additive in Nature |
|
|
95 | (1) |
|
All Important Variables Are Included in the Analysis |
|
|
96 | (1) |
|
No Extraneous Variables Are Included in the Analysis |
|
|
97 | (3) |
|
Normality of Errors, Distributional Assumptions |
|
|
100 | (2) |
|
Exploring Data for Inappropriately Influential Cases |
|
|
102 | (8) |
|
|
104 | (6) |
|
Leverage and Influence in Logistic Regression |
|
|
110 | (4) |
|
Cook's Distance (Influence) |
|
|
112 | (2) |
|
|
114 | (2) |
|
The Relationship Between DfBetas and Other Measures of Influence/Leverage |
|
|
116 | (1) |
|
|
116 | (1) |
|
|
116 | (1) |
|
|
117 | (11) |
|
|
128 | (2) |
|
|
130 | (1) |
|
5 Continuous Predictors: Why Splitting Continuous Variables Into Categories Is Undesirable |
|
|
131 | (40) |
|
What Is Categorization and Why Does It Exist? |
|
|
132 | (2) |
|
How Widespread Is This Practice? |
|
|
134 | (2) |
|
Why Do Researchers Use Dichotomization and Similar Techniques? |
|
|
136 | (3) |
|
The Evils of Cutoff Scores |
|
|
137 | (2) |
|
Are Analyses More Easily Interpreted With Dichotomous Variables? |
|
|
139 | (1) |
|
Are Analyses With Dichotomous Variables Easier to Compute? |
|
|
140 | (1) |
|
Are Dichotomous Variables More Reliable? |
|
|
140 | (10) |
|
Dichotomization Improves Reliability of Measurement |
|
|
141 | (9) |
|
Does Dichotomization Effectively Deal With Non-Normality or Outliers? |
|
|
150 | (2) |
|
Extreme Groups Analysis: Generally Ill-Advised and Often Dishonest |
|
|
152 | (1) |
|
Some Other Drawbacks of Dichotomization |
|
|
153 | (1) |
|
Curvilinearity and Interactions Will Be Masked or Undetectable |
|
|
153 | (1) |
|
The Variable Is by Nature Categorical |
|
|
153 | (1) |
|
Spurious Statistical Significance |
|
|
154 | (1) |
|
Dichotomization of Variables and Logistic Regression |
|
|
154 | (3) |
|
Inhibiting Accurate Meta-Analysis of Results |
|
|
156 | (1) |
|
What About Other k-Groups Categorization Schemes? |
|
|
157 | (1) |
|
So What Are Best Practices for Logistic Regression With Continuous Variables? |
|
|
157 | (3) |
|
Best Practices in Reporting Results from Continuous Variables: Conditional Probabilities |
|
|
160 | (3) |
|
|
163 | (1) |
|
|
163 | (2) |
|
|
165 | (2) |
|
|
167 | (4) |
|
6 Using Unordered Categorical Independent Variables In Logistic Regression |
|
|
171 | (30) |
|
|
171 | (1) |
|
|
172 | (1) |
|
|
172 | (1) |
|
Different Classifications of Measurement |
|
|
173 | (2) |
|
|
175 | (4) |
|
Define the Reference Group |
|
|
175 | (1) |
|
Set Up the Dummy Coded Variables |
|
|
175 | (4) |
|
Alternatives to Dummy Coding |
|
|
179 | (6) |
|
Difference (Reverse Helmert) Contrasts |
|
|
182 | (2) |
|
|
184 | (1) |
|
|
185 | (2) |
|
|
187 | (2) |
|
|
189 | (1) |
|
|
189 | (1) |
|
|
190 | (8) |
|
|
198 | (1) |
|
|
199 | (2) |
|
7 Curvilinear Effects In Logistic-Regression |
|
|
201 | (42) |
|
A Brief Review of the Assumption of Linearity |
|
|
202 | (1) |
|
Illegitimate Causes of Curvilinearity |
|
|
203 | (4) |
|
Model Misspecification: Omission of Important Variables |
|
|
203 | (1) |
|
Violating Equal Intervals in Coding Continuous Variables |
|
|
203 | (2) |
|
|
205 | (2) |
|
Detection of Nonlinear Effects |
|
|
207 | (2) |
|
|
207 | (1) |
|
|
208 | (1) |
|
Box-Tidwell Transformations |
|
|
208 | (1) |
|
Curvilinear Logistic Regression Example: Diabetes and Age |
|
|
209 | (4) |
|
Adding Quadratic and Cubic Terms to the Logistic Regression Analysis |
|
|
209 | (4) |
|
An Example Summary of This Analysis |
|
|
213 | (1) |
|
Estimating Curvilinear Relationships Using Box-Tidwell Transformations |
|
|
213 | (1) |
|
Data Cleaning and Curvilinear Effects |
|
|
214 | (6) |
|
SAS Analyses Using DIFCHISQ |
|
|
220 | (2) |
|
Advanced Topics in Curvilinear Regression: Estimating Minima and Maxima as Well as Slope at Any Point on the Curve |
|
|
222 | (6) |
|
|
228 | (1) |
|
|
229 | (1) |
|
|
229 | (11) |
|
|
240 | (2) |
|
|
242 | (1) |
|
8 Logistic Regression With Multiple Independent Variables: Opportunities And Pitfalls |
|
|
243 | (54) |
|
The Basics of Multiple Predictors |
|
|
244 | (1) |
|
What Are the Implications of This Act? |
|
|
245 | (3) |
|
Example Summary of Previous Analysis |
|
|
248 | (1) |
|
Different Methods of Entry |
|
|
249 | (5) |
|
User-Controlled Methods of Entry |
|
|
249 | (1) |
|
|
249 | (1) |
|
|
250 | (1) |
|
Software-Controlled Entry |
|
|
251 | (3) |
|
|
254 | (2) |
|
Assessing the Overall Model---Why There Is No R2 for Logistic Regression |
|
|
256 | (2) |
|
|
258 | (9) |
|
|
259 | (1) |
|
Procedural Issues in Testing for Interactions Between Continuous Variables |
|
|
259 | (4) |
|
Procedural Issues With Graphing |
|
|
263 | (4) |
|
Example Summary of Interaction Analysis |
|
|
267 | (1) |
|
Interactions Between Categorical and Continuous Variables |
|
|
267 | (4) |
|
Interactions and Data Cleaning |
|
|
271 | (3) |
|
|
274 | (7) |
|
Step 1 Create the Terms Prior to Analysis |
|
|
276 | (1) |
|
Step 2 Build Your Equation Slowly |
|
|
276 | (5) |
|
Curvilinear Interactions With Categorical Variables |
|
|
281 | (3) |
|
|
284 | (1) |
|
|
285 | (1) |
|
|
286 | (4) |
|
Curvilinear Effects With BMI and Smoking |
|
|
290 | (4) |
|
|
294 | (3) |
|
9 A Brief Overview Of Probit Regression |
|
|
297 | (16) |
|
|
298 | (2) |
|
|
300 | (2) |
|
Why Are There Two Different Procedures If They Produce the Same Results? |
|
|
302 | (4) |
|
The Value of Having a Sensible Intercept |
|
|
306 | (3) |
|
Some Nice Features of Probit |
|
|
309 | (1) |
|
Assumptions of Probit Regression |
|
|
310 | (1) |
|
|
310 | (1) |
|
|
310 | (1) |
|
|
311 | (1) |
|
|
311 | (2) |
|
10 Replication And Generalizability In Logistic Regression |
|
|
313 | (44) |
|
Sample Size, Power, and Volatility in Logistic Regression |
|
|
314 | (1) |
|
What Is Statistical Power and Why Should You Care About It? |
|
|
314 | (1) |
|
How Null Hypothesis Statistical Testing Relates to Power |
|
|
315 | (1) |
|
What Do Statistical Tests Tell Us? |
|
|
316 | (2) |
|
So What Does Failure to Reject the Null Hypothesis Mean? |
|
|
318 | (1) |
|
So What Is Power and How Does It Relate to Error Rates? |
|
|
319 | (3) |
|
Power in Logistic Regression |
|
|
320 | (2) |
|
Summary of Points Thus Far |
|
|
322 | (1) |
|
Who Cares as Long as p < .05? Volatility in Logistic Regression Analyses |
|
|
323 | (14) |
|
|
324 | (11) |
|
|
335 | (2) |
|
Cross-Validation and Replication of Logistic Regression Analyses |
|
|
337 | (5) |
|
Prediction and Explanation Using Regression |
|
|
337 | (4) |
|
Comparing Internal Validation Models in Logistic Regression |
|
|
341 | (1) |
|
|
342 | (7) |
|
Biased Sample #1 (N = 100, b = 0.361, OR = 1.435) |
|
|
342 | (2) |
|
Biased Sample #2 (N = 100, b = 1.619, OR = 5.051) |
|
|
344 | (1) |
|
Relatively Unbiased Sample #3 |
|
|
344 | (1) |
|
|
345 | (4) |
|
|
349 | (1) |
|
|
350 | (1) |
|
|
351 | (2) |
|
|
353 | (4) |
|
11 Modern And Effective Methods Of Dealing With Missing Data |
|
|
357 | (32) |
|
Dealing With Missing or Incomplete Data in Logistic Regression |
|
|
358 | (7) |
|
Not All Missing Data Are the Same |
|
|
360 | (3) |
|
Categories of Missingness: Why Do We Care If Data Are Missing Completely at Random or Not? |
|
|
363 | (2) |
|
How Do You Know If Your Data Are MCAR, MAR, or MNAR? |
|
|
365 | (4) |
|
What Do We Do With Missing Data? |
|
|
369 | (1) |
|
Data Missing Completely at Random (MCAR) |
|
|
369 | (5) |
|
|
371 | (1) |
|
Strong and Weak Imputation |
|
|
372 | (2) |
|
|
374 | (1) |
|
Data Missing Not at Random |
|
|
374 | (8) |
|
The Effects of Listwise Deletion (Complex Case Analysis) |
|
|
375 | (3) |
|
The Detrimental Effects of Mean Substitution |
|
|
378 | (1) |
|
The Effects of Weak Imputation of Values |
|
|
379 | (1) |
|
|
380 | (1) |
|
So Where Does That Leave Us? |
|
|
381 | (1) |
|
Multiple Imputation as a Modern Method of Missing Data Estimation |
|
|
382 | (1) |
|
How Missingness Can Be an Interesting Variable in and of Itself |
|
|
383 | (1) |
|
Summing Up: Benefits of Appropriately Handling Missing Data |
|
|
384 | (1) |
|
|
385 | (1) |
|
|
386 | (3) |
|
12 Multinomial And Ordinal Logistic Regression |
|
|
389 | (46) |
|
Multinomial Logistic Regression With a Continuous Variable |
|
|
393 | (2) |
|
Moving Beyond Simple Multinomial Logistic Regression |
|
|
395 | (1) |
|
More Complex Terms in Multinomial Logistic Regression |
|
|
396 | (3) |
|
Multinomial Logistic Regression as a Series of Binary Logistic Regression Equations |
|
|
399 | (2) |
|
Examples of Data Cleaning Using Binary Logistic Regression |
|
|
401 | (4) |
|
Testing Whether Groups Can Be Combined |
|
|
405 | (2) |
|
Ordered Logit (Proportional Odds) Model |
|
|
407 | (2) |
|
Assumptions of the Ordinal or Proportional Odds Model |
|
|
409 | (2) |
|
Interpreting the Results of the Analysis |
|
|
411 | (2) |
|
Interpreting the Intercepts |
|
|
412 | (1) |
|
Interpreting the Parameter Estimates |
|
|
412 | (1) |
|
Data Cleaning and More Advanced Models in Ordinal Logistic Regression |
|
|
413 | (2) |
|
Why Not Just Use OLS Regression for This Type of Analysis? |
|
|
414 | (1) |
|
|
415 | (1) |
|
|
415 | (1) |
|
|
416 | (16) |
|
|
432 | (1) |
|
|
433 | (2) |
|
13 Multilevel Modeling With Logistic Regression |
|
|
435 | (16) |
|
|
436 | (12) |
|
What Is a Hierarchical Data Structure? |
|
|
436 | (1) |
|
The Issue With Nested Data |
|
|
437 | (2) |
|
How Do Hierarchical Models Work? A Brief Primer |
|
|
439 | (2) |
|
A Brief Note About HLM and Statistical Software |
|
|
441 | (1) |
|
|
441 | (1) |
|
Logits, Odds Ratios, Conditional Odds, Conditional Probabilities, and Relative Risk in HLM |
|
|
442 | (1) |
|
Results of DROPOUT Analysis in HLM |
|
|
443 | (1) |
|
Cross-Level Interactions in HLM Logistic Regression |
|
|
444 | (1) |
|
So What Would Have Happened If These Data Had Been Analyzed via Simple Logistic Regression Without Accounting for the Nested Data Structure? |
|
|
445 | (3) |
|
|
448 | (1) |
|
|
449 | (1) |
|
|
449 | (2) |
Author Index |
|
451 | (4) |
Subject Index |
|
455 | |