About the Author |
|
xi | |
About the Technical Reviewer |
|
xiii | |
Foreword |
|
xv | |
Acknowledgments |
|
xvii | |
Introduction |
|
xix | |
|
Chapter 1 Introduction to Supervised Learning |
|
|
1 | (46) |
|
|
2 | (8) |
|
Relationship Between Data Analysis, Data Mining, ML, and Al |
|
|
3 | (2) |
|
Data, Data Types, and Data Sources |
|
|
5 | (5) |
|
How ML Differs from Software Engineering |
|
|
10 | (5) |
|
|
12 | (3) |
|
Statistical and Mathematical Concepts for ML |
|
|
15 | (10) |
|
Supervised Learning Algorithms |
|
|
25 | (9) |
|
Regression vs. Classification Problems |
|
|
28 | (2) |
|
Steps in a Supervised Learning Algorithm |
|
|
30 | (4) |
|
Unsupervised Learning Algorithms |
|
|
34 | (3) |
|
|
35 | (1) |
|
|
36 | (1) |
|
Semi-supervised Learning Algorithms |
|
|
37 | (1) |
|
|
37 | (2) |
|
|
39 | (2) |
|
|
41 | (3) |
|
|
44 | (3) |
|
Chapter 2 Supervised Learning for Regression Analysis |
|
|
47 | (70) |
|
Technical Toolkit Required |
|
|
48 | (1) |
|
Regression analysis and Use Cases |
|
|
49 | (2) |
|
What Is Linear Regression |
|
|
51 | (8) |
|
Assumptions of Linear Regression |
|
|
56 | (3) |
|
Measuring the Efficacy of Regression Problem |
|
|
59 | (25) |
|
Example 1 Creating a Simple Linear Regression |
|
|
68 | (3) |
|
Example 2 Simple Linear Regression for Housing Dataset |
|
|
71 | (7) |
|
Example 3 Multiple Linear Regression for Housing Dataset |
|
|
78 | (6) |
|
Nonlinear Regression Analysis |
|
|
84 | (4) |
|
Identifying a Nonlinear Relationship |
|
|
88 | (3) |
|
Assumptions for a Nonlinear Regression |
|
|
89 | (2) |
|
Challenges with a Regression Model |
|
|
91 | (3) |
|
Tree-Based Methods for Regression |
|
|
94 | (4) |
|
Case study: Petrol consumption using Decision tree |
|
|
98 | (5) |
|
Ensemble Methods for Regression |
|
|
103 | (3) |
|
Case study: Petrol consumption using Random Forest |
|
|
106 | (4) |
|
Feature Selection Using Tree-Based Methods |
|
|
110 | (3) |
|
|
113 | (4) |
|
Chapter 3 Supervised Learning for Classification Problems |
|
|
117 | (74) |
|
Technical Toolkit Required |
|
|
118 | (1) |
|
Hypothesis Testing and p-Value |
|
|
118 | (3) |
|
Classification Algorithms |
|
|
121 | (8) |
|
Logistic Regression for Classification |
|
|
124 | (5) |
|
Assessing the Accuracy of the Solution |
|
|
129 | (7) |
|
|
136 | (14) |
|
|
149 | (1) |
|
Naive Bayes for Classification |
|
|
150 | (4) |
|
Case Study: Income Prediction on Census Data |
|
|
154 | (9) |
|
K-Nearest Neighbors for Classification |
|
|
163 | (6) |
|
Case Study: k-Nearest Neighbor |
|
|
169 | (9) |
|
|
170 | (1) |
|
|
170 | (8) |
|
Tree-Based Algorithms for Classification |
|
|
178 | (4) |
|
Types of Decision Tree Algorithms |
|
|
182 | (6) |
|
|
188 | (3) |
|
Chapter 4 Advanced Algorithms for Supervised Learning |
|
|
191 | (100) |
|
Technical Toolkit Required |
|
|
192 | (1) |
|
|
193 | (15) |
|
Using Gradient Boosting Algorithm |
|
|
198 | (10) |
|
|
208 | (13) |
|
|
210 | (3) |
|
|
213 | (2) |
|
|
215 | (6) |
|
Supervised Algorithms for Unstructured Data |
|
|
221 | (1) |
|
|
222 | (21) |
|
|
223 | (3) |
|
Challenges with Text Data |
|
|
226 | (2) |
|
Text Analytics Modeling Process |
|
|
228 | (2) |
|
Text Data Extraction and Management |
|
|
230 | (3) |
|
Preprocessing of Text Data |
|
|
233 | (3) |
|
Extracting Features from Text Data |
|
|
236 | (7) |
|
Case study: Customer complaints analysis using NLP |
|
|
243 | (5) |
|
|
246 | (2) |
|
Case study: Customer complaints analysis using word embeddings |
|
|
248 | (4) |
|
|
252 | (9) |
|
|
253 | (3) |
|
Challenges with Image Data |
|
|
256 | (2) |
|
Image Data Management Process |
|
|
258 | (2) |
|
Image Data Modeling Process |
|
|
260 | (1) |
|
Fundamentals of Deep Learning |
|
|
261 | (15) |
|
Artificial Neural Networks |
|
|
261 | (4) |
|
|
265 | (3) |
|
Loss Function in a Neural Network |
|
|
268 | (1) |
|
Optimization in a Neural Network |
|
|
268 | (4) |
|
Neural Network Training Process |
|
|
272 | (4) |
|
Case Study 1: Create a Classification Model on Structured Data |
|
|
276 | (5) |
|
Case Study 2: Image Classification Model |
|
|
281 | (6) |
|
|
287 | (4) |
|
Chapter 5 End-to-End Model Development |
|
|
291 | (76) |
|
Technical Toolkit Required |
|
|
292 | (1) |
|
|
292 | (2) |
|
Step 1 Define the Business Problem |
|
|
294 | (2) |
|
Step 2 Data Discovery Phase |
|
|
296 | (5) |
|
Step 3 Data Cleaning and Preparation |
|
|
301 | (15) |
|
Duplicates in the Dataset |
|
|
302 | (2) |
|
Categorical Variable Treatment in Dataset |
|
|
304 | (3) |
|
Missing Values Present in the Dataset |
|
|
307 | (9) |
|
|
316 | (5) |
|
|
321 | (4) |
|
Other Common Problems in the Dataset |
|
|
325 | (3) |
|
|
328 | (7) |
|
|
335 | (16) |
|
|
336 | (6) |
|
Finding the Best Threshold for Classification Algorithms |
|
|
342 | (1) |
|
Overfitting vs. Underfitting Problem |
|
|
343 | (7) |
|
Key Stakeholder Discussion and Iterations |
|
|
350 | (1) |
|
Presenting the Final Model |
|
|
350 | (1) |
|
Step 6 Deployment of the Model |
|
|
351 | (12) |
|
|
363 | (1) |
|
Step 8 Model Refresh and Maintenance |
|
|
363 | (1) |
|
|
364 | (3) |
Index |
|
367 | |