Are you using SAP ERP and eager to unlock the enormous value of its data? With this practical guide, SAP veterans Greg Foss and Paul Modderman show you how to use several data analysis tools to solve interesting problems with your SAP data. Throughout the book, youll follow a fictional company as it tackles real scenarios.
Using actual data to create example code and visualizations, SAP business analysts will learn practical methods for gaining deeper insights into their businesss data. Data engineers and data scientists will explore ways to add SAP data to their analysis processes. Through grounded explanations of both SAP processes and data science tools, youll discover powerful methods for discovering data truths.
Use data to tell revealing stories about your customers Model purchase requisition data using exploratory data analysis Create an anomaly detection system for SAP sales orders Use R and Python to make predictions on sales data Cluster and segment your customers based on their buying habits Use association rule learning to discover customer buying patterns Apply NLP to uncover the most highly actionable customer complaints
Preface |
|
vii | |
|
|
1 | (12) |
|
Telling Better Stories with Data |
|
|
1 | (2) |
|
A Quick Look: Data Science for SAP Professionals |
|
|
3 | (3) |
|
A Quick Look: SAP Basics for Data Scientists |
|
|
6 | (5) |
|
|
8 | (3) |
|
Roles and Responsibilities |
|
|
11 | (1) |
|
|
11 | (2) |
|
2 Data Science for SAP Professionals |
|
|
13 | (32) |
|
|
14 | (12) |
|
Supervised Machine Learning |
|
|
15 | (3) |
|
Unsupervised Machine Learning |
|
|
18 | (3) |
|
Semi-Supervised Machine Learning |
|
|
21 | (1) |
|
Reinforcement Machine Learning |
|
|
22 | (4) |
|
|
26 | (17) |
|
|
43 | (2) |
|
3 SAP for Data Scientists |
|
|
45 | (48) |
|
|
46 | (3) |
|
|
49 | (19) |
|
|
50 | (3) |
|
|
53 | (1) |
|
Data Elements and Domains |
|
|
54 | (4) |
|
|
58 | (4) |
|
|
62 | (6) |
|
|
68 | (1) |
|
|
68 | (12) |
|
|
80 | (11) |
|
|
91 | (2) |
|
4 Exploratory Data Analysis with R |
|
|
93 | (40) |
|
|
95 | (1) |
|
Phase 1 Collecting Our Data |
|
|
96 | (11) |
|
|
104 | (3) |
|
Phase 2 Cleaning Our Data |
|
|
107 | (1) |
|
|
107 | (1) |
|
|
107 | (1) |
|
Removing Extraneous Columns |
|
|
108 | (1) |
|
|
108 | (1) |
|
|
109 | (1) |
|
Phase 3 Analyzing Our Data |
|
|
109 | (12) |
|
|
110 | (3) |
|
|
113 | (4) |
|
|
117 | (4) |
|
Phase 4 Modeling Our Data |
|
|
121 | (11) |
|
|
122 | (1) |
|
Training and Testing Split |
|
|
122 | (1) |
|
Shaping and One-Hot Encoding |
|
|
123 | (1) |
|
|
124 | (2) |
|
Preparing Data for the Neural Network |
|
|
126 | (4) |
|
|
130 | (2) |
|
|
132 | (1) |
|
5 Anomaly Detection with R and Python |
|
|
133 | (58) |
|
|
134 | (1) |
|
|
135 | (39) |
|
|
135 | (1) |
|
|
136 | (1) |
|
|
136 | (1) |
|
|
137 | (5) |
|
|
142 | (11) |
|
|
153 | (21) |
|
|
174 | (15) |
|
|
174 | (10) |
|
|
184 | (5) |
|
|
189 | (2) |
|
6 Predictive Analytics in R and Python |
|
|
191 | (34) |
|
|
193 | (17) |
|
|
193 | (1) |
|
|
193 | (1) |
|
|
194 | (1) |
|
|
195 | (11) |
|
|
206 | (4) |
|
Predicting Sales in Python |
|
|
210 | (12) |
|
|
210 | (1) |
|
|
210 | (6) |
|
|
216 | (3) |
|
|
219 | (1) |
|
|
220 | (2) |
|
|
222 | (3) |
|
7 Clustering and Segmentation in R |
|
|
225 | (42) |
|
Understanding Clustering and Segmentation |
|
|
226 | (7) |
|
|
227 | (1) |
|
|
228 | (1) |
|
|
229 | (1) |
|
|
230 | (1) |
|
|
231 | (2) |
|
|
233 | (1) |
|
Step 1 Collecting the Data |
|
|
233 | (1) |
|
|
234 | (6) |
|
Step 3 Analyzing the Data |
|
|
240 | (18) |
|
Revisiting the Pareto Principle |
|
|
240 | (1) |
|
|
241 | (3) |
|
|
244 | (5) |
|
|
249 | (4) |
|
|
253 | (2) |
|
|
255 | (3) |
|
Step 4 Report the Findings |
|
|
258 | (6) |
|
|
261 | (1) |
|
|
262 | (2) |
|
|
264 | (3) |
|
8 Association Rule Mining |
|
|
267 | (22) |
|
Understanding Association Rule Mining |
|
|
269 | (1) |
|
|
269 | (1) |
|
|
269 | (1) |
|
|
270 | (1) |
|
|
270 | (1) |
|
Operationalization Overview |
|
|
270 | (1) |
|
|
271 | (5) |
|
|
276 | (1) |
|
|
277 | (10) |
|
|
282 | (5) |
|
|
287 | (2) |
|
9 Natural Language Processing with the Google Cloud Natural Language API |
|
|
289 | (16) |
|
Understanding Natural Language Processing |
|
|
290 | (2) |
|
|
290 | (2) |
|
|
292 | (1) |
|
|
292 | (6) |
|
|
298 | (3) |
|
|
301 | (2) |
|
|
303 | (2) |
|
|
305 | (6) |
|
|
305 | (1) |
|
|
306 | (2) |
|
|
306 | (1) |
|
Chapter 2 Data Science for SAP Professionals |
|
|
306 | (1) |
|
Chapter 3 SAP for Data Scientists |
|
|
306 | (1) |
|
Chapter 4 Exploratory Data Analysis |
|
|
307 | (1) |
|
Chapter 5 Anomaly Detection with R and Python |
|
|
307 | (1) |
|
Chapter 6 Prediction with R |
|
|
307 | (1) |
|
Chapter 7 Clustering and Segmentation in R |
|
|
307 | (1) |
|
Chapter 8 Association Rule Mining |
|
|
307 | (1) |
|
Chapter 9 Natural Language Processing with the Google Cloud Natural Language API |
|
|
308 | (1) |
|
|
308 | (1) |
|
|
308 | (1) |
|
|
308 | (1) |
|
|
309 | (1) |
|
|
309 | (2) |
Index |
|
311 | |
Greg Foss fuses battle-tested deep SAP knowledge with a passion for all things data science. His SAP career spans all areas of the technology stack - server, database, security, back and front end development, and functional expertise. As an enterprise architect, he's been the steady guiding hand for years of managing, supporting, and enhancing SAP. As the founder of Blue Diesel Data Science, he focuses years of R, Python, machine learning algorithms, and analytics expertise on finding unique stories to tell from enterprise SAP data. Through Blue Diesel, Greg regularly contributes unique knowledge and insight into the data science blogging community, and is the principal developer and architect of VisionaryRX, an innovative pharmaceutical data dashboarding product.
Paul Modderman loves creating things and sharing them. His tech career has spanned web applications with technologies like .NET, Java, Python, and React to SAP solutions in ABAP, OData and SAPUI5, to cloud technologies in Google Cloud Platform, Amazon Web Services, and Microsoft Azure. He was principal technical architect on Mindset's certified solutions CloudSimple and Analytics for BW. He's an SAP Developer Hero, honored in 2017. Paul is the author of two books: Mindset Perspectives: SAP Development Tips, Tricks, and Projects, and the SAP Press published SAPUI5 and SAP Fiori: The Psychology of UX Design.