Foreword |
|
xii | |
Preface |
|
xiv | |
Acknowledgments |
|
xvi | |
About this book |
|
xvii | |
About the authors |
|
xxiv | |
About the cover illustration |
|
xxvi | |
|
|
1 | (84) |
|
1 The what and why of the data mesh |
|
|
3 | (27) |
|
|
4 | (3) |
|
|
7 | (4) |
|
|
8 | (1) |
|
Data warehouses and data lakes inside the data mesh |
|
|
9 | (1) |
|
|
10 | (1) |
|
1.3 Use case: A snow-shoveling business |
|
|
11 | (4) |
|
|
15 | (9) |
|
Domain-oriented decentralized data ownership and architecture |
|
|
16 | (2) |
|
|
18 | (2) |
|
Federated computational governance |
|
|
20 | (2) |
|
Self-serve data infrastructure as a platform |
|
|
22 | (2) |
|
1.5 Back to snow shoveling |
|
|
24 | (1) |
|
1.6 Socio-technical architecture |
|
|
25 | (2) |
|
|
25 | (1) |
|
|
26 | (1) |
|
|
26 | (1) |
|
|
27 | (3) |
|
|
27 | (1) |
|
Data management challenges |
|
|
27 | (1) |
|
Organizational challenges |
|
|
28 | (2) |
|
2 Is a data mesh right for you? |
|
|
30 | (26) |
|
2.1 Analyzing data mesh drivers |
|
|
31 | (8) |
|
|
31 | (2) |
|
|
33 | (2) |
|
|
35 | (1) |
|
Minor organizational drivers |
|
|
36 | (2) |
|
Is a data mesh a good fit for me? |
|
|
38 | (1) |
|
2.2 Data mesh alternatives and complementary solutions |
|
|
39 | (6) |
|
Enterprise data warehouse |
|
|
39 | (2) |
|
|
41 | (1) |
|
|
42 | (1) |
|
|
43 | (1) |
|
Data mesh vs. the rest of the world |
|
|
44 | (1) |
|
2.3 Understanding a data mesh implementation effort |
|
|
45 | (11) |
|
The data mesh development cycle |
|
|
45 | (3) |
|
Development cycle in the shoveling example |
|
|
48 | (1) |
|
|
49 | (3) |
|
Development cycle in detail |
|
|
52 | (4) |
|
3 Kickstart your data mesh MVP in a month |
|
|
56 | (29) |
|
3.1 Getting the lay of the land |
|
|
57 | (6) |
|
Drawing a system landscape diagram |
|
|
58 | (2) |
|
Performing stakeholder analysis |
|
|
60 | (3) |
|
3.2 Identifying candidates for the MVP implementation team |
|
|
63 | (6) |
|
Choosing development teams |
|
|
63 | (3) |
|
Choosing the cooperation model |
|
|
66 | (1) |
|
Choosing a data governance team |
|
|
66 | (3) |
|
3.3 Setting up MVP governance |
|
|
69 | (3) |
|
Defining data mesh value statement(s) |
|
|
70 | (1) |
|
Defining data governance policies |
|
|
71 | (1) |
|
Federating data governance |
|
|
72 | (1) |
|
3.4 Developing minimal data products |
|
|
72 | (8) |
|
Identifying domain-oriented datasets |
|
|
73 | (3) |
|
Choosing data product owners |
|
|
76 | (1) |
|
Deciding on the minimum viable data product description |
|
|
77 | (2) |
|
Developing the simplest tools to expose your data |
|
|
79 | (1) |
|
3.5 Setting up the minimal platform |
|
|
80 | (5) |
|
Ensuring platform-forced governability |
|
|
81 | (1) |
|
Ensuring platform security |
|
|
82 | (3) |
|
Part 2 The four principles in practice |
|
|
85 | (134) |
|
|
87 | (36) |
|
4.1 Capturing and analyzing domains |
|
|
90 | (5) |
|
|
91 | (1) |
|
|
92 | (1) |
|
Choose the correct workshop technique |
|
|
93 | (2) |
|
4.2 Applying ownership using domain decomposition |
|
|
95 | (14) |
|
Domain, subdomain, and business capability |
|
|
97 | (3) |
|
Decompose domains using business capability modeling |
|
|
100 | (1) |
|
How are domains and business capabilities related to data? |
|
|
101 | (5) |
|
Assign responsibilities to the data-product-owning team |
|
|
104 | (2) |
|
Choose the right team to own data |
|
|
106 | (3) |
|
4.3 Applying ownership using data use cases |
|
|
109 | (5) |
|
|
109 | (2) |
|
Model and bounded context |
|
|
111 | (2) |
|
Setup boundaries of use-case-driven data products |
|
|
113 | (1) |
|
Choose the right team to own data |
|
|
114 | (1) |
|
4.4 Applying ownership using design heuristics |
|
|
114 | (4) |
|
|
115 | (1) |
|
|
115 | (1) |
|
Designing heuristics and possible boundaries |
|
|
115 | (3) |
|
4.5 Final landscape: The mesh of interconnected data products |
|
|
118 | (5) |
|
|
118 | (2) |
|
Data products form a mesh |
|
|
120 | (1) |
|
Is it already a data mesh? |
|
|
121 | (2) |
|
|
123 | (40) |
|
5.1 Applying product thinking |
|
|
124 | (7) |
|
Product thinking analysis |
|
|
125 | (3) |
|
|
128 | (3) |
|
5.2 What is a data product? |
|
|
131 | (4) |
|
|
131 | (2) |
|
|
133 | (1) |
|
What can be a data product? |
|
|
134 | (1) |
|
5.3 Data product ownership |
|
|
135 | (5) |
|
|
136 | (1) |
|
Data product owner responsibilities |
|
|
137 | (1) |
|
An Agile DevOps team as a base for data product dev team |
|
|
138 | (1) |
|
Data product owner and product owner |
|
|
139 | (1) |
|
5.4 Conceptual architecture of a data product |
|
|
140 | (6) |
|
External architecture view |
|
|
140 | (4) |
|
Internal architecture view |
|
|
144 | (2) |
|
5.5 Data product fundamental characteristics |
|
|
146 | (6) |
|
Self-described data product |
|
|
146 | (1) |
|
|
147 | (1) |
|
|
147 | (2) |
|
|
149 | (1) |
|
|
150 | (1) |
|
|
151 | (1) |
|
5.6 Additional data product characteristics: FAIR and immutability |
|
|
152 | (5) |
|
|
152 | (2) |
|
|
154 | (1) |
|
|
155 | (1) |
|
|
156 | (1) |
|
|
157 | (1) |
|
5.7 Data contracts and sharing agreements inside the data mesh |
|
|
157 | (6) |
|
Data contracts and sharing agreements |
|
|
159 | (1) |
|
Implementing data contracts and sharing agreements |
|
|
160 | (3) |
|
6 Federated computational governance |
|
|
163 | (29) |
|
6.1 Data governance in a nutshell |
|
|
164 | (3) |
|
6.2 Benefits of data governance |
|
|
167 | (2) |
|
Business value perspective |
|
|
167 | (1) |
|
Data usability perspective |
|
|
168 | (1) |
|
|
168 | (1) |
|
6.3 Planning data governance outcomes |
|
|
169 | (10) |
|
Hierarchy of data governance outcomes |
|
|
171 | (1) |
|
|
172 | (4) |
|
|
176 | (1) |
|
Implementation-level outcomes |
|
|
177 | (2) |
|
6.4 Federating data governance |
|
|
179 | (9) |
|
Thinking of data governance in terms of "sliders" |
|
|
179 | (1) |
|
Extreme ends of data governance models |
|
|
180 | (1) |
|
Federated data governance model |
|
|
181 | (6) |
|
Setting-up governance team operations |
|
|
187 | (1) |
|
6.5 Making data governance computational |
|
|
188 | (4) |
|
Making policies computational |
|
|
189 | (1) |
|
|
190 | (2) |
|
7 The self-serve fata platform |
|
|
192 | (27) |
|
|
193 | (7) |
|
|
195 | (2) |
|
|
197 | (3) |
|
7.2 Improvements with X as a service |
|
|
200 | (5) |
|
|
201 | (2) |
|
|
203 | (2) |
|
7.3 Improvements with platform architecture |
|
|
205 | (7) |
|
Platform architecture explained |
|
|
206 | (2) |
|
Platform architecture applied |
|
|
208 | (4) |
|
7.4 Improvements for the data producers |
|
|
212 | (7) |
|
Part 3 Infrastructure and technical architecture |
|
|
219 | (64) |
|
8 Comparing self-serve data platforms |
|
|
221 | (26) |
|
8.1 Data mesh on Google Cloud Platform |
|
|
223 | (7) |
|
Self-serve data platform architecture |
|
|
223 | (2) |
|
Identifying the components of the platform |
|
|
225 | (1) |
|
Identifying the components of the data product |
|
|
225 | (2) |
|
|
227 | (1) |
|
|
227 | (1) |
|
Relation to data mesh ideas |
|
|
228 | (1) |
|
|
229 | (1) |
|
|
230 | (6) |
|
Self-serve data platform architecture |
|
|
230 | (2) |
|
Identifying the components of the platform |
|
|
232 | (1) |
|
Identifying the components of the data products |
|
|
232 | (2) |
|
|
234 | (1) |
|
Relation to data mesh ideas |
|
|
234 | (1) |
|
|
235 | (1) |
|
A WS architecture summary |
|
|
235 | (1) |
|
8.3 Data mesh on Databricks |
|
|
236 | (5) |
|
Self-serve data platform architecture |
|
|
236 | (2) |
|
Identifying the components of the platform |
|
|
238 | (1) |
|
Identifying the components of the data product |
|
|
239 | (1) |
|
|
240 | (1) |
|
|
240 | (1) |
|
Databricks architecture summary |
|
|
240 | (1) |
|
|
241 | (6) |
|
Self-serve data platform architecture |
|
|
241 | (2) |
|
Identifying the components |
|
|
243 | (1) |
|
|
244 | (1) |
|
Kafka architecture summary |
|
|
245 | (2) |
|
9 Solution architecture design |
|
|
247 | (36) |
|
9.1 Capturing and understanding the current state |
|
|
248 | (4) |
|
What is software architecture? |
|
|
248 | (1) |
|
How to document architecture: The C4 model |
|
|
249 | (3) |
|
9.2 Understanding architectural drivers of a data product design |
|
|
252 | (7) |
|
|
252 | (3) |
|
Capturing architectural drivers for a data-product design |
|
|
255 | (4) |
|
9.3 Designing the future architecture of a data product and related systems |
|
|
259 | (24) |
|
|
260 | (1) |
|
File-based data product: Spreadsheet |
|
|
260 | (6) |
|
From monolith and microservice to a data product |
|
|
266 | (9) |
|
Exposing data for stream processing and batch processing |
|
|
275 | (8) |
Appendix A |
|
283 | (1) |
Appendix B |
|
284 | (1) |
Appendix C |
|
285 | (3) |
Appendix D |
|
288 | (3) |
Index |
|
291 | |