Introduction |
|
ix | |
|
Part I SRE Implementation |
|
|
|
1 Context Versus Control in SRE |
|
|
3 | (12) |
|
2 Interviewing Site Reliability Engineers |
|
|
15 | (10) |
|
3 So, You Want to Build an SRE Team? |
|
|
25 | (8) |
|
4 Using Incident Metrics to Improve SRE at Scale |
|
|
33 | (10) |
|
5 Working with Third Parties Shouldn't Suck |
|
|
43 | (22) |
|
6 How to Apply SRE Principles Without Dedicated SRE Teams |
|
|
65 | (16) |
|
7 SRE Without SRE: The Spotify Case Study |
|
|
81 | (30) |
|
8 Introducing SRE in Large Enterprises |
|
|
111 | (12) |
|
9 From SysAdmin to SRE in 8,963 Words |
|
|
123 | (24) |
|
10 Clearing the Way for SRE in the Enterprise |
|
|
147 | (30) |
|
11 SRE Patterns Loved by DevOps People Everywhere |
|
|
177 | (10) |
|
12 DevOps and SRE: Voices from the Community |
|
|
187 | (20) |
|
13 Production Engineering at Facebook |
|
|
207 | (26) |
|
|
|
14 In the Beginning, There Was Chaos |
|
|
233 | (12) |
|
15 The Intersection of Reliability and Privacy |
|
|
245 | (12) |
|
16 Database Reliability Engineering |
|
|
257 | (18) |
|
17 Engineering for Data Durability |
|
|
275 | (18) |
|
18 Introduction to Machine Learning for SRE |
|
|
293 | (32) |
|
Part III SRE Best Practices and Technologies |
|
|
|
19 Do Docs Better: Integrating Documentation into the Engineering Workflow |
|
|
325 | (18) |
|
20 Active Teaching and Learning |
|
|
343 | (12) |
|
21 The Art and Science of the Service-Level Objective |
|
|
355 | (10) |
|
22 SRE as a Success Culture |
|
|
365 | (14) |
|
|
379 | (28) |
|
24 Immutable Infrastructure and SRE |
|
|
407 | (8) |
|
25 Scriptable Load Balancers |
|
|
415 | (18) |
|
26 The Service Mesh: Wrangler of Your Microservices? |
|
|
433 | (20) |
|
Part IV The Human Side of SRE |
|
|
|
27 Psychological Safety in SRE |
|
|
453 | (12) |
|
|
465 | (26) |
|
|
491 | (20) |
|
30 Against On-Call: A Polemic |
|
|
511 | (22) |
|
31 Elegy for Complex Systems |
|
|
533 | (8) |
|
32 Intersections Between Operations and Social Activism |
|
|
541 | (18) |
|
|
559 | (2) |
Index |
|
561 | |