Muutke küpsiste eelistusi

E-raamat: 97 Things Every SRE Should Know

  • Formaat: 252 pages
  • Ilmumisaeg: 16-Nov-2020
  • Kirjastus: O'Reilly Media
  • Keel: eng
  • ISBN-13: 9781492081449
  • Formaat - EPUB+DRM
  • Hind: 47,96 €*
  • * hind on lõplik, st. muud allahindlused enam ei rakendu
  • Lisa ostukorvi
  • Lisa soovinimekirja
  • See e-raamat on mõeldud ainult isiklikuks kasutamiseks. E-raamatuid ei saa tagastada.
  • Formaat: 252 pages
  • Ilmumisaeg: 16-Nov-2020
  • Kirjastus: O'Reilly Media
  • Keel: eng
  • ISBN-13: 9781492081449

DRM piirangud

  • Kopeerimine (copy/paste):

    ei ole lubatud

  • Printimine:

    ei ole lubatud

  • Kasutamine:

    Digitaalõiguste kaitse (DRM)
    Kirjastus on väljastanud selle e-raamatu krüpteeritud kujul, mis tähendab, et selle lugemiseks peate installeerima spetsiaalse tarkvara. Samuti peate looma endale  Adobe ID Rohkem infot siin. E-raamatut saab lugeda 1 kasutaja ning alla laadida kuni 6'de seadmesse (kõik autoriseeritud sama Adobe ID-ga).

    Vajalik tarkvara
    Mobiilsetes seadmetes (telefon või tahvelarvuti) lugemiseks peate installeerima selle tasuta rakenduse: PocketBook Reader (iOS / Android)

    PC või Mac seadmes lugemiseks peate installima Adobe Digital Editionsi (Seeon tasuta rakendus spetsiaalselt e-raamatute lugemiseks. Seda ei tohi segamini ajada Adober Reader'iga, mis tõenäoliselt on juba teie arvutisse installeeritud )

    Seda e-raamatut ei saa lugeda Amazon Kindle's. 

Site reliability engineering (SRE) is more relevant than ever. Knowing how to keep systems reliable has become a critical skill. With this practical book, newcomers and old hats alike will explore a broad range of conversations happening in SRE. You'll get actionable advice on several topics, including how to adopt SRE, why SLOs matter, when you need to upgrade your incident response, and how monitoring and observability differ.

Editors Jaime Woo and Emil Stolarsky, co-founders of Incident Labs, have collected 97 concise and useful tips from across the industry, including trusted best practices and new approaches to knotty problems. You'll grow and refine your SRE skills through sound advice and thought-provoking questions that drive the direction of the field.

Some of the 97 things you should know:

  • Test Your Disaster Plan--Tanya Reilly
  • Integrating Empathy into Tools--Daniella Niyonkuru
  • The Best Advice I Can Give To Teams--Nicole Forsgren
  • Where to SRE--Fatema Boxwala
  • Facing Your First Page--Andrew Louis
  • I Have an Error Budget, Now What --Alex Hidalgo
  • Get Your Work Recognized: Write a Brag Document--Julia Evans and Karla Burnett
Preface xiii
Part I New to SRE
1 Site Reliability Engineering in Six Words
2(2)
Alex Hidalgo
2 Do We Know Why We Really Want Reliability?
4(2)
Niall Murphy
3 Building Self-Regulating Processes
6(2)
Denise Yu
4 Four Engineers of an SRE Seder
8(2)
Jacob Scott
5 The Reliability Stack
10(2)
Alex Hidalgo
6 Infrastructure: It's Where the Power Is
12(2)
Charity Majors
7 Thinking About Resilience
14(2)
Justin Li
8 Observability in the Development Cycle
16(2)
Charity Majors
Liz Fong-Jones
9 There Is No Magic
18(2)
Bouke van der Bijl
10 How Wikipedia Is Served to You
20(2)
Effie Mouzeli
11 Why You Should Understand (a Little) About TCP
22(2)
Julia Evans
12 The Importance of a Management Interface
24(2)
Salim Virji
13 When It Comes to Storage, Think Distributed
26(2)
Salim Virji
14 The Role of Cardinality
28(2)
Charity Majors
Liz Fong-Jones
15 Security Is like an Onion
30(2)
Lucas Fontes
16 Use Your Words
32(2)
Tanya Reilly
17 Where to SRE
34(2)
Fatema Boxwala
18 Dear Future Team
36(2)
Frances Rees
19 Sustainability and Burnout
38(2)
Denise Yu
20 Don't Take Advice from Graybeards
40(2)
John Looney
21 Facing That First Page
42(3)
Andrew Louis
Part II Zero to One
22 SRE, at Any Size, Is Cultural
45(2)
Matthew Huxtable
23 Everyone Is an SRE in a Small Organization
47(2)
Matthew Huxtable
24 Auditing Your Environment for Improvements
49(2)
Joan O'Callaghan
25 With Incident Response, Start Small
51(2)
Thai Wood
26 Solo SRE: Effecting Large-Scale Change as a Single Individual
53(2)
Ashley Poole
27 Design Goals for SLO Measurement
55(2)
Ben Sigelman
28 I Have an Error Budget-Now What?
57(2)
Alex Hidalgo
29 How to Change Things
59(2)
Joan O'Callaghan
30 Methodological Debugging
61(2)
Avishai Ish-Shalom
Nati Cohen
31 How Startups Can Build an SRE Mindset
63(2)
Tamara Miner
32 Bootstrapping SRE in Enterprises
65(2)
Vanessa Yiu
33 It's Okay Not to Know, and It's Okay to Be Wrong
67(2)
Todd Palino
34 Storytelling Is a Superpower
69(2)
Anita Clarke
35 Get Your Work Recognized: Write a Brag Document
71(3)
Julia Evans
Karla Burnett
Part III One to Ten
36 Making Work Visible
74(2)
Lorin Hochstein
37 An Overlooked Engineering Skill
76(2)
Murali Suriar
38 Unpacking the On-Call Divide
78(2)
Jason Hand
39 The Maestros of Incident Response
80(2)
Andrew Louis
40 Effortless Incident Management
82(2)
Suhail Patel
Miles Bryant
Chris Evans
41 If You're Doing Runbooks, Do Them Well
84(2)
Spike Lindsey
42 Why I Hate Our Playbooks
86(2)
Frances Rees
43 What Machines Do Well
88(2)
Michelle Brush
44 Integrating Empathy into SRE Tools
90(3)
Daniella Niyonkuru
45 Using ChatOps to Implement Empathy
93(2)
Daniella Niyonkuru
46 Move Fast to Unbreak Things
95(2)
Michelle Brush
47 You Don't Know for Sure Until It Runs in Production
97(2)
Ingrid Epure
48 Sometimes the Fix Is the Problem
99(2)
Jake Pittis
49 Legendary
101(2)
Elise Gale
50 Metrics Are Not SLIs (The Measure Everything Trap)
103(2)
Brian Murphy
51 When SLOs Attack: Pathological SLOs and How to Fix Them
105(2)
Narayan Desai
52 Holistic Approach to Product Reliability
107(2)
Kristine Chen
Bart Ponurkiewicz
53 In Search of the Lost Time
109(2)
Ingrid Epure
54 Unexpected Lessons from Office Hours
111(2)
Tamara Miner
55 Building Tools for Internal Customers that They Actually Want to Use
113(2)
Vinessa Wan
56 It's About the Individuals and Interactions
115(2)
Vinessa Wan
57 The Human Baseline in SRE
117(2)
Effie Mouzeli
58 Remotely Productive or Productively Remote
119(2)
Avleen Vig
59 Of Margins and Individuals
121(2)
Kurt Andersen
60 The Importance of Margins in Systems
123(2)
Kurt Andersen
61 Fewer Spreadsheets, More Napkins
125(2)
Jacob Bednarz
62 Sneaking in Your DevOps Deliciously
127(2)
Vinessa Wan
63 Effecting SRE Cultural Changes in Enterprises
129(2)
Vanessa Yiu
64 To All the SREs I've Loved
131(2)
Felix Glaser
65 Complex: The Most Overloaded Word in Technology
133(3)
Laura Nolan
Part IV Ten to Hundred
66 The Best Advice I Can Give to Teams
136(2)
Nicole Forsgren
67 Create Your Supporting Artifacts
138(2)
Daria Barteneva
Eva Parish
68 The Order of Operations for Getting SLO Buy-In
140(2)
David K. Rensin
69 Heroes Are Necessary, but Hero Culture Is Not
142(2)
Lei Lopez
70 On-Call Rotations that People Want to Join
144(2)
Miles Bryant
Chris Evans
Suhail Patel
71 Study of Human Factors and Team Culture to Improve Pager Fatigue
146(2)
Daria Barteneva
72 Optimize for MTTBTB (Mean Time to Back to Bed)
148(2)
Spike Lindsey
73 Mitigating and Preventing Cascading Failures
150(2)
Rita Lu
74 On-Call Health: The Metric You Could Be Measuring
152(2)
Caitie McCaffrey
75 Helping Leaders Prioritize On-Call Health
154(2)
Caitie McCaffrey
76 The SRE as a Diplomat
156(2)
Johnny Boursiquot
77 The Forward-Deployed SRE
158(2)
Johnny Boursiquot
78 Test Your Disaster Plan
160(2)
Tanya Reilly
79 Why Training Matters to an SRE Practice and SRE Matters to Your Training Program
162(2)
Jennifer Petoff
80 The Power of Uniformity
164(2)
Chris Evans
Suhail Patel
Miles Bryant
81 Bytes per User Value
166(2)
Arshia Mufti
82 Make Your Engineering Blog a Priority
168(2)
Anita Clarke
83 Don't Let Anyone Run Code in Your Context
170(2)
John Looney
84 Trading Places: SRE and Product
172(2)
Shubheksha Jalan
85 You See Teams, I See Product
174(2)
Avleen Vig
86 The Performance Emergency Fund
176(2)
Dawn Parzych
87 Important but Not Urgent: Roadmaps for SREs
178(3)
Laura Nolan
Part V The Future of SRE
88 That 50% Thing
181(2)
Tanya Reilly
89 Following the Path of Safety-Critical Systems
183(2)
Heidy Khlaaf
90 Applicable and Achievable Static Analysis
185(2)
Heidy Khlaaf
91 The Importance of Formal Specification
187(2)
Hillel Wayne
92 Risk and Rot in Sociotechnical Systems
189(2)
Laura Nolan
93 SRE in Crisis
191(2)
Niall Murphy
94 Expected Risk Limitations
193(2)
Blake Bisset
95 Beyond Local Risk: Accounting for Angry Birds
195(2)
Blake Bisset
96 A Word from Software Safety Nerds
197(2)
J. Paul Reed
97 Incidents: A Window into Gaps
199(2)
Lorin Hochstein
98 The Third Age of SRE
201(2)
Bjorn "Beorn" Rabenstein
Contributors 203(22)
Index 225(7)
About the Editors 232
Emil Stolarsky is a site reliability engineer, who previously worked on caching, performance, & disaster recovery at Shopify and the internal Kubernetes platform at DigitalOcean. He is the program co-chair for SREcon EMEA 2019 and SREcon Americas West 2020, and contributed a chapter to the O'Reilly book "Seeking SRE."