Muutke küpsiste eelistusi

SLIs and SLOs Demystified: A workshop approach to building and maintaining your service level indicators and service level objectives [Pehme köide]

  • Formaat: Paperback / softback, 300 pages, kõrgus x laius: 235x191 mm
  • Ilmumisaeg: 25-Apr-2025
  • Kirjastus: Packt Publishing Limited
  • ISBN-10: 1835889395
  • ISBN-13: 9781835889381
  • Formaat: Paperback / softback, 300 pages, kõrgus x laius: 235x191 mm
  • Ilmumisaeg: 25-Apr-2025
  • Kirjastus: Packt Publishing Limited
  • ISBN-10: 1835889395
  • ISBN-13: 9781835889381
Master reliability engineering with SLIs and SLOs to optimize performance, enhance observability, and make data-driven decisions

Key Features

Design precise SLIs and SLOs tailored to different system architectures and reliability goals Master observability techniques and incident management strategies to proactively detect and resolve issues Build scenario-based SLIs and SLOs with hands-on guidance for real-world reliability engineering Purchase of the print or Kindle book includes a free PDF eBook

Book DescriptionIn today's digital landscape, ensuring service reliability is more than just a necessityits a competitive advantage. SLIs and SLOs Demystified equips software engineers, SREs, and business leaders with the knowledge to build, measure, and manage service level indicators (SLIs) and service level objectives (SLOs) efficiently. Written by Alexandra F. McCoyan experienced site reliability engineer with over a decade of experience in the cloud and technology industrythis book simplifies complex reliability concepts for engineers at all levels. Starting with a review of reliability engineering basics, Alexandra provides a step-by-step approach to defining impactful SLIs, facilitating productive SLO discussions, and integrating observability into your monitoring strategy. You'll also see how these principles apply to web applications, distributed systems, databases, and new features through real-world examples that can help you develop SLIs and SLOs for your specific environment. The book goes beyond implementation to explore the financial impact of reliability, alerting strategies, integration with incident management, and using error budgets for business decisions. By the end of this book, youll be able to drive operational excellence, minimize unplanned downtime, and optimize end user experiences with well-established reliability metrics.What you will learn

Formulate and implement SLIs and SLOs for assessing and enhancing system reliability objectives Manage incidents proactively using observability and monitoring Create adequate reliability metrics for complex systems Refine incident response strategies to minimize associated risks Align reliability objectives with business and technical goals Implement strong reliability practices across multiple teams and services Integrate reliability engineering with DevOps and site reliability engineering practices

Who this book is forThis book is designed for site reliability engineers (SREs), DevOps engineers, software engineers, product managers, and business leaders looking to enhance service reliability to ensure their applications meet performance expectations. Basic knowledge of cloud services, system monitoring, and software engineering principles is beneficial.
Table of Contents

SLIs and SLOs at the Heart of Reliability
Establishing an SLI and SLO Team
Things to Consider When Crafting Your SLIs and SLOs
Observability and Monitoring Are a Necessity and a Must
The Financial Impact of Not Adopting Indicators
Workshop Preparation: Structuring the SLI and SLO Conversation
Scenario 1: SLIs and SLOs for Web Applications
Scenario 2: SLIs and SLOs for Distributed Systems
Scenario 3: Optimizing SLIs and SLOs for Database Performance
Scenario 4: Developing SLIs and SLOs for New Features
SLO Monitoring and Alerting
Service Level Performance Metrics: Daily Operations
SLO Preservation and Incident Management
SLIs and SLOs as a Service
Alexandra F. McCoy has worked within the software and technology industry, in various roles, for the last 12 years. She spent a portion of that time as a site reliability engineer. Much of her experience was spent within the cloud sector, including hybrid cloud and on-premises Kubernetes environments, implementing cloud-native solutions for container orchestration. She enjoys the practice of reliability engineering, cloud-native development, and container orchestration as they relate to architecting solutions for customers within various industries. She spends her free time with family & close friends, and dedicates time to mentor junior engineers and professionals, with aspirational goals of successfully developing within the technology field.