Hoeppner / Sbaraglia

Erschienen: 08.10.2025

Mastering Site Reliability Engineering in Enterprise

A Complete Guide to Resilient Systems & Chaos Engineering

Apress

ISBN 9798868814471

Standardpreis


ca. 48,14 €

lieferbar, ca. 10 Tage

Preisangaben inkl. MwSt. Abhängig von der Lieferadresse kann die MwSt. an der Kasse variieren. Weitere Informationen

auch verfügbar als eBook (PDF) für 46,99 €

Bibliografische Daten

Fachbuch

Buch. Softcover

2025

45 s/w-Abbildungen.

In englischer Sprache

Umfang: xxii, 311 S.

Format (B x L): 17,8 x 25,4 cm

Verlag: Apress

ISBN: 9798868814471

Produktbeschreibung

Implement site reliability engineering (SRE) practices in an enterprise IT environment and manage its complete lifecycle. This book is a comprehensive guide designed to help site reliability engineers, DevOps teams, and platform engineers identify, address, and mitigate system vulnerabilities before they escalate into significant issues. The authors highlight the shift from IT as a cost centre to a core business function, emphasising the central role of developers and the need for speed and reliability. They detail the challenges of transitioning to SRE, including overcoming cultural resistance and legacy infrastructure limitations, while emphasising the importance of building resilience in systems and processes. Specific SRE capabilities like chaos engineering, observability, and toil management are explored, along with strategies for successful implementation, including building a Center of Excellence, selecting the right tools, and fostering a culture of collaboration and continuous improvement. Finally, the texts discuss emerging trends like the use of generative AI (GenAI) in SRE and the future evolution of chaos engineering. You'll learn how to integrate SRE practices into your existing enterprise tech operating model and see how these methodologies provide significant business value by reducing system downtime and enhancing operational stability. Additionally, this book will explore how GenAI can support SRE teams in planning, executing, and optimising reliability experiments and automating toil reduction and continuous improvement efforts. By the end of this book, you'll be fully equipped to build chaos engineering by SREs, run reliability-focused "game days" to improve observability, troubleshoot failure scenarios, and strengthen the digital resilience of your systems and teams.

Autorinnen und Autoren

Kundeninformationen

Provides a future-proofed roadmap to design, plan, and implement SRE Delivers actionable strategies to implement SRE methodologies and integrate chaos engineering Empowers readers to boost operational excellence, improve an enterprise, and elevate digital resilience

Produktsicherheit

Hersteller

Springer Nature Customer Service Center GmbH

ProductSafety@springernature.com

Topseller & Empfehlungen für Sie

Ihre zuletzt angesehenen Produkte

Rezensionen

Dieses Set enthält folgende Produkte:
    Auch in folgendem Set erhältlich:

    • nach oben

      Ihre Daten werden geladen ...