Tagged articles
2 articles
Xiaohe Frontend Team
Nov 15, 2022 · Operations

Mastering Incident Postmortems: Turn Failures into Learning Opportunities

This article explains why thorough, blameless incident postmortems are essential, outlines when to initiate them, describes the key components of an effective review, and offers practical steps to transform each outage into a continuous‑improvement opportunity for engineering teams.

Blameless Culture · Incident Management · Root Cause Analysis
0 likes · 6 min read
DevOpsClub
May 11, 2018 · Operations

How Anti‑Fragility and GameDays Turn System Failures into Growth

This article explores anti‑fragility theory and real‑world DevOps practices such as Phoenix Server, Chaos Monkey, GameDays, and blameless post‑mortems, showing how organizations can transform inevitable failures into opportunities for resilience and continuous improvement.

Anti-Fragility · Blameless Culture · Operations
0 likes · 11 min read