Operational Excellence Assessment Guidance 11 How Do You Address Deployment Failures
A workload must have a deployment failure mitigation strategy defined so that unexpected mid-rollout issues can be addressed with rapid recovery. This typically combines multiple approaches such as rollback, feature disablement, or using deployment patterns.
Robust testing and monitoring practices are in place. Telemetry is used to correlate user issues with deployment phases and respond promptly to user-reported issues.
**NOTE - **
This process considers factors like the methods available for bypassing the issue, effort required for a fix, and the criticality of the workload.
**NOTE - **
You are well-versed in various mitigation strategies, including rollback, fallback, bypassing offending functions, and emergency deployments. You have clear guidelines on when to use each strategy.
**NOTE - **
You have a structured communication plan in place, ensuring that stakeholders and end-users are informed promptly during a deployment incident. Your status update cadence is standardized, and all relevant parties are aware of it.
**NOTE - **
After every failed deployment, there’s a post-mortem to identify areas for improvement. You view every failure as a learning opportunity and have a structured process for implementing the improvements identified.
**NOTE - **