Skip to main content

Repository → 💼 AWS Well-Architected → 💼 Reliability → 💼 Failure management

💼 Design your workload to withstand component failures

  • ID: /frameworks/aws-well-architected/reliability/failure-management/rel11

Description

Workloads with a requirement for high availability and low mean time to recovery (MTTR) must be architected for resiliency.

Similar

Sub Sections

SectionSub SectionsInternal RulesPoliciesFlagsCompliance
💼 REL11-BP01 Monitor all components of the workload to detect failuresno data
💼 REL11-BP02 Fail over to healthy resourcesno data
💼 REL11-BP03 Automate healing on all layersno data
💼 REL11-BP04 Rely on the data plane and not the control plane during recoveryno data
💼 REL11-BP05 Use static stability to prevent bimodal behaviorno data
💼 REL11-BP06 Send notifications when events impact availabilityno data
💼 REL11-BP07 Architect your product to meet availability targets and uptime service level agreements (SLAs)no data