Skip to content

HA + DR: failover playbooks, backup automation, RTO/RPO runbooks #356

@mikemcdougall

Description

@mikemcdougall

Context

Enterprise customers require documented and automated disaster recovery capabilities.

Scope

  • Active-passive failover playbooks (automated health check → failover trigger)
  • PostgreSQL backup automation (pg_basebackup + WAL archiving + point-in-time recovery)
  • Redis backup/restore for cache state
  • RTO/RPO documentation and validation testing
  • Terraform modules for multi-region deployment
  • Admin UI: backup status, last successful backup, recovery test results

References

  • ADR-0024: Enterprise tier feature (Deployment Options pillar)

Metadata

Metadata

Assignees

No one assigned

    Labels

    area/infrastructureDeployment, Terraform, Helm, CIedition/enterpriseEnterprise edition featureeffort/L🌳 L: 1-2 days (complex feature, multiple components)enhancementNew feature or requestphase/GAGA scopepriority/P3📋 Low priority - nice to have in phase, can be deferred

    Type

    No type

    Projects

    No projects

    Milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions