doc/administration/dedicated/disaster_recovery.md
{{< details >}}
{{< /details >}}
GitLab Dedicated provides automatic disaster recovery to restore your instance if your primary region becomes unavailable. To be eligible for the full recovery objectives:
If no secondary region is configured, recovery is limited to backup restoration.
GitLab Dedicated provides disaster recovery with these recovery objectives:
When you create your instance, you select a primary region and a secondary region for your environment. Geo continuously replicates data between these regions, including:
GitLab performs automated backups of all GitLab Dedicated datastores (including databases and Git repositories) every four hours (six times daily) by creating snapshots.
Backups are tested, retained for 30 days, and stored in your chosen secondary region. They are also geographically replicated by AWS for additional protection.
Database backups:
Object storage backups use geographical replication and versioning to provide backup protection.
The four-hour backup frequency supports the Recovery Point Objective (RPO) to ensure you lose no more than four hours of data.
Disaster recovery covers these scenarios with guaranteed recovery objectives:
These scenarios are covered on a best-effort basis without guaranteed recovery objectives:
Disaster recovery has these service limitations:
GitLab does not provide:
When your instance becomes unavailable due to a complete region failure or critical component failure that cannot be quickly recovered, the GitLab Dedicated team:
<customer>.gitlab-dedicated.com to point to the newly
promoted region.If you use PrivateLink, you must update your internal networking configuration to target the PrivateLink endpoint for the secondary region. To minimize downtime, configure equivalent PrivateLink endpoints in your secondary region before a disaster occurs.
The failover process typically completes in 90 minutes or less. Throughout the process, GitLab communicates with you through one or more of:
GitLab may establish a temporary Slack channel and Zoom bridge to coordinate with your team throughout the recovery process.