Frappe Cloud undertakes multiple measures to ensure reliability:
Infrastructure & Operations
- Runs on established cloud providers such as AWS and OCI. Uptime commitments for all providers on Frappe Cloud can be seen here.
- Continuous monitoring of infrastructure and platform services by Frappe engineers.
- Incident tracking, root-cause analysis, and preventive fixes to reduce repeat failures.
- 24×7 incident response support for critical infrastructure issues.
High Availability
- Proactive detection and resolution of infrastructure issues before they impact users.
- Resource monitoring and operational alerts help identify issues early.
Overall, Frappe Cloud maintains a best-effort uptime of 99.9%.
Reliable Deployments & Upgrades
- Automated and repeatable deployment workflows reduce human error.
- Most deployments are designed for zero or near-zero downtime.
- Large upgrades can be scheduled during non-working hours.
- Automated backups are created before upgrades and migrations.
- Rollback mechanisms allow recovery to a known working state if a deployment fails.
Backup & Disaster Recovery
- Automatic daily site backups.
- Automatic server snapshots.
- Configurable backup schedules and retention policies.
- Off-site backup support on eligible plans.
- Backup files can be downloaded for additional protection.
- Server snapshots stored separately help support disaster recovery scenarios.
Recovery & Failover
- Automated recovery workflows help restore services quickly after failures.
- Systems can be restored from recent backups when required.
- Automated fallback mechanisms reduce recovery effort and downtime.
- Disaster recovery procedures are designed to minimise both recovery time and data loss.
Observability & Monitoring
- Real-time visibility into resource utilisation and infrastructure health.
- Monitoring of storage consumption trends and performance metrics.
- Built-in alerts notify users when resource thresholds are approaching limits.
- Developer tools provide deeper diagnostics when troubleshooting is required.
Capacity Planning
- Resource utilisation analytics help identify growth trends before they become bottlenecks.
- Threshold-based alerts allow teams to scale infrastructure proactively.
- Visibility into usage patterns helps prevent outages caused by resource exhaustion.
Watch this video to know more:
Last updated 4 days ago
Was this helpful?