to select ↑↓ to navigate
Cloud

Cloud

Open in ChatGPT
Ask ChatGPT about this page
Open in Claude
Ask Claude about this page

Reliability

Frappe Cloud undertakes multiple measures to ensure reliability:

Infrastructure & Operations

  • Runs on established cloud providers such as AWS and OCI. Uptime commitments for all providers on Frappe Cloud can be seen here.
  • Continuous monitoring of infrastructure and platform services by Frappe engineers.
  • Incident tracking, root-cause analysis, and preventive fixes to reduce repeat failures.
  • 24×7 incident response support for critical infrastructure issues.

High Availability

  • Proactive detection and resolution of infrastructure issues before they impact users.
  • Resource monitoring and operational alerts help identify issues early.

Overall, Frappe Cloud maintains a best-effort uptime of 99.9%.

Reliable Deployments & Upgrades

  • Automated and repeatable deployment workflows reduce human error.
  • Most deployments are designed for zero or near-zero downtime.
  • Large upgrades can be scheduled during non-working hours.
  • Automated backups are created before upgrades and migrations.
  • Rollback mechanisms allow recovery to a known working state if a deployment fails.

Backup & Disaster Recovery

  • Automatic daily site backups.
  • Automatic server snapshots.
  • Configurable backup schedules and retention policies.
  • Off-site backup support on eligible plans.
  • Backup files can be downloaded for additional protection.
  • Server snapshots stored separately help support disaster recovery scenarios.

Recovery & Failover

  • Automated recovery workflows help restore services quickly after failures.
  • Systems can be restored from recent backups when required.
  • Automated fallback mechanisms reduce recovery effort and downtime.
  • Disaster recovery procedures are designed to minimise both recovery time and data loss.

Observability & Monitoring

  • Real-time visibility into resource utilisation and infrastructure health.
  • Monitoring of storage consumption trends and performance metrics.
  • Built-in alerts notify users when resource thresholds are approaching limits.
  • Developer tools provide deeper diagnostics when troubleshooting is required.

Capacity Planning

  • Resource utilisation analytics help identify growth trends before they become bottlenecks.
  • Threshold-based alerts allow teams to scale infrastructure proactively.
  • Visibility into usage patterns helps prevent outages caused by resource exhaustion.

Watch this video to know more:

Last updated 4 days ago
Was this helpful?
Thanks!