Bifrost Enterprise sizing has two independent axes you can use to start from: the number of gateway instances you want to run, or the target RPS you need to serve. Pick whichever you have a number for - the other can be derived once the cluster is live.Documentation Index
Fetch the complete documentation index at: https://docs.getbifrost.ai/llms.txt
Use this file to discover all available pages before exploring further.
Gateway pods
For production deployments, run at least 3 Bifrost instances so that losing any one pod (rolling deploy, AZ failure, node eviction) still leaves quorum and active capacity.| Setting | Recommended baseline |
|---|---|
| Pod count | 3 (minimum for HA; scale horizontally from here) |
| vCPU per pod | 4 |
| RAM per pod | 16 GB |
| Topology | Spread pods across AZs or failure domains |
There is no Bifrost-specific reason to run fewer than 3 pods in production. Two-pod setups lose quorum during a single-node restart, and single-pod setups have no failure budget for rolling upgrades.
PostgreSQL
Any production-grade PostgreSQL distribution works: Amazon RDS, Aurora PostgreSQL, Google Cloud SQL, AlloyDB, Azure Database for PostgreSQL, Crunchy Bridge, or self-managed PG 16+. The right hardware depends on whether you offload large request/response payloads to object storage.Default sizing (PostgreSQL holds logs)
| Setting | Recommended |
|---|---|
| vCPU | 8 |
| RAM | 24 GB |
| Storage | SSD / gp3-class, sized for retention |
| Replication | Hot standby in a second AZ |
With object storage for large logs
| Setting | Recommended |
|---|---|
| vCPU | 8 |
| RAM | 16 GB |
| Storage | SSD / gp3-class, sized for retention |
| Object store | S3, GCS, Azure Blob, or compatible |

