> ## Documentation Index
> Fetch the complete documentation index at: https://docs.getbifrost.ai/llms.txt
> Use this file to discover all available pages before exploring further.

# In-VPC Deployments

> Deploy Bifrost within your private cloud infrastructure with VPC isolation, custom networking, and enhanced security controls for enterprise environments.

In-VPC (Virtual Private Cloud) deployments allow you to run Bifrost entirely within your private cloud infrastructure, providing maximum security, compliance, and control over your AI gateway deployment.

## Supported Cloud Providers

Bifrost supports INVPC deployments across all major cloud providers:

<div className="grid grid-cols-2 md:grid-cols-3 gap-4 my-6">
  <div className="flex items-center gap-2 p-3 border rounded-lg">
    <span>Google Cloud Platform</span>
  </div>

  <div className="flex items-center gap-2 p-3 border rounded-lg">
    <span>Amazon Web Services</span>
  </div>

  <div className="flex items-center gap-2 p-3 border rounded-lg">
    <span>Microsoft Azure</span>
  </div>

  <div className="flex items-center gap-2 p-3 border rounded-lg">
    <span>Cloudflare</span>
  </div>

  <div className="flex items-center gap-2 p-3 border rounded-lg">
    <span>Vercel</span>
  </div>
</div>

## Architecture Benefits

### Security & Compliance

* **Network Isolation**: Complete isolation within your VPC with no external network dependencies
* **Data Sovereignty**: All data processing occurs within your controlled environment
* **Compliance Ready**: Meets requirements for HIPAA, SOC2, GDPR, and other regulatory frameworks
* **Zero Trust Architecture**: Implements principle of least privilege with granular access controls

### Performance & Reliability

* **Low Latency**: Direct communication between services within your network
* **High Availability**: Multi-zone deployment with automatic failover capabilities
* **Guaranteed Uptime**: 99.95% SLA with comprehensive monitoring and alerting

### Control & Customization

* **Custom Networking**: Configure subnets, routing, and security groups to your specifications
* **Resource Management**: Full control over compute, storage, and network resources
* **Scaling Policies**: Define auto-scaling rules based on your usage patterns

## Service Level Agreement

### Availability Commitment

* **Uptime Guarantee**: 99.95% monthly uptime for all core components
* **Downtime Calculation**: `(Total Minutes - Downtime Minutes) / Total Minutes × 100`
* **Partial Downtime**: Reduced functionality counted as 50% downtime

### Core Components Covered

The following components are monitored for SLA compliance:

* Gateway instance
* Log ingestion pipeline

### Exclusions

SLA excludes downtime due to:

* Scheduled maintenance (14-day advance notice)
* Downstream provider incidents
* Client hardware/software/network issues
* Third-party AI provider outages
* Client misuse or unauthorized modifications

## Support & Maintenance

### Technical Support

* **24/7 Critical Support**: Available for core component issues
* **Multiple Channels**: Platform, email ([contact@getmaxim.ai](mailto:contact@getmaxim.ai)), or Slack Connect
* **Audit Trail**: Detailed logs for any data access during troubleshooting

### Maintenance Windows

* **Scheduled Maintenance**: 14-day advance notice for major updates
* **Security Patches**: Immediate or 14-day delayed application (your choice)
* **Continuous Updates**: Regular feature improvements with 7-day advance notice

## Getting Started

### Prerequisites

* VPC with appropriate CIDR ranges
* Kubernetes cluster (GKE, EKS, or AKS)
* Container registry access
* DNS configuration for internal routing

### Deployment Process

1. **Infrastructure Setup**: Configure VPC, subnets, and security groups
2. **Cluster Preparation**: Set up Kubernetes cluster with required permissions
3. **Bifrost Installation**: Deploy using provided Helm charts or manifests
4. **Configuration**: Apply your specific settings and integrations
5. **Validation**: Run connectivity and performance tests
6. **Go Live**: Begin routing production traffic

## Cost Optimization

### Resource Sizing

* **Development**: 2 vCPU, 4GB RAM minimum
* **Production**: 4+ vCPU, 8GB+ RAM recommended
* **High Availability**: Multi-zone deployment with load balancing

### Scaling Strategies

* **Horizontal Pod Autoscaling**: Based on CPU/memory utilization
* **Vertical Pod Autoscaling**: Automatic resource adjustment
* **Cluster Autoscaling**: Node pool expansion/contraction
