https://gateway.truefoundry.ai.
gateway.truefoundry.ai is the unified endpoint for both the AI Model Gateway and the MCP Gateway. Whether you are routing LLM inference requests (OpenAI-compatible API, etc.) or connecting to MCP (Model Context Protocol) servers, all traffic goes through the same globally distributed infrastructure. This means MCP Gateway deployments benefit from the same multi-region, multi-cloud availability described on this page.
Features
- Globally Distributed: Deployed across more than 12 regions around the globe and across 3 cloud providers to maximize availability and minimize latency.
- Automated Failover: All traffic is routed to the nearest gateway for minimum latency. In case of regional downtime, traffic is automatically routed to the closest healthy region, ensuring uninterrupted service.
- Multi-Cloud Deployment: Distributed across multiple cloud providers to tolerate provider-specific disruptions.
- Data Encryption: Data is encrypted at rest and in transit.
- Compliance: Truefoundry infrastructure is SOC2, ISO27001, GDPR, and HIPAA compliant.
Architecture
The SAAS global deployment follows the same Gateway Plane Architecture used across all Truefoundry deployments. It consists of two key components:
- Control Plane: Manages all gateway configuration including models, users, teams, virtual accounts, rate-limiting, and routing configs. The SAAS control plane is hosted in Ireland (Europe).
- Gateway Planes: Stateless, horizontally scalable gateway instances that handle all production traffic (LLM requests, MCP requests, etc.). These are deployed across the regions listed in the Regional Deployments section below.
The specific regions and locations where gateway planes are deployed are subject to change based on Truefoundry’s internal infrastructure needs. Regions may be added, removed, or relocated without prior notice.
Global Deployment
For most use cases, we recommend using the global endpoint, which automatically routes to the nearest healthy gateway:
| Deployment | Global Endpoint |
|---|---|
| Global (Auto-routed) | https://gateway.truefoundry.ai |
Regional Deployments
Each gateway region has its own URL and associated metadata. Every request routed through the SaaS gateway is automatically enriched with the tfy_gateway_region and tfy_gateway_zone metadata keys, which identify the gateway region and zone that handled the request. The Region and Zone columns in the table below show the values these keys will contain.
| Physical Location | Cloud Provider | Region | Zone | Regional Endpoint |
|---|---|---|---|---|
| North Virginia, United States (ORF) | AWS | US | ORF | https://orf.gateway.truefoundry.ai |
| San Francisco, United States (SFO) | Azure | US | SFO | https://sfo.gateway.truefoundry.ai |
| Dallas, Texas, United States (DFW) | GCP | US | DFW | https://dfw.gateway.truefoundry.ai |
| Toronto, Canada (YYZ) | GCP | CA | YYZ | https://yyz.gateway.truefoundry.ai |
| Sao Paulo, Brazil (GRU) | GCP | SA | GRU | https://gru.gateway.truefoundry.ai |
| London, United Kingdom (LHR) | AWS | EU | LHR | https://lhr.gateway.truefoundry.ai |
| Madrid, Spain (MAD) | GCP | EU | MAD | https://mad.gateway.truefoundry.ai |
| Gavle, Sweden (GVX) | Azure | EU | GVX | https://gvx.gateway.truefoundry.ai |
| Cape Town, South Africa (CPT) | AWS | AF | CPT | https://cpt.gateway.truefoundry.ai |
| Doha, Qatar (DIA) | GCP | US | DIA | https://dia.gateway.truefoundry.ai |
| Mumbai, India (BOM) | AWS | IN | BOM | https://bom.gateway.truefoundry.ai |
| Singapore, Singapore (SIN) | AWS | AP | SIN | https://sin.gateway.truefoundry.ai |
| Melbourne, Australia (MEL) | AWS | AU | MEL | https://mel.gateway.truefoundry.ai |
| Sydney, Australia (SYD) | AWS | AU | SYD | https://syd.gateway.truefoundry.ai |
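As a rough illustration of how the table above might be used from client code, the sketch below builds a regional endpoint URL from a zone code, following the `https://<zone>.gateway.truefoundry.ai` pattern visible in the Regional Endpoint column. The names `KNOWN_ZONES` and `regional_endpoint` are hypothetical and not part of any Truefoundry SDK, and the zone list is simply copied from the table, so it can go stale when regions change.

```python
from typing import Optional

GLOBAL_ENDPOINT = "https://gateway.truefoundry.ai"

# Zone codes copied from the Regional Deployments table (illustrative snapshot;
# the documented list of regions may change without notice).
KNOWN_ZONES = frozenset({
    "ORF", "SFO", "DFW", "YYZ", "GRU", "LHR", "MAD",
    "GVX", "CPT", "DIA", "BOM", "SIN", "MEL", "SYD",
})


def regional_endpoint(zone: Optional[str] = None) -> str:
    """Return the endpoint for a zone code, or the global endpoint if none is given."""
    if zone is None:
        return GLOBAL_ENDPOINT
    code = zone.strip().upper()
    if code not in KNOWN_ZONES:
        raise ValueError(f"unknown gateway zone: {zone!r}")
    # Regional hostnames use the lowercase zone code as a subdomain.
    return f"https://{code.lower()}.gateway.truefoundry.ai"
```

Falling back to the global endpoint when no zone is given mirrors the recommendation to prefer the auto-routed endpoint unless a specific region is required.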
Multi-regional Deployments
Multi-regional endpoints automatically route your requests to the closest healthy gateway within a specific geographic region. If all regional locations are unavailable, traffic is routed to the designated fallback regions.
| Region | Multi-regional Endpoint | Primary Locations | Fallback Locations |
|---|---|---|---|
| United States | https://us.gateway.truefoundry.ai | North Virginia (ORF), San Francisco (SFO), Dallas (DFW) | Toronto, Canada (YYZ) |
| Europe | https://eu.gateway.truefoundry.ai | London (LHR), Madrid (MAD), Gavle (GVX) | Doha, Qatar (DIA) |
| Australia | https://au.gateway.truefoundry.ai | Sydney (SYD), Melbourne (MEL) | Singapore (SIN) |
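The routing rule described above (a healthy primary location first, fallback locations only when every primary is unavailable) can be modelled in a few lines. This is a sketch of the documented behaviour, not Truefoundry's actual routing implementation: `MULTI_REGION`, `pick_location`, and the `healthy` argument are hypothetical, and the order of each list merely stands in for "closest", which the real gateway determines from the caller's network location.

```python
from typing import Iterable, Optional

# Primary and fallback zone codes per multi-regional endpoint,
# copied from the table above (illustrative snapshot).
MULTI_REGION = {
    "us": {"primary": ["ORF", "SFO", "DFW"], "fallback": ["YYZ"]},
    "eu": {"primary": ["LHR", "MAD", "GVX"], "fallback": ["DIA"]},
    "au": {"primary": ["SYD", "MEL"], "fallback": ["SIN"]},
}


def pick_location(region: str, healthy: Iterable[str]) -> Optional[str]:
    """Return the first healthy primary zone, else the first healthy fallback zone.

    Returns None when no listed zone is healthy.
    """
    healthy_set = set(healthy)
    entry = MULTI_REGION[region]
    # Primaries are tried before fallbacks, so a fallback is chosen
    # only when every primary location is unavailable.
    for zone in entry["primary"] + entry["fallback"]:
        if zone in healthy_set:
            return zone
    return None
```

For example, `pick_location("eu", {"DIA"})` selects the Doha fallback only because none of the three European primaries is in the healthy set.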
Gateway Status Monitoring
To track the status of each gateway deployment and receive real-time updates on service availability, visit the Gateway Status Page at status.truefoundry.com. You can expand the AI Gateway section to see per-region uptime.
Subscribe to Status Updates
Stay informed about gateway availability by subscribing to status notifications:
- Visit the Gateway Status Page
- Click the Get Updates button in the top right
- Choose your preferred notification method:
  - Email notifications
  - RSS feed
  - Custom webhook

Connecting Your Private Models or MCP Servers to the Gateway
If your models or MCP servers run inside a private network (a VPC, on-prem cluster, etc.), the SAAS gateway needs a network path to reach them without exposing them to the public internet. See Connect Private Models and MCP Servers for the supported approaches.
FAQ
What is the round trip latency to the SAAS gateway?
Your client is automatically routed to the closest gateway region, so the round trip time (RTT) from your application to the gateway typically ranges from 20–50 ms.
If you are seeing higher latencies, please let us know and we will be happy to add another region closer to your use case.
If you are self-hosting the gateway within your own infrastructure, the RTT from your application to the gateway will be on the order of ~1 ms when both are running in the same cluster.
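To check what round trip time your own application sees, one rough approach is to time TCP connections to the gateway host. The sketch below uses only the Python standard library. `tcp_connect_ms` and `median_rtt_ms` are hypothetical helper names, and a TCP handshake is only an approximation of network RTT: it excludes TLS negotiation and request processing, and the first sample may include DNS resolution time.

```python
import socket
import statistics
import time
from typing import Callable


def tcp_connect_ms(host: str = "gateway.truefoundry.ai",
                   port: int = 443,
                   timeout: float = 5.0) -> float:
    """Time one TCP connection setup to host:port, in milliseconds."""
    start = time.perf_counter()
    # create_connection resolves the host and completes the TCP handshake.
    with socket.create_connection((host, port), timeout=timeout):
        pass
    return (time.perf_counter() - start) * 1000.0


def median_rtt_ms(probe: Callable[[], float] = tcp_connect_ms,
                  samples: int = 5) -> float:
    """Run the probe several times and return the median, which dampens outliers."""
    if samples < 1:
        raise ValueError("samples must be at least 1")
    return statistics.median(probe() for _ in range(samples))
```

Calling `median_rtt_ms()` from the machine that runs your application gives a figure you can compare against the typical range quoted above. Passing a different `probe` lets you time a regional host instead.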