Skip to main content
Truefoundry comprises of a split-plane architecture that comprises of a control-plane as the orchestration layer and the compute, gateway and data planes as the execution and data layer.
Truefoundry Platform Architecture

Truefoundry Platform Architecture

The key components in the architecture are:
  1. Global Authentication/Licensing Server: This is a global licensing/authentication server. It is used to authenticate every user who logs into the control-plane. This is useful for Truefoundry to track the number of users in the platform and control licensing at a central layer. The global authentication server cannot be shipped to customer’s infrastructure.
The only information being passed from the control-plane to the global authentication server is the emails of the users logging into the Truefoundry platform. To understand how SSO works with our central authentication server, refer to this page.
  1. Control Plane: The control plane is where the users/developers log in to manage and monitor compute, models, pipelines, services. It stores all the configuration data and interacts with the compute, gateway and data planes as well as external services like Docker Registry, Secret Manager, etc. To read more on what the control plane comprises of, check Control Plane Architecture.
Truefoundry provides a hosted and managed control-plane to which you can attach your own compute and data plane. The control-plane can also be hosted within your own VPC as part of our enterprise plan.
  1. Compute Plane: The compute plane is the actual Kubernetes cluster where the models, services, jobs and pipelines deployed by the users run as part of the AI engineering module. You don’t need the compute plane if you are only using the AI gateway features. To read more on what the compute plane comprises of, refer to Compute Plane Architecture.
Truefoundry doesn’t provide compute plane and you need to bring your own cloud account or on-prem Kubernetes cluster.
  1. Gateway Plane: This is the LLM/MCP proxy that sits between the users and the LLM/MCP servers. It is responsible for providing unified API access to LLMs and MCP servers with enterprise-grade security and observability. To read more on what the gateway plane comprises of, refer to Gateway Plane Architecture.
Truefoundry provides a managed and globally distributed gateway plane. It can also be deployed on your own infrastructure.
  1. Data Plane: The data plane stores the models and artifacts for the AI engineering module. You can bring your own blob storage to store the models/artifacts or use the Truefoundry managed one.
I