ActivateUserResponse
Union, One Of:AddonComponent
unsupported_cluster_types
typing_extensions.Annotated[(typing.Optional[typing.List[ClusterType]], Any)]
π RecommendationRecommendations
AddOnComponentSource
Repository URL of the source
Chart of the source
AddonComponentStatus
Health status of the addon
Sync status of the addon
Indicates whether the addon is installed
Ai21KeyAuth
API key must not be empty
Ai21Model
2 to 62 characters long alphanumeric word, may contain - in between, cannot start with a number
Model ID must not be empty
π ModelTypeSpecify the type of the AI21 model
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
Ai21ProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
List of integrations that are associated with the AI21 provider account
π CollaboratorList of users who have access to this provider account
AiFeaturesSettings
Enable/Disable AI features.
Other fields only apply when this is true
AI provider to use for AI features
When TrueFoundry is selected, TrueFoundry will manage and provide the AI models for the features.
Alert
AlertConfig
3 to 35 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
π NotificationTargetForAlertRuleConfigure one or more notification targets where alerts will be sent. Each target specifies a notification channel (email or slack).
π PrometheusAlertRuleDefine one or more alert rules that specify the conditions to monitor, when to trigger alerts, and how they should be handled.
AlertConfigResource
π AlertConfigResourceTypeThe type of resource this alert config belongs to. Can be either an application or a cluster.
FQN of the resource this alert config belongs to.
AmqpInputConfig
AMQP Queue URL of Subscriber
AMQP Queue Name
Wait timeout for long polling.
AmqpMetricConfig
Upper limit of the number of backlog messages the auto-scaler will try to maintain per replica. If you set this number to 10 and have 30 messages in the stream and one replica, the auto-scaler will scale the number of replicas to 3.
AmqpOutputConfig
AMQP Queue URL of Publisher
AMQP Routing Key to publish to.
AMQP Exchange Name
AnthropicKeyAuth
API key must not be empty
AnthropicModel
2 to 62 characters long alphanumeric word, may contain - in between, cannot start with a number
Model ID must not be empty
π ModelTypeSpecify the type of the Anthropic model
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
AnthropicProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
List of integrations that are associated with the Anthropic provider account
π CollaboratorList of users who have access to this provider account
Application
π RecommendationRecommendations for this application
π AlertAlerts for this application
alerts_summary
typing_extensions.Annotated[(typing.Optional[typing.Dict[(str, typing.Optional[typing.Any])]], Any)]
Summary of alerts for this application
Debug infos for this application
potential_problems
typing_extensions.Annotated[(typing.Optional[typing.List[ApplicationProblem]], Any)]
π ApplicationProblemPotential problems with the application
ApplicationDebugInfo
ApplicationMetadata
ApplicationProblem
ApplicationsCancelDeploymentResponse
Deployment cancelled
ApplicationSet
Name of the application set.
π ApplicationSetComponentsItemArray of components with their specifications.
Template to be used for the application set.
Values to be used to render components for the application set.
ApplicationSetComponentsItem
Union, One Of:ApplyMlEntityResponse
ApplyMlEntityResponseData
Union, One Of:Artifact
ArtifactManifest
Name of the entity
Key value metadata. Should be valid JSON. For e.g.
\{"business-unit": "sales", "quality": "good", "rating": 4.5\}
Version alias is alternate, ideally human readable, version string to reference an artifact version. It should start with
v
followed by alphanumeric and it can include .
and -
in between (e.g. v1.0.0
, v1-prod
, v3-dev
, etc)Name of the ML Repo
Version of the entity
Step/Epoch number in an iterative training loop the artifact version was created. Generally useful when logging a model version from a MLRepo Run
ID of the MLRepo Run that generated the artifact version
ArtifactManifestSource
Union, One Of:ArtifactPath
Local file or folder path
Relative path where the file or folder will be uploaded to in the artifact
ArtifactsCacheVolume
Storage class of the Volume where artifacts will be cached
Size of the Volume (in GB) where artifacts will be cached. Should be greater than twice the size of artifacts getting cached
ArtifactsDownload
π ArtifactsDownloadArtifactsItemList of artifacts to be cached
ArtifactsDownloadArtifactsItem
Union, One Of:ArtifactVersion
AssistantMessage
Role of the message
π AssistantMessageContentText content for the assistant message
π ToolCallList of tool calls made by the assistant
Name of the assistant this message is from
AssistantMessageContent
Union, One Of:AssistantMessageContentItem
Union, One Of:AsyncProcessorSidecar
URL for the processor to invoke
Timeout for the invoke request in seconds
Image for the processor sidecar (This field will be deprecated in the future)
AsyncService
π AsyncServiceReplicasDeploy multiple instances of your pods to distribute incoming traffic across them, ensuring effective load balancing.
AsyncServiceAutoscaling
π AsyncServiceAutoscalingMetricsMetrics to use for the autoscaler
AsyncServiceAutoscalingMetrics
Union, One Of:- π SqsQueueMetricConfig
- π NatsMetricConfig
- π KafkaMetricConfig
- π CronMetric
- π AmqpMetricConfig
AsyncServiceReplicas
Union, One Of:Autoshutdown
The period to wait after the last received request before scaling the replicas to 0. This value should be high enough to allow for the replicas of the service to come up to avoid premature scaling down.
AwsAccessKeyAuth
AWS Access Key ID
AWS Secret Access Key for the user to authenticate with
AWS Session Token, only required when using temporary credentials
AwsAccessKeyBasedAuth
AWS access key ID that you want to use for authentication.
AWS access key secret that you want to use for authentication.
AwsAssumedRoleBasedAuth
ARN of the role you want to assume.
AwsBedrockGuardrailConfig
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
π AwsBedrockGuardrailConfigOperationThe operation type to use for the Guardrail. Validate guardrails are used to validate requests and mutate can validate as well as mutate requests.
Validate guardrails are run in parallel while mutate guardrails are run sequentially.
π AwsBedrockGuardrailConfigAuthDataAuthentication data for the AWS account
The ID of the Guardrail to use.
The version of the Guardrail to use.
AwsBedrockGuardrailConfigAuthData
Union, One Of:AwsBedrockProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
π AwsBedrockProviderAccountAuthDataAuthentication data for the AWS account
π BedrockModelV2List of integrations that are associated with the AWS Bedrock provider account
π CollaboratorList of users who have access to this provider account
AwsBedrockProviderAccountAuthData
Union, One Of:AwsEcr
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
The URL of the registry.
π AwsEcrAuthDataCustom authentication data for the integration.
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
AwsEcrAuthData
Union, One Of:AwsEksIntegration
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
Name of the cluster given in the cloud account.
π AwsEksIntegrationAuthDataCustom authentication data for the integration.
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
AwsEksIntegrationAuthData
Union, One Of:AwsInferentia
Name of the AWS Inferentia Accccelerator. One of [INF1, INF2].
This field is required for Node Selector and can be ignored in Nodepool Selector.
Count of Inferentia accelerator chips to provide to the application
AwsIntegrations
Union, One Of:- π AwsEcr
- π AwsS3
- π AwsParameterStore
- π AwsSecretsManager
- π AwsEksIntegration
- π BedrockModel
AwsParameterStore
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
π AwsParameterStoreAuthDataCustom authentication data for the integration.
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
AwsParameterStoreAuthData
Union, One Of:AwsProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
The AWS account ID.
π AwsProviderAccountAuthDataAuthentication data for the AWS account.
π AwsIntegrationsList of integrations that are associated with the AWS provider account.
AwsProviderAccountAuthData
Union, One Of:AwsS3
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
The root path of the storage.
π AwsS3AuthDataCustom authentication data for the integration.
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
AwsS3AuthData
Union, One Of:AwsSecretsManager
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
π AwsSecretsManagerAuthDataCustom authentication data for the integration.
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
AwsSecretsManagerAuthData
Union, One Of:AzureAiInferenceModel
2 to 62 characters long alphanumeric word, may contain - in between, cannot start with a number
π ModelTypeTypes of models supported by this Azure AI deployment
deployment_details
typing_extensions.Annotated[(AzureAiInferenceModelDeploymentDetails, Any)]
required
π AzureAiInferenceModelDeploymentDetailsConfiguration details for the Azure AI deployment
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
AzureAiInferenceModelDeploymentDetails
Union, One Of:AzureAiManagedDeployment
Name of the Azure AI deployment
Name of the Azure AI endpoint
AzureAiServerlessDeployment
Name of the Azure AI deployment
Region where the Azure AI deployment is located
AzureAksIntegration
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
The resource group of the cluster in the cloud account.
Name of the cluster given in the cloud account.
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
AzureBasicAuth
The username for the Azure managed identity.
The password for the Azure managed identity.
AzureBlobStorage
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
The root path of the storage.
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
AzureConnectionStringAuth
The connection string for the Azure managed identity.
AzureContainerRegistry
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
The URL of the registry.
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
AzureContentSafetyGuardrailConfig
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
The resource name where API is deployed.
API version for the Content Safety API
Custom host for the PII detection API
Array of blocklist names to check against
Severity threshold for the Content Safety API
π AzureContentSafetyCategoryCategories to check against
AzureFoundryModel
2 to 62 characters long alphanumeric word, may contain - in between, cannot start with a number
This is the Model Name on Azure
π ModelTypeTypes of models supported by this Azure AI Foundry deployment
Azure Foundry endpoint
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
AzureFoundryModelV2
2 to 62 characters long alphanumeric word, may contain - in between, cannot start with a number
π ModelTypeTypes of models supported by this Azure AI Foundry deployment
The name of the Azure Foundry deployment
Endpoint URL must not be empty
The API version for the Azure Foundry model
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
AzureFoundryProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
π AzureFoundryModelV2List of integrations that are associated with the Azure Foundry provider account
π CollaboratorList of users who have access to this provider account
AzureIntegrations
Union, One Of:- π AzureAksIntegration
- π AzureContainerRegistry
- π AzureBlobStorage
- π AzureOpenAiModel
- π AzureVault
- π AzureReposIntegration
- π AzureAiInferenceModel
- π AzureFoundryModel
AzureKeyAuth
API key must not be empty
AzureOAuth
The Azure Active Directory tenant (directory) ID.
The client (application) ID of an App Registration in the tenant.
A client secret that was generated for the App Registration.
The ID of the target subscription. The value must be an UUID.
AzureOpenAiModel
2 to 62 characters long alphanumeric word, may contain - in between, cannot start with a number
This is the name of the deployment
Azure OpenAI Service endpoint
π ModelTypeSpecify the type of the model
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
AzureOpenAiModelV2
2 to 62 characters long alphanumeric word, may contain - in between, cannot start with a number
Model ID must not be empty
API version must not be empty
π ModelTypeSpecify the type of the Azure OpenAI model
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
AzureOpenAiProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
Endpoint URL must not be empty
π AzureOpenAiModelV2List of integrations that are associated with the Azure OpenAI provider account
π CollaboratorList of users who have access to this provider account
AzurePiiGuardrailConfig
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
The resource name where API is deployed.
API version for the Content Safety API
Custom host for the PII detection API
π AzurePiiGuardrailConfigDomainDomain for PII detection
π AzurePiiCategoryCategories of PII to detect.
Version of the PII detection model to use
Language code for PII detection
AzureProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
The ID of the target subscription. The value must be an UUID.
π AzureIntegrationsList of integrations that are associated with the Azure provider account.
AzureReposIntegration
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
AzureVault
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
Name of key vault name.
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
BaseArtifactVersion
BaseArtifactVersionManifest
Union, One Of:BaseAutoscaling
Minimum number of replicas to keep available
Maximum number of replicas allowed for the component.
This is the interval to check each trigger on.
BaseOAuth2Login
client ID for OAuth2.
Client secret or the TrueFoundry secret containing the client secret for OAuth2.
URL for the authorization request
The endpoint to exchange auth code for tokens.
List of scopes to request from the OAuth2 provider.
π BaseOAuth2LoginJwtSourceSource of the JWT token to be used for verification.
BaseOAuth2McpServerLogin
client ID for OAuth2 or the TrueFoundry secret FQN containing the client ID.
Client secret or the TrueFoundry secret FQN containing the client secret for OAuth2.
URL for the authorization request
The endpoint to exchange auth code for tokens.
List of scopes to request from the OAuth2 provider.
π BaseOAuth2McpServerLoginJwtSourceSource of the JWT token to be used for verification.
BaseService
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
Configure environment variables to be injected in the service either as plain text or secrets. Docs
π BaseServiceMountsItemConfigure data to be mounted to service pod(s) as a string, secret or volume. Docs
BaseServiceImage
Union, One Of:BaseServiceMountsItem
Union, One Of:BaseWorkbenchInput
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
Size of the home directory for the workbench (Persistent Storage)
Configure environment variables to be injected in the service either as plain text or secrets. Docs
π BaseWorkbenchInputMountsItemConfigure data to be mounted to workbench pod(s) as a string, secret or volume. Docs
BaseWorkbenchInputMountsItem
Union, One Of:BasicAuthCreds
Username for the user to authenticate with
Password for the user to authenticate with
BedrockKeyAuth
BedrockModel
2 to 62 characters long alphanumeric word, may contain - in between, cannot start with a number
π ModelTypeSpecify the type of the model
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
BedrockModelAuthData
Union, One Of:BedrockModelV2
2 to 62 characters long alphanumeric word, may contain - in between, cannot start with a number
Model ID must not be empty
π ModelTypeSpecify the type of the Bedrock model
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
BitbucketIntegration
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
BitbucketProviderAccount
BlobStorageReference
Path to the content in blob storage
π MimeTypeMIME type of the content
BlueGreen
BudgetConfig
Name of the budget configuration
π BudgetRuleList of budget rules
BudgetRule
Unique identifier for the rule
Numeric limit value for the budget
BudgetWhen
List of subjects that this rule applies to
List of models that this rule applies to
Metadata key-value pairs that this rule applies to
Build
FQN of the container registry. If you canβt find your registry here,
add it through the Integrations page
BuildBuildSource
Union, One Of:BuildBuildSpec
Union, One Of:BuildInfo
Status of the build: STARTED = 20, SUCCEEDED = 30, FAILED = 40
Canary
CanaryStep
The rest will continue to go to the existing deployment
If this is not set, the step will pause indefinitely on this step
ChangePasswordResponse
Union, One Of:ChatPromptManifest
Name of the entity
Key value metadata. Should be valid JSON. For e.g.
\{"business-unit": "sales", "quality": "good", "rating": 4.5\}
Version alias is alternate, ideally human readable, version string to reference an artifact version. It should start with
v
followed by alphanumeric and it can include .
and -
in between (e.g. v1.0.0
, v1-prod
, v3-dev
, etc)Name of the ML Repo
Version of the entity
π ChatPromptManifestMessagesItemList of messages in the chat conversation, must be non-empty
Variables referenced in messages and that can be replaced when running generation
π ToolSchemaList of tools to be used in the chat prompt
π ChatPromptManifestMcpServersItemA list of MCP servers FQNs or URLs and their tools
ChatPromptManifestMcpServersItem
Union, One Of:ChatPromptManifestMessagesItem
Union, One Of:Cluster
ClusterGateway
Used when there are 2 gateways with same host
ClusterManifest
3 to 35 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
π ClusterManifestClusterTypeThe cluster type of this cluster
Tag the environment like dev, staging or production. You will need to create environments if you donβt have already.
Domain URL should be valid domain or IP address
π NodepoolAdd nodepools that are already created in your cluster.
When deploying, applications can choose to schedule from these nodepools.
π ClusterManifestNodeLabelKeysThe node label keys that this cluster supports.
Note: You will additionally need to add
truefoundry.com/gpu_type
label for GPU-supported node pools.
For more information, check out this documentation.π CollaboratorCollaborators who can access this cluster
ClusterManifestMonitoring
ClusterManifestNodeLabelKeys
The nodepool selector label that this cluster supports
ClusterManifestWorkbenchConfig
Must not contain any spaces.
ClustersDeleteResponse
Success message
Codeserver
CohereKeyAuth
API key must not be empty
CohereModel
2 to 62 characters long alphanumeric word, may contain - in between, cannot start with a number
Model ID must not be empty
π ModelTypeSpecify the type of the Cohere model
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
CohereProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
List of integrations that are associated with the Cohere provider account
π CollaboratorList of users who have access to this provider account
Collaborator
Fully Qualified Name of the subject. eg: user:email or team:teamname
Role ID for the resource
CommonToolsSettings
Enable web search tool
Allows model to search the web for information.
Enable code executor tool
Allows model to execute code and return the results.
Enable sequential thinking tool
Allows model to reason step-by-step to solve complex problems.
Enable vision OCR tool
Allows model to extract text from images or PDFs.
Enable web scraping tool.
Allows model to scrape content from web pages with intelligent format selection for structured or unstructured data.
ContainerTaskConfig
Configure environment variables to be injected in the task either as plain text or secrets. Docs
π VolumeMountConfigure data to be mounted to Workflow pod(s) as a volume.
ContainerTaskConfigImage
Union, One Of:CoreNatsOutputConfig
Output NATS URL
Output NATS root subject should only contain alphanumeric letters, dashes(-), and periods(.)
CpuUtilizationMetric
Percentage of cpu request averaged over all replicas which the autoscaler should try to maintain
CreateMultiPartUploadRequest
CreatePersonalAccessTokenResponse
Personal Access Token
CronMetric
Desired number of replicas during the given interval. Default value is max_replicas.
Cron expression indicating the start of the cron schedule.
Cron expression indicating the end of the cron schedule.
Timezone against which the cron schedule will be calculated, e.g. βAsia/Tokyoβ. Default is machineβs local time.
https://docs.truefoundry.com/docs/list-of-supported-timezones
CustomBasicAuth
The username for the basic authentication.
The password for the basic authentication.
CustomBearerAuth
The bearer token for the custom bearer authentication.
CustomBlobStorage
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
The endpoint of the storage provider.
The root path of the storage.
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
CustomGuardrailConfig
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
π CustomGuardrailConfigOperationThe operation type to use for the Guardrail. Validate guardrails are used to validate requests and mutate can validate as well as mutate requests.
Validate guardrails are run in parallel while mutate guardrails are run sequentially.
π CustomGuardrailConfigTargetSpecify whether the guardrail should be applied to the request or response. Guardrails with target βRequestβ can be only used in input guardrails and guardrails with target βResponseβ can only be used in output guardrails.
The URL of the Guardrail to send a post request to.
π CustomGuardrailConfigAuthDataAuthentication data for the Guardrail Server.
Headers for the Guardrail Server. Forwarded to the Guardrail Server as is. For example:
\{"Authorization": "APIKey \<token\>"\}
The config for the Guardrail Server. This is a JSON object that will be sent as a config to Guardrail Server along with the request.
CustomGuardrailConfigAuthData
Union, One Of:CustomHelmRepo
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
The URL of the Helm Repo.
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
CustomIntegrations
Union, One Of:- π CustomUsernamePasswordArtifactsRegistry
- π CustomModel
- π EmailNotificationChannel
- π CustomHelmRepo
- π CustomBlobStorage
- π CustomJwtAuthIntegration
CustomJwtAuthIntegration
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
The base URL of the authentication provider.
The JSON Web Key Set URI for JWT verification.
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
CustomModel
2 to 62 characters long alphanumeric word, may contain - in between, cannot start with a number
enter valid https/http URL that should not end with trailing slash
π ModelTypeSpecify the type of the model
π CustomModelAuthDataCustom authentication data for the integration.
Custom headers for the integration. Forwarded to the provider as is. For example:
\{"Authorization": "APIKey \<token\>"\}
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
CustomModelAuthData
Union, One Of:CustomProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
π CustomIntegrationsList of integrations that are associated with the provider account.
CustomUsernamePasswordArtifactsRegistry
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
The URL of the registry.
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
DatabricksApiKeyAuth
API key for Databricks authentication.
DatabricksModel
2 to 62 characters long alphanumeric word, may contain - in between, cannot start with a number
Model ID must not be empty
π ModelTypeSpecify the type of the Databricks model
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
DatabricksProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
π DatabricksProviderAccountAuthDataDatabricks authentication credentials
Base URL must not be empty
List of integrations that are associated with the Databricks provider account
π CollaboratorList of users who have access to this provider account
DatabricksProviderAccountAuthData
Union, One Of:DatabricksServicePrincipalAuth
OAuth secret of the Databricks service principal.
Client ID or Application ID of the Databricks service principal.
DataDirectory
DataDirectoryManifest
Name of the data directory
Name of the ML Repo to create the data directory under
Description for the data directory
Key Value metadata. Should be valid JSON. For e.g.
\{"business-unit": "sales", "quality": "good", "rating": 4.5\}
DataDirectoryManifestSource
Union, One Of:DeactivateUserResponse
Union, One Of:DeepinfraKeyAuth
API key must not be empty
DeepinfraModel
2 to 62 characters long alphanumeric word, may contain - in between, cannot start with a number
Model ID must not be empty
π ModelTypeSpecify the type of the DeepInfra model
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
DeepinfraProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
List of integrations that are associated with the DeepInfra provider account
π CollaboratorList of users who have access to this provider account
DeleteApplicationResponse
Application deletion message
DeleteJobRunResponse
Job run deleted
DeletePersonalAccessTokenResponse
Union, One Of:DeleteSecretGroupResponse
Union, One Of:DeleteTeamResponse
Union, One Of:DeleteVirtualAccountResponse
Union, One Of:Deployment
deployment_statuses
typing_extensions.Annotated[(typing.Optional[typing.List[DeploymentStatus]], Any)]
applied_recommendations
typing_extensions.Annotated[(typing.Optional[typing.List[Recommendation]], Any)]
π RecommendationApplied recommendations for this deployment
DeploymentBuild
DeploymentManifest
Union, One Of:- π Service
- π AsyncService
- π Job
- π Notebook
- π Codeserver
- π SshServer
- π RStudio
- π Helm
- π Volume
- π ApplicationSet
- π Workflow
- π SparkJob
- π Intercept
DeploymentStatus
DeveloperMessage
The role of the messages author, in this case developer.
π DeveloperMessageContentThe contents of the developer message.
An optional name for the participant.
DeveloperMessageContent
Union, One Of:DockerFileBuild
The file path of the Dockerfile relative to project root path.
Build context path for the Dockerfile relative to project root path.
π DockerFileBuildCommandOverride the command to run when the container starts
When deploying a Job, the command can be templatized by defining
params
and referencing them in command
E.g. python main.py --learning_rate \{\{learning_rate\}\}
DockerFileBuildCommand
Union, One Of:DockerhubBasicAuth
DockerhubProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
DockerhubRegistry
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
DynamicVolumeConfig
Volume Type for the volume.
Name of the storage class to be used for the volume.
Size of volume in Gi
Specify the notification channel to send alerts to
List of recipientsβ email addresses if the notification channel is Email.
EmailNotificationChannel
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
The email address from which the email will be sent.
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
EmptyResponse
Union, One Of:Endpoint
Upto 253 characters, each part of host should be at most 63 characters long, can contain alphabets, digits and hypen, must begin and end with an alphanumeric characters. Parts must be separated by periods (.)
Should begin and end with a forward slash (/). Each part can can contain alphabets, digits and hypen, must begin and end with an alphanumeric characters. Parts should be separated by forward slashes (/)
EnkryptAiGuardrailConfig
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
π EnkryptAiGuardrailConfigOperationThe operation type to use for the Guardrail. Validate guardrails are used to validate requests and mutate can validate as well as mutate requests.
Validate guardrails are run in parallel while mutate guardrails are run sequentially.
The name of the guardrail to use for the Enkrypt AI Guardrail.
EnkryptAiKeyAuth
API key must not be empty
Environment
EnvironmentColor
EnvironmentManifest
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
Indicates if the environment is for production use
Event
Name of the event
Timestamp when the event was first observed
Timestamp when the event was last observed
π EventInvolvedObjectDetails of the involved object
Type of the event
Number of occurrences of the event
Reason for the event
Message describing the event
Namespace of the event
π EventChartChart of the event
EventChart
EventInvolvedObject
ExternalBlobStorageSource
Type of the source
URI referencing a path in the blob storage bucket linked to the MLRepo
FailureToleranceConfig
Number of failures allowed per minute before cooldown
Cooldown period in minutes after failure threshold is reached
HTTP status codes considered as failures
FallbackConfig
Name of the fallback configuration
π FallbackRuleList of fallback rules
FallbackModel
Target model or provider FQN to use as fallback
Optional parameters to override in the fallback request
FallbackRule
Unique identifier for the rule
π FallbackModelList of fallback models to try in sequence
FallbackWhen
List of subjects that this rule applies to
List of models that this rule applies to
Metadata key-value pairs that this rule applies to
HTTP status codes that trigger fallback behavior
FastAiFramework
Type of the framework
FileInfo
FlyteLaunchPlan
FlyteLaunchPlanId
resource_type
typing_extensions.Annotated[(typing.Literal[Any], Any)]
default:"LAUNCH_PLAN"
required
FlyteLaunchPlanSpec
FlyteTask
FlyteTaskCustom
FlyteTaskCustomTruefoundry
Union, One Of:FlyteTaskId
FlyteTaskTemplate
FlyteWorkflow
FlyteWorkflowId
FlyteWorkflowTemplate
ForwardAction
Eg - For a service called
sample-service
in workspace ws
this would be `sample-service.ws.svc.cluster.localFunction
Name of the tool/function being called
Arguments passed to the tool/function
FunctionSchema
Name of the function
Description of the function
Parameters schema for the function
Indicates if the function should be called strictly
GatewayConfig
Union, One Of:- π RateLimitConfig
- π FallbackConfig
- π LoadBalancingConfig
- π GuardrailsConfig
- π BudgetConfig
GcpApiKeyAuth
GcpGcr
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
The URL of the registry.
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
GcpGcs
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
The root path of the storage.
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
GcpGkeIntegration
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
Name of the cluster given in the cloud account.
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
GcpGsm
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
GcpIntegrations
Union, One Of:GcpKeyFileAuth
GcpProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
The project id of the GCP account.
π GcpProviderAccountAuthDataAuthentication data for the GCP account.
π GcpIntegrationsList of integrations that are associated with the GCP provider account.
GcpProviderAccountAuthData
Union, One Of:GcpTpu
Name of the TPU Type. One of
tpu-v4-podslice
(TPU v4, ct4p)tpu-v5-lite-device
(TPU v5e, ct5l)tpu-v5-lite-podslice
(TPU v5e, ct5lp)tpu-v5p-slice
(TPU v5p, ct5p)
Topology of the TPU slices. Currently only single-host topology is supported.
Please refer to TPUs on GKE docs
Allowed Values:
2x2x1
fortpu-v4-podslice
- One of
1x1
,2x2
,2x4
fortpu-v5-lite-device
andtpu-v5-lite-podslice
2x2x1
fortpu-v5p-slice
GeminiModelV2
2 to 62 characters long alphanumeric word, may contain - in between, cannot start with a number
Model ID must not be empty
π ModelTypeSpecify the type of the Gemini model (e.g., chat, text, etc.)
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
GetAlertsResponse
π AlertObject containing alert data grouped by alert name
GetApplicationDeploymentResponse
Deployment
GetApplicationResponse
Application
GetArtifactResponse
GetArtifactVersionResponse
GetAuthenticatedVcsurlResponse
Authenticated URL to access the repository
GetAutoProvisioningStateResponse
State of auto-provisioning for the cluster.
GetChartsResponse
Step size
GetClusterResponse
π ClusterCluster
GetDataDirectoryResponse
GetEnvironmentResponse
π EnvironmentEnvironment
GetEventsResponse
GetJobRunResponse
π JobRunJobRun
GetLogsResponse
GetMlRepoResponse
π MlRepoML Repo
GetModelResponse
GetModelVersionResponse
GetOrCreatePersonalAccessTokenResponse
π VirtualAccountPersonal Access Token data
Personal Access Token
Whether the PAT was created (true) or already existed (false)
GetPromptResponse
GetPromptVersionResponse
GetSecretGroupResponse
π SecretGroupSecret Group
GetSecretResponse
Secret
GetSignedUrLsRequest
GetSignedUrLsResponse
GetSuggestedDeploymentEndpointResponse
Host name
Path
π ClusterGatewayGateway configuration
GetTeamResponse
GetUserResponse
GetVirtualAccountResponse
π VirtualAccountVirtual Account
Virtual Account token (present only when creating a virtual account)
GetWorkspaceResponse
π WorkspaceWorkspace
GitHelmRepo
Needs to be a valid URL.
Branch/Commit SHA/Tag of the git repo.
Path to the chart.
Helm values files for overriding values in the helm chart.
The path is relative to the Path directory defined above
GithubIntegration
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
GithubProviderAccount
GitlabIntegration
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
GitlabProviderAccount
GitRepositoryExistsResponse
The flag if repository exists or not
The integration id of the repository
GitSource
Needs to be a valid Github, Bitbucket, Azure Repos or Gitlab link
The commit SHA.
Selecting branch will select latest commit SHA of the branch.
GluonFramework
Type of the framework
GoogleGeminiProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
π GeminiModelV2List of integrations that are associated with the Google Gemini provider account
π CollaboratorList of users who have access to this provider account
GoogleModel
2 to 62 characters long alphanumeric word, may contain - in between, cannot start with a number
π ModelTypeSpecify the type of the model
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
GoogleVertexProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
Project ID must not be empty
π VertexModelV2List of integrations that are associated with the Google Vertex provider account
π CollaboratorList of users who have access to this provider account
Graph
Name
Description
Chart name
π GraphChartTypeChart type
Chart params
GroqKeyAuth
API key must not be empty
GroqModel
2 to 62 characters long alphanumeric word, may contain - in between, cannot start with a number
Model ID must not be empty
π ModelTypeSpecify the type of the Groq model
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
GroqProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
List of integrations that are associated with the Groq provider account
π CollaboratorList of users who have access to this provider account
GuardrailConfigGroup
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
π CollaboratorList of users who have access to this Guardrail Config Group.
π GuardrailConfigIntegrationsList of Guardrail Configs, which are part of this Guardrail Config Group.
GuardrailConfigIntegrations
Union, One Of:- π OpenAiModerationsGuardrailConfig
- π AwsBedrockGuardrailConfig
- π CustomGuardrailConfig
- π AzurePiiGuardrailConfig
- π AzureContentSafetyGuardrailConfig
- π EnkryptAiGuardrailConfig
Guardrails
List of guardrail names to apply to user input messages before processing.
List of guardrail names to apply to AI response messages before returning to user.
GuardrailsConfig
Name of the guardrails configuration
π GuardrailsRuleList of guardrail rules
GuardrailsRule
Unique identifier for the rule
Guardrail selectors of the guardrails to apply to the input (format: groupName/guardrailName)
Guardrail selectors of the guardrails to apply to the output (format: groupName/guardrailName)
GuardrailsWhen
List of subjects that this rule applies to
List of models that this rule applies to
Metadata key-value pairs that this rule applies to
H2OFramework
Type of the framework
HeaderMatch
HealthProbe
Time to wait after container has started before checking the endpoint
How often to check the endpoint
Time to wait for a response from the endpoint before considering it down
Number of successful responses from the endpoint before container is considered healthy
Number of consecutive failures before the container is considered down
Helm
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
Add labels to base argo app
Values file as block file
ignore_differences
typing_extensions.Annotated[(typing.Optional[typing.List[typing.Dict[(str, typing.Optional[typing.Any])]]], Any)]
HelmRepo
Needs to be a valid URL.
FQN of the helm repo integration. If you canβt find your integration here,
add it through the Integrations page
The helm chart name
Helm chart version
HelmSource
Union, One Of:HttpError
HTTP Status Code
Error Message
π HttpErrorCodeError code
Error details
HttpErrorCode
Union, One Of:HttpProbe
Path to the health check endpoint
Listening port for the health check endpoint
Host name to connect to, defaults to the pod IP
Scheme to use for connecting to the host
HttpValidationError
HuggingfaceArtifactSource
Model ID of the artifact to be downloaded
Revision of the artifact to be downloaded
List of patterns to ignore while downloading the artifact
Environment variable which will contain the download path of the artifact
IChange
old_value
typing_extensions.Annotated[(typing.Optional[typing.Dict[(str, typing.Optional[typing.Any])]], Any)]
Image
The image URI. Specify the name of the image and the tag.
If the image is in Dockerhub, you can skip registry-url (for e.g.
tensorflow/tensorflow
).
You can use an image from a private registry using Advanced fieldsFQN of the container registry. If you canβt find your registry here,
add it through the Integrations page
π ImageCommandOverride the command to run when container starts.
When deploying a Job, the command can be templatized by defining
params
and referencing them in command
E.g. python main.py --learning_rate \{\{learning_rate\}\}
ImageCommand
Union, One Of:ImageContentPart
Type of the content part
π ImageUrlImage URL linking to the image
ImageUrl
π ImageUrlUrlImage URL linking to the image
ImageUrlUrl
Union, One Of:InfraProviderAccount
Union, One Of:- π AwsProviderAccount
- π AzureProviderAccount
- π GcpProviderAccount
- π DockerhubProviderAccount
- π BitbucketProviderAccount
- π CustomProviderAccount
- π GithubProviderAccount
- π GitlabProviderAccount
- π JfrogProviderAccount
- π TtlProviderAccount
- π TrueFoundryProviderAccount
- π QuayProviderAccount
- π SlackProviderAccount
- π WebhookProviderAccount
- π PagerDutyProviderAccount
InputOutputBasedCostMetricValue
Intercept
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
InterceptRulesItem
InterceptRulesItemAction
Union, One Of:InternalArtifactVersion
InternalListArtifactVersionsResponse
InternalListArtifactVersionsResponseDataItem
Union, One Of:InternalModelVersion
InviteUserResponse
Union, One Of:IsClusterConnectedResponse
Whether the cluster is connected
JfrogArtifactsRegistry
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
The URL of the registry.
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
JfrogBasicAuth
The username for the basic authentication.
The password for the basic authentication.
JfrogProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
The name of the JFrog account.
List of integrations that are associated with the provider account.
Job
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
Trigger the job after deploy immediately
π ParamConfigure params and pass it to create different job runs
Configure environment variables to be injected in the service either as plain text or secrets. Docs
π JobAlertConfigure alerts to be sent when the job starts/fails/completes
Specify the maximum number of attempts to retry a job before it is marked as failed.
Job timeout in seconds.
Number of runs that can run concurrently
π JobMountsItemConfigure data to be mounted to job pod(s) as a string, secret or volume. Docs
JobAlert
Specify the notification channel to send alerts to
List of recipientsβ email addresses if the notification channel is Email.
Send an alert when the job starts
Send an alert when the job fails
JobImage
Union, One Of:JobMountsItem
Union, One Of:JobRun
JobRun Name
Application Name
Deployment Version
Created At
End Time of JobRun
Duration of JobRun
Command
Total Retries
Error
π JobRunStatusStatus of JobRun
Triggered By
π SubjectTriggered By Subject
Exit Code
Spark UI Url
JobTrigger
Union, One Of:JobTriggerInput
π JobTriggerInputCommandCommand
Parameters used to populate template command
JobTriggerInputCommand
Union, One Of:JwtAuthConfig
FQN of the JWT Auth integration. If you canβt find your integration here,
add it through the Integrations page
Enable login for the service
π JwtAuthConfigClaimsItemList of key-value pairs of claims to verify in the JWT token
List of paths that will bypass auth.
needs to start with a forward slash(/) and should not contain wildcards(*)
JwtAuthConfigClaimsItem
KafkaInputConfig
βKafka Bootstrap servers - Comma separated list of Kafka brokers βhostname:portβ to connect to for bootstrapβ
Kafka topic to subscribe to
The name of the consumer group to join for dynamic partition assignment
TLS configuration for SASL authentication
Wait timeout for long polling.
KafkaMetricConfig
Upper limit of the number of backlog messages the auto-scaler will try to maintain per replica. If you set this number to 10 and have 30 messages in the stream and one replica, the auto-scaler will scale the number of replicas to 3.
KafkaOutputConfig
βKafka Bootstrap servers - Comma separated list of Kafka brokers βhostname:portβ to connect to for bootstrapβ
Kafka topic to publish to
TLS configuration for SASL authentication
KafkaSaslAuth
Username for SASL authentication
Password for SASL authentication
KerasFramework
Type of the framework
Kustomize
Content of kustomization.yaml to perform kustomize operation. Please do not include the
resources
section. It is filled in automaticallyAdditional kubernetes manifests to be included in the application
LatencyBasedLoadBalanceTarget
Target model or provider FQN
Optional parameters to override in the request
LatencyBasedLoadBalancingRule
Unique identifier for the rule
π LatencyBasedLoadBalanceTargetList of targets for latency-based load balancing
LightGbmFramework
Type of the framework
ListApplicationDeploymentsResponse
Array of Deployments
π PaginationPagination information
ListApplicationsResponse
Array of Applications
π PaginationPagination information
ListArtifactsResponse
ListArtifactVersionsResponse
ListClusterAddonsResponse
π AddonComponentArray of Cluster Addons
π PaginationPagination information
ListClustersResponse
π ClusterArray of cluster
π PaginationPagination information
ListDataDirectoriesResponse
ListEnvironmentsResponse
π EnvironmentArray of Environments
π PaginationPagination information
ListFilesRequest
ListFilesResponse
ListJobRunResponse
π JobRunJobRun History
π PaginationPagination Information
ListMlReposResponse
ListModelsResponse
ListModelVersionsResponse
ListPersonalAccessTokenResponse
π VirtualAccountArray of Personal Access Tokens
π PaginationPagination Information
ListPromptsResponse
ListPromptVersionsResponse
ListSecretGroupResponse
π SecretGroupArray of Secret Groups
π PaginationPagination information
ListSecretsResponse
Array of Secrets
π PaginationPagination Information
ListTeamsResponse
π PaginationPagination Information
ListUsersResponse
π PaginationPagination information
ListVirtualAccountResponse
π VirtualAccountArray of Virtual Accounts
π PaginationPagination Information
ListWorkspacesResponse
π WorkspaceArray of Workspace
π PaginationPagination information
LoadBalanceTarget
Target model or provider FQN
Relative weight for routing decisions (higher values mean more traffic)
Optional parameters to override in the request
LoadBalancingConfig
Name of the load balancing configuration
π ModelConfigConfiguration for specific models
π LoadBalancingRuleList of load balancing rules
LoadBalancingRule
Union, One Of:LoadBalancingWhen
List of subjects that this rule applies to
List of models that this rule applies to
Metadata key-value pairs that this rule applies to
LocalArtifactSource
Type of the source
π ArtifactPathArray of ArtifactPath objects representing the source and destination paths
LocalModelSource
Type of the source
Path to the model file or folder
LocalSource
Local project root path.
run docker build locally
Log
Job Name
Log Content
Log Stream
Log Time
Container Name
Manual
McpServerAuth
Union, One Of:McpServerHeaderAuth
McpServerIntegration
The name of the MCP Server.
1 to 1000 characters long, may contain any character except newlines
The endpoint URL for the MCP Server. The system will first try a connection using streamable-http transport on this URL. If that fails, it will attempt a connection using SSE transport on <url>/sse.
McpServerOAuth2
McpServerPassthrough
McpServerProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
π CollaboratorList of users who have access to this MCP Server Group.
π McpServerIntegrationList of MCP Servers, which are part of this MCP Server Group.
McpServerWithFqn
The FQN of the MCP server
The FQN of the MCP server
Whether to enable all tools from the MCP server
π McpToolThe tools to enable from the MCP server
McpServerWithUrl
The URL of the MCP server
The URL of the MCP server
The headers to send to the MCP server
Whether to enable all tools from the MCP server
π McpToolThe tools to enable from the MCP server
McpTool
The name of the tool as it appears in the MCP server
Metadata
Job Run Name Alias
Metric
MirrorAction
Eg - For a service called
sample-service
in workspace ws
this would be `sample-service.ws.svc.cluster.localMistralAiKeyAuth
API key must not be empty
MistralAiModel
2 to 62 characters long alphanumeric word, may contain - in between, cannot start with a number
Model ID must not be empty
π ModelTypeSpecify the type of the Mistral AI model
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
MistralAiProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
List of integrations that are associated with the Mistral AI provider account
π CollaboratorList of users who have access to this provider account
MlRepo
Tenant Name
Created At
Experiment Id
Number of runs
Number of datasets
MlRepoManifest
Alphanumeric word, may contain β-β with a maximum length of 100 characters
Description can be maximum 500 characters
Select the storage-integration where you want to save your artifacts and models
View Docs
π CollaboratorUsers and Teams that have access to MLRepo
Model
ModelConfig
Model identifier
ModelConfiguration
Name of the provider, must be one of the integration providers configured for the Gateway
Name of the model to be used for generations. This model should be available in the provider
π ParametersParameters to pass to the model when generating
Arbitrary extra parameters to pass to the model when generating
ModelCostMetric
Union, One Of:ModelManifest
Name of the entity
Key value metadata. Should be valid JSON. For e.g.
\{"business-unit": "sales", "quality": "good", "rating": 4.5\}
Version alias is alternate, ideally human readable, version string to reference an artifact version. It should start with
v
followed by alphanumeric and it can include .
and -
in between (e.g. v1.0.0
, v1-prod
, v3-dev
, etc)Name of the ML Repo
Version of the entity
π ModelManifestFrameworkFramework for the model version like Transformers, PyTorch, Sklearn, Xgboost etc with framework specific metadata. This will be used to infer model deployment configuration
Step/Epoch number in an iterative training loop the model version was created. Generally useful when logging a model version from a MLRepo Run
ID of the MLRepo Run that generated the model version
ModelManifestFramework
Union, One Of:- π TransformersFramework
- π TensorFlowFramework
- π SklearnFramework
- π PyTorchFramework
- π KerasFramework
- π XgBoostFramework
- π LightGbmFramework
- π FastAiFramework
- π H2OFramework
- π OnnxFramework
- π SpaCyFramework
- π StatsModelsFramework
- π GluonFramework
- π PaddleFramework
ModelManifestSource
Union, One Of:ModelProviderAccount
Union, One Of:- π AwsBedrockProviderAccount
- π GoogleVertexProviderAccount
- π GoogleGeminiProviderAccount
- π AzureOpenAiProviderAccount
- π AzureFoundryProviderAccount
- π CohereProviderAccount
- π Ai21ProviderAccount
- π AnthropicProviderAccount
- π DeepinfraProviderAccount
- π GroqProviderAccount
- π MistralAiProviderAccount
- π PerplexityAiProviderAccount
- π TogetherAiProviderAccount
- π NomicProviderAccount
- π PalmProviderAccount
- π OllamaProviderAccount
- π OpenaiProviderAccount
- π DatabricksProviderAccount
- π SelfHostedModelProviderAccount
ModelVersion
ModelVersionEnvironment
Python version to use when running the model
pip dependencies needed for running the model
MultiPartUpload
MultiPartUploadResponse
NatsInputConfig
Input NATS URL
Name of the NATS stream
Input NATS root subject should only contain alphanumeric letters, dashes(-), and periods(.)
Consumer name should only contain alphanumeric letters, dashes(-), and underscores(_)
Wait timeout for long polling.
NATS Metrics URL should be a valid HTTP/HTTPS URL
NatsMetricConfig
Upper limit of the number of backlog messages the auto-scaler will try to maintain per replica. If you set this number to 10 and have 30 messages in the stream and one replica, the auto-scaler will scale the number of replicas to 3.
NatsOutputConfig
Output NATS URL
Output NATS root subject should only contain alphanumeric letters, dashes(-), and periods(.)
NatsUserPasswordAuth
Name of the NATS account
User for NATS authentication
Password for NATS authentication
Nodepool
NodepoolSelector
Nodepools where you want to run your workload. Multiple nodepools can be selected.
The workload is guaranteed to be scheduled on one of the nodepool
NodeSelector
Instance family of the underlying machine to use. Multiple instance families can be supplied.
The workload is guaranteed to be scheduled on one of them.
π NodeSelectorCapacityTypeConfigure what type of nodes to run the app. By default no placement logic is applied.
βspot_fallback_on_demandβ will try to place the application on spot nodes but will fallback to on-demand when spot nodes are not available.
βspotβ will strictly place the application on spot nodes.
βon_demandβ will strictly place the application on on-demand nodes.
NomicKeyAuth
API key must not be empty
NomicModel
2 to 62 characters long alphanumeric word, may contain - in between, cannot start with a number
Model ID must not be empty
π ModelTypeSpecify the type of the Nomic model
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
NomicProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
List of integrations that are associated with the Nomic provider account
π CollaboratorList of users who have access to this provider account
Notebook
Stop the notebook instance after this much time in minutes of inactivity.
The notebook instance will be stopped even if the notebook is open in your browser, but nothing is running on the notebook.
NotebookConfig
Must not contain any spaces.
NotificationTarget
Union, One Of:NotificationTargetForAlertRule
Union, One Of:NvidiaGpu
Name of the Nvidia GPU. One of [P4, P100, V100, T4, A10G, A100_40GB, A100_80GB]
This field is required for Node Selector and can be ignored in Nodepool Selector.
One instance of the card contains the following amount of memory -
P4: 8 GB, P100: 16 GB, V100: 16 GB, T4: 16 GB, A10G: 24 GB, A100_40GB: 40GB, A100_80GB: 80 GB
Count of GPUs to provide to the application
Note the exact count and max count available for a given GPU type depends on cloud provider and cluster type.
NvidiaMiggpu
Name of the Nvidia GPU. One of [P4, P100, V100, T4, A10G, A100_40GB, A100_80GB]
This field is required for Node Selector and can be ignored in Nodepool Selector.
One instance of the card contains the following amount of memory -
P4: 8 GB, P100: 16 GB, V100: 16 GB, T4: 16 GB, A10G: 24 GB, A100_40GB: 40GB, A100_80GB: 80 GB
π NvidiaMiggpuProfileName of the MIG profile to use. One of the following based on gpu type
Please refer to https://docs.nvidia.com/datacenter/tesla/mig-user-guide/#supported-mig-profiles for more details
A100 40 GB - [1g.5gb, 1g.10gb, 2g.10gb, 3g.20gb, 4g.20gb]
A100 80 GB / H100 80 GB - [1g.10gb, 1g.20gb, 2g.20gb, 3g.40gb, 4g.40gb]
H100 94 GB - [1g.12gb, 1g.24gb, 2g.24gb, 3g.47gb, 4g.47gb]
H100 96 GB - [1g.12gb, 1g.24gb, 2g.24gb, 3g.48gb, 4g.48gb]
H200 141 GB - [1g.18gb, 1g.35gb, 2g.35gb, 3g.71gb, 4g.71gb]
NvidiaTimeslicingGpu
Name of the Nvidia GPU. One of [P4, P100, V100, T4, A10G, A100_40GB, A100_80GB]
This field is required for Node Selector and can be ignored in Nodepool Selector.
One instance of the card contains the following amount of memory -
P4: 8 GB, P100: 16 GB, V100: 16 GB, T4: 16 GB, A10G: 24 GB, A100_40GB: 40GB, A100_80GB: 80 GB
Amount of GPU memory (in MB) to allocate. Please note, this limit is not being enforced today but will be in future. Applications are expected to operate in co-opertative mode
OAuth2LoginProvider
OciRepo
Need to be a valid URL.
FQN of the container registry. If you canβt find your registry here,
add it through the Integrations page
Helm chart version
OllamaKeyAuth
Host URL must not be empty
OllamaModel
2 to 62 characters long alphanumeric word, may contain - in between, cannot start with a number
Model ID must not be empty
π ModelTypeSpecify the type of the Ollama model
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
OllamaProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
List of integrations that are associated with the Ollama provider account
π CollaboratorList of users who have access to this provider account
OnnxFramework
Type of the framework
OpenaiApiKeyAuth
API key must not be empty
OpenAiModel
2 to 62 characters long alphanumeric word, may contain - in between, cannot start with a number
Model ID must not be empty
π ModelTypeSpecify the type of the OpenAI model
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
OpenAiModerationsGuardrailConfig
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
Optional custom base URL for OpenAI API. If not provided, the default base URL will be used.
The model to use for the OpenAI Moderation API.
category_thresholds
typing.Optional[typing.Dict[(str, OpenAiModerationsGuardrailConfigCategoryThresholdsValue)]]
π OpenAiModerationsGuardrailConfigCategoryThresholdsValueThe thresholds for the OpenAI Moderation API.
OpenAiModerationsGuardrailConfigCategoryThresholdsValue
Union, One Of:OpenAiModerationsGuardrailConfigCategoryThresholdsValueHarassment
OpenaiProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
Base URL must not be empty
List of integrations that are associated with the OpenAI provider account
π CollaboratorList of users who have access to this provider account
PaddleFramework
Type of the framework
PagerDuty
Specify the notification channel to send alerts to
PagerDutyIntegration
The name of the integration that will be displayed in the TrueFoundry UI.
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
PagerDutyIntegrationKeyAuth
The integration key for the PagerDuty integration
The integration URL for the PagerDuty integration
PagerDutyProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
π PagerDutyIntegrationList of integrations that are associated with the PagerDuty provider account
Pagination
Total number of items
Number of items skipped
Number of items per page
PalmKeyAuth
API key must not be empty
PalmModel
2 to 62 characters long alphanumeric word, may contain - in between, cannot start with a number
Model ID must not be empty
π ModelTypeSpecify the type of the PaLM model
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
PalmProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
List of integrations that are associated with the PaLM provider account
π CollaboratorList of users who have access to this provider account
Param
name can contain lower case alphabets, digits, underscore (_) and hypen (-). It can be 32 characters long, should start with an alphabet, and should end with either an alphabet or digit.
description cannot be longer than 127 characters
default value cannot be longer than 127 characters
Parameters
ParametersStop
Union, One Of:Permissions
The fully qualified name of the resource
The type of the resource (cluster, workspace, etc.)
The role id of the role to be assigned to the service account for that resource
PerplexityAiKeyAuth
API key must not be empty
PerplexityAiModel
2 to 62 characters long alphanumeric word, may contain - in between, cannot start with a number
Model ID must not be empty
π ModelTypeSpecify the type of the Perplexity AI model
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
PerplexityAiProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
List of integrations that are associated with the Perplexity AI provider account
π CollaboratorList of users who have access to this provider account
PersonalAccessTokenManifest
3 to 25 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
Expiration date of the personal access token
The fully qualified name of the user
PerThousandEmbeddingTokensCostMetric
PerThousandTokensCostMetric
PolicyFilters
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
PolicyManifest
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
Description must be between 1 byte and 1024 bytes
π PolicyManifestOperationDefines if the policy validates or mutates resources. Validation policies check TrueFoundry manifests to ensure they meet conditions and can block deployments. Mutation policies modify Kubernetes manifests before theyβre applied to the cluster. See this documentation for more details.
π PolicyManifestModeMode of the policy:
Audit
logs all policy evaluations without blocking deployments. Enforce
blocks deployments if the policy fails. Disabled
deactivates the policy.π PolicyEntityTypesTypes of applications this policy applies to
Actions this policy applies to
Code size must be between 1 byte and 10 KB
PolicyManifestOperation
Union, One Of:PolicyMutationOperation
Mutates or Validates Resources
Order must be a positive integer less than or equal to 100
PolicyValidationOperation
Mutates or Validates Resources
Port
Port number to expose.
π PortProtocolProtocol for the port.
Expose the port
π PortAppProtocolApplication Protocol for the port.
Select the application protocol used by your service. For most use cases, this should be
http
(HTTP/1.1).
If you are running a gRPC server, select the grpc
option.
This is only applicable if expose=true
.Upto 253 characters, each part of host should be at most 63 characters long, can contain alphabets, digits and hypen, must begin and end with an alphanumeric characters. Parts must be separated by periods (.)
Should begin and end with a forward slash (/). Each part can can contain alphabets, digits and hypen, must begin and end with an alphanumeric characters. Parts should be separated by forward slashes (/)
Should begin and end with a forward slash (/). Each part can can contain alphabets, digits and hypen, must begin and end with an alphanumeric characters. Parts should be separated by forward slashes (/)
π PortAuthAuthentication method for inbound traffic
PortAuth
Union, One Of:PresignedUrlObject
PrometheusAlertRule
3 to 35 alphanumeric characters long, can contain - in between, cannot start with a number and cannot end with a -
Enter a valid PromQL expression that defines the condition for triggering this alert. The alert will fire when this expression evaluates to true for the duration specified in the duration to trigger alert field.
The prometheus expression must remain true for this duration (in seconds) before the alert is triggered. If the condition becomes false before this time elapses, the alert will not fire.
Description of the alert rule which will be displayed in the alert rule list. This can be used to provide more context about the alert rule.
When enabled, notifications will be sent to all configured target channels when the alert conditions are met.
Prompt
PromptVersion
ProviderAccounts
Union, One Of:- π InfraProviderAccount
- π ModelProviderAccount
- π McpServerProviderAccount
- π GuardrailConfigGroup
PublicCostMetric
PySparkTaskConfig
Extra configuration properties to be passed to the spark job. Docs
Configure environment variables to be injected in the task either as plain text or secrets. Docs
π PySparkTaskConfigMountsItemConfigure data to be mounted to Workflow pod(s) as a volume.
PySparkTaskConfigMountsItem
Union, One Of:PythonBuild
Please enter a valid Python version tag
Build path relative to project root path.
Path to
requirements.txt
relative to
Path to build context
Define pip package requirements.
In Python/YAML E.g. [βfastapi>=0.90,<1.0β, βuvicornβ]
Debian packages to install via
apt get
.
In Python/YAML E.g. [βgitβ, βffmpegβ, βhtopβ]π PythonBuildCommandCommand to run when the container starts.
Command will be set as the Entrypoint of the generated image.
When deploying a Job, the command can be templatized by defining
params
and referencing them in command
E.g. python main.py --learning_rate \{\{learning_rate\}\}
Version of CUDA Toolkit and CUDNN to install in the image
These combinations are based off of publically available docker images on docker hub
You can also specify a valid tag of the form {cuda_version_number}-cudnn{cudnn_version_number}-{runtime|devel}-ubuntu{ubuntu_version}
Refer https://hub.docker.com/r/nvidia/cuda/tags for valid set of values
Note: We use deadsnakes ubuntu ppa to add Python that currently supports only Ubuntu 18.04, 20.04 and 22.04
PythonBuildCommand
Union, One Of:PythonTaskConfig
π PythonTaskConfigImageSpecify the image spec for the task
Configure environment variables to be injected in the task either as plain text or secrets. Docs
π PythonTaskConfigMountsItemConfigure data to be mounted to Workflow pod(s) as a volume.
PythonTaskConfigImage
Union, One Of:PythonTaskConfigMountsItem
Union, One Of:PyTorchFramework
Type of the framework
QuayArtifactsRegistry
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
The URL of the registry.
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
QuayBasicAuth
The username for the basic authentication.
The password for the basic authentication.
QuayProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
The name of the account on Quay.
List of integrations that are associated with the provider account.
RateLimitConfig
Name of the rate limiting configuration
π RateLimitRuleList of rate limit rules
RateLimitRule
Unique identifier for the rule
Maximum number of requests or tokens allowed
RateLimitWhen
List of subjects that this rule applies to
List of models that this rule applies to
Metadata key-value pairs that this rule applies to
Recommendation
recommendation_data
typing_extensions.Annotated[(typing.Dict[(str, typing.Optional[typing.Any])], Any)]
RefusalContentPart
Type of the content part
Reason
RegisterUsersResponse
Union, One Of:RemoteSource
Resources
Requested CPU which determines the minimum cost incurred. The CPU usage can exceed the requested
amount, but not the value specified in the limit. 1 CPU means 1 CPU core. Fractional CPU can be requested
like
0.5
or 0.05
CPU limit beyond which the usage cannot be exceeded. 1 CPU means 1 CPU core. Fractional CPU can be requested
like
0.5
. CPU limit should be >= cpu request.Requested memory which determines the minimum cost incurred. The unit of memory is in megabytes(MB).
So 1 means 1 MB and 2000 means 2GB.
Memory limit after which the application will be killed with an OOM error. The unit of memory is
in megabytes(MB). So 1 means 1 MB and 2000 means 2GB. MemoryLimit should be greater than memory request.
Requested disk storage. The unit of memory is in megabytes(MB).
This is ephemeral storage and will be wiped out on pod restarts or eviction
Disk storage limit. The unit of memory is in megabytes(MB). Exceeding this limit will result in eviction.
It should be greater than the request. This is ephemeral storage and will be wiped out on pod restarts or eviction
Define the shared memory requirements for your workload. Machine learning libraries like Pytorch can use Shared Memory
for inter-process communication. If you use this, we will mount a
tmpfs
backed volume at the /dev/shm
directory.
Any usage will also count against the workloadβs memory limit (resources.memory_limit
) along with your workloadβs memory usage.
If the overall usage goes above resources.memory_limit
the user process may get killed.
Shared Memory Size cannot be more than the defined Memory Limit for the workload.π ResourcesNodeThis field determines how the underlying node resource is to be utilized
π ResourcesDevicesItemDefine custom device or accelerator requirements for your workload. We currently support NVIDIA GPUs, AWS Inferentia Accelerators, Single Host TPU Slices.
ResourcesDevicesItem
Union, One Of:ResourcesNode
Union, One Of:RevokeAllPersonalAccessTokenResponse
Union, One Of:Rolling
Percentage of total replicas that can be brought down at one time.
For a value of 25 when replicas are set to 12 this would mean minimum (25% of 12) = 3 pods might be unavailable during the deployment.
Setting this to a higher value can help in speeding up the deployment process.
Percentage of total replicas of updated image that can be brought up over the total replicas count.
For a value of 25 when replicas are set to 12 this would mean (12
RpsMetric
Average request per second averaged over all replicas that autoscaler should try to maintain
RStudio
Schedule
Specify the schedule for this job to be run periodically in cron format.
π ScheduleConcurrencyPolicyChoose whether to allow this job to run while another instance of the job is running, or to replace the currently running instance. Allow
will enable multiple instances of this job to run. Forbid will keep the current instance of the job running and stop a new instance from being run.
Replace will terminate any currently running instance of the job and start a new one.
Timezone against which the cron schedule will be calculated, e.g. βAsia/Tokyoβ. Default is machineβs local time.
https://docs.truefoundry.com/docs/list-of-supported-timezones
Secret
SecretGroup
SecretInput
Key of the secret.
Value of the secret.
SecretMount
Please enter a valid file path
The TrueFoundry secret whose value will be the file content.
SecretVersion
SelfHostedModel
2 to 62 characters long alphanumeric word, may contain - in between, cannot start with a number
The name of the hosted model
enter valid https/http URL that should not end with trailing slash
π SelfHostedModelModelServerThe type of model server being used
π ModelTypeSpecify the type of the model
π SelfHostedModelAuthDataSelfHostedModel authentication data for the integration.
SelfHostedModel headers for the integration. Forwarded to the provider as is. For example:
\{"Authorization": "APIKey \<token\>"\}
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
SelfHostedModelAuthData
Union, One Of:SelfHostedModelProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
π SelfHostedModelList of integrations that are associated with the provider account.
Service
π ServiceReplicasDeploy multiple instances of your pods to distribute incoming traffic across them, ensuring effective load balancing.
Whether to allow intercepts to be applied for this service.
This would inject an additional sidecar in each pod of the service. Not recommended on production
π ServiceRolloutStrategyStrategy to dictate how a rollout should happen when a new release for this service is made Docs
ServiceAutoscaling
π ServiceAutoscalingMetricsMetrics to use for the autoscaler
ServiceAutoscalingMetrics
Union, One Of:ServiceReplicas
Union, One Of:ServiceRolloutStrategy
Union, One Of:Session
SignedUrl
SklearnFramework
Type of the framework
Relative path to the model file in the model version contents
SklearnModelSchema
π InferMethodNameName of the method used for inference
Schema of the input
Schema of the output
SlackBot
Specify the notification channel to send alerts to
3 to 81 lower case characters long alphanumeric word, may contain - or _ in between, must start with #
SlackBotAuth
The bot token for the slack bot to send messages to the channels. This bot should have the following permissions: chat:write, chat:write.public
SlackBotIntegration
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
SlackIntegrations
Union, One Of:SlackProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
π SlackIntegrationsList of integrations that are associated with the provider account.
SlackWebhook
Specify the notification channel to send alerts to
SlackWebhookAuth
The URL of the slack webhook
SlackWebhookIntegration
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
SmtpCredentials
The SMTP host of the email server.
The username for the SMTP server.
The password for the SMTP server.
The port of the SMTP server.
Whether to use TLS for the SMTP server.
SpaCyFramework
Type of the framework
SparkBuild
Spark version should match the spark version installed in the image.
Build path relative to project root path.
Path to
requirements.txt
relative to
Path to build context
SparkConfig
Must not contain any spaces.
SparkDriverConfig
SparkExecutorConfig
SparkExecutorConfigInstances
Union, One Of:SparkExecutorDynamicScaling
Minimum number of instances to start / scale down to
Maximum number of instances to scale up to
SparkExecutorFixedInstances
Number of instances to start
SparkImage
Spark version should match the spark version installed in the image.
The image URI. Specify the name of the image and the tag.
If the image is in Dockerhub, you can skip registry-url (for e.g.
tensorflow/tensorflow
).
You can use an image from a private registry using Advanced fieldsFQN of the container registry. If you canβt find your registry here,
add it through the Integrations page
SparkImageBuild
FQN of the container registry. If you canβt find your registry here,
add it through the Integrations page
π SparkImageBuildBuildSourceTODO (gw): The following is a hack till the uiType GitSelect is fixed fron frontend
SparkImageBuildBuildSource
Union, One Of:SparkJob
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
π SparkJobImageThe image to use for driver and executors. Must have spark installed. Spark version must match the version in the image.
Configure environment variables to be injected in the service either as plain text. Docs
Extra configuration properties to be passed to the spark job. Docs
π VolumeMountConfigure volumes to be mounted to driver and executors. Docs
Specify the maximum number of attempts to retry a job before it is marked as failed.
SparkJobEntrypoint
Union, One Of:- π SparkJobPythonEntrypoint
- π SparkJobScalaEntrypoint
- π SparkJobJavaEntrypoint
- π SparkJobPythonNotebookEntrypoint
- π SparkJobScalaNotebookEntrypoint
SparkJobImage
Union, One Of:SparkJobJavaEntrypoint
Filename should have .jar extension
The main class must be a valid Java class name.
Arguments to be passed to the main application file.
SparkJobPythonEntrypoint
Filename should have .py extension
Arguments to be passed to the main application file.
SparkJobPythonNotebookEntrypoint
Filename should have .ipynb extension
SparkJobScalaEntrypoint
Filename should have .jar extension
The main class must be a valid Java class name.
Arguments to be passed to the main application file.
SparkJobScalaNotebookEntrypoint
Filename should have .ipynb extension
SparkJobTriggerInput
Main Class for Spark Job
Main Application File for Spark Job
Arguments to pass to the main application file
SqsInputConfig
AWS SQS Queue URL of Subscriber
AWS Region Name
A period during which Amazon SQS prevents all consumers from receiving and processing the message. If one message takes 5 seconds to process, you can set this number to 7 or any number higher than 5. This will ensure that while the message is being processed, it will not be available to other replicas. For more information, see here
Wait timeout for long polling. For more information, see here
SqsOutputConfig
AWS SQS Queue URL of Publisher
AWS Region Name
SqsQueueMetricConfig
Upper limit of the number of backlog messages the auto-scaler will try to maintain per replica. If you set this number to 10 and have 30 messages in the queue and one replica, the auto-scaler will scale the number of replicas to 3.
SshServer
Add Your SSH Public Key, this will be used to authenticate you to the SSH Server. You can find it using
cat ~/.ssh/id_rsa.pub
in Mac/Linux or type $home\.ssh\id_rsa.pub
in Windows Powershell. You can also generate a new SSH key pair using ssh-keygen -t rsa
in your local terminal. (same for both Mac/Linux and Windows Powershell)Stop the SSH Server instance after this much time in minutes of inactivity. The instance is considered active if there is at least one active SSH connection (a client connected to the SSH server), or if a background job is running using tmux or screen, or if the pod has restarted.
SshServerConfig
Must not contain any spaces.
The port for the cluster with which you can access your SSH containers
StageArtifactRequestManifest
Union, One Of:StageArtifactResponse
StaticVolumeConfig
Volume Type for the volume.
Persistent Volume Name of the volume to be used.
StatsModelsFramework
Type of the framework
StringDataMount
Please enter a valid file path
The file content.
Subject
Subject ID
π SubjectTypeSubject type
Subject slug
Subject display name
SystemMessage
Role of the message
π SystemMessageContentText content for the system message
Name of the system
SystemMessageContent
Union, One Of:TaskDockerFileBuild
FQN of the container registry. If you canβt find your registry here,
add it through the Integrations page
The file path of the Dockerfile relative to project root path.
TaskPySparkBuild
Spark version should match the spark version installed in the image.
FQN of the container registry. If you canβt find your registry here,
add it through the Integrations page
Path to
requirements.txt
relative to
Path to build context
Define pip package requirements.
In Python/YAML E.g. [βfastapi>=0.90,<1.0β, βuvicornβ]
Debian packages to install via
apt get
.
In Python/YAML E.g. [βgitβ, βffmpegβ, βhtopβ]TaskPythonBuild
FQN of the container registry. If you canβt find your registry here,
add it through the Integrations page
Please enter a valid Python version tag
Path to
requirements.txt
relative to
Path to build context
Define pip package requirements.
In Python/YAML E.g. [βfastapi>=0.90,<1.0β, βuvicornβ]
Debian packages to install via
apt get
.
In Python/YAML E.g. [βgitβ, βffmpegβ, βhtopβ]Version of CUDA Toolkit and CUDNN to install in the image
These combinations are based off of publically available docker images on docker hub
You can also specify a valid tag of the form {cuda_version_number}-cudnn{cudnn_version_number}-{runtime|devel}-ubuntu{ubuntu_version}
Refer https://hub.docker.com/r/nvidia/cuda/tags for valid set of values
Note: We use deadsnakes ubuntu ppa to add Python that currently supports only Ubuntu 18.04, 20.04 and 22.04
Team
TeamManifest
Use of only alpha numeric character and β-β is allowed as team name and must be less than 25 characters
Enter email of each of the user you want to add in the team.
TensorFlowFramework
Type of the framework
TerminateJobResponse
Terminate Job Message
π JobRunStatusJob run status
TextContentPart
Type of the content part
π TextContentPartTextText content for the message
TextContentPartText
Union, One Of:TogetherAiKeyAuth
API key must not be empty
TogetherAiModel
2 to 62 characters long alphanumeric word, may contain - in between, cannot start with a number
Model ID must not be empty
π ModelTypeSpecify the type of the Together AI model
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
TogetherAiProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
List of integrations that are associated with the Together AI provider account
π CollaboratorList of users who have access to this provider account
TokenPagination
ToolCall
Type of the tool call
Unique identifier for the tool call
ToolMessage
Role of the message
π ToolMessageContentContent of the tool call result
Unique identifier for the tool call
ToolMessageContent
Union, One Of:ToolSchema
Type of the tool
TracingProject
TracingProjectManifest
Name of the tracing project
Name of the ML Repo
TransformersFramework
Type of the framework
π LibraryNameName of the library for the framework
The
pipeline()
task this model can be used with e.g. text-generation
. See huggingface docs for all possible valuesBase model Id from HuggingFace Hub. If this is a finetuned model, this points to the base model id used for finetuning.
TriggerJobRequestInput
Union, One Of:TriggerJobRunResponse
Job triggered
Name of the job run
π JobRunDetails of the triggered job run
TrueFoundryApplyRequestManifest
Union, One Of:- π MlRepoManifest
- π ArtifactManifest
- π ModelManifest
- π ChatPromptManifest
- π DataDirectory
- π Service
- π ApplicationSet
- π ProviderAccounts
- π ClusterManifest
- π WorkspaceManifest
- π Job
- π Helm
- π Volume
- π Notebook
- π RStudio
- π Workflow
- π AsyncService
- π SshServer
- π SparkJob
- π GatewayConfig
- π TeamManifest
- π PolicyManifest
- π AlertConfig
- π VirtualAccountManifest
- π CommonToolsSettings
- π AiFeaturesSettings
TrueFoundryApplyResponse
existing_manifest
typing_extensions.Annotated[(typing.Optional[TrueFoundryApplyResponseExistingManifest], Any)]
π TrueFoundryApplyResponseExistingManifestThe existing manifest of the resource
π TrueFoundryApplyResponseActionThe action performed: CREATE or UPDATE
TrueFoundryApplyResponseExistingManifest
Union, One Of:- π MlRepoManifest
- π ArtifactManifest
- π ModelManifest
- π ChatPromptManifest
- π DataDirectory
- π Service
- π ApplicationSet
- π ProviderAccounts
- π ClusterManifest
- π WorkspaceManifest
- π Job
- π Helm
- π Volume
- π Notebook
- π RStudio
- π Workflow
- π AsyncService
- π SshServer
- π SparkJob
- π GatewayConfig
- π TeamManifest
- π PolicyManifest
- π AlertConfig
- π VirtualAccountManifest
- π CommonToolsSettings
- π AiFeaturesSettings
TrueFoundryArtifactSource
Artifact or Model Version FQN of the artifact to be downloaded
Environment variable which will contain the download path of the artifact
TrueFoundryDbssm
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
TrueFoundryDeleteRequestManifest
Union, One Of:- π MlRepoManifest
- π ArtifactManifest
- π ModelManifest
- π ChatPromptManifest
- π DataDirectory
- π Service
- π ApplicationSet
- π ProviderAccounts
- π ClusterManifest
- π WorkspaceManifest
- π Job
- π Helm
- π Volume
- π Notebook
- π RStudio
- π Workflow
- π AsyncService
- π SshServer
- π SparkJob
- π GatewayConfig
- π TeamManifest
- π PolicyManifest
- π AlertConfig
- π VirtualAccountManifest
- π CommonToolsSettings
- π AiFeaturesSettings
TrueFoundryInteractiveLogin
List of paths that will bypass auth.
needs to start with a forward slash(/) and should not contain wildcards(*)
TrueFoundryManagedSource
Type of the source
TrueFoundryProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
List of integrations that are associated with the provider account.
TtlProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
List of integrations that are associated with the provider account.
TtlRegistry
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
The URL of the registry.
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
UpdateSecretInput
Key of the secret.
Value of the secret.
UpdateUserRolesResponse
Union, One Of:UpgradeData
UsageLimits
Maximum number of tokens processed per minute
Maximum number of requests processed per minute
User
UserMessage
Role of the message
π UserMessageContentContent of the user message. can be a mix of text and images
Name of the user this message is from
UserMessageContent
Union, One Of:UserMessageContentItem
Union, One Of:UserMetadata
user_object
typing_extensions.Annotated[(typing.Optional[typing.Dict[(str, typing.Optional[typing.Any])]], Any)]
ValidationError
ValidationErrorLocItem
Union, One Of:VertexModel
2 to 62 characters long alphanumeric word, may contain - in between, cannot start with a number
π ModelTypeSpecify the type of the model
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
VertexModelV2
2 to 62 characters long alphanumeric word, may contain - in between, cannot start with a number
Model ID must not be empty
π ModelTypeSpecify the type of the Vertex AI model (e.g., chat, text, etc.)
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
VirtualAccount
VirtualAccountManifest
3 to 25 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
Expiration date of the virtual account
Volume
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
π VolumeConfigVolume Configuration, can be either Dynamically provisioned or statically provisioned.
VolumeBrowser
3 to 10 lower case characters long alphanumeric word, may contain - in between, cannot start with a number.
TFY Secret containing the password for logging in the volume browser.
Kubernetes Service account name for the volume browser.
VolumeConfig
Union, One Of:VolumeMount
Please enter a valid mount path
Sub path within the volume to mount. Defaults to root of the volume.
The TrueFoundry volume that needs to be mounted.
WebhookBasicAuth
Basic auth username
Basic auth password
WebhookBearerAuth
Bearer token for authentication
Prefix for the token
WebhookIntegration
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
The URL of the webhook endpoint
π WebhookIntegrationAuthDataAuthentication configuration for the webhook
List of subjects that are authorized to access this integration. List of user fqn in format <user_type>:<username>.
WebhookIntegrationAuthData
Union, One Of:WebhookProviderAccount
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
π WebhookIntegrationList of integrations that are associated with the provider account.
WeightBasedLoadBalancingRule
Unique identifier for the rule
π LoadBalanceTargetList of targets for load balancing with weights
WorkbenchImage
The image URI. Specify the name of the image and the tag.
If the image is in Dockerhub, you can skip registry-url (for e.g.
tensorflow/tensorflow
).
You can use an image from a private registry using Advanced fieldsThe build script to run when building the image.
This will be executed as the last step in the docker build process as the root user (RUN DEBIAN_FRONTEND=noninteractive bash -ex build_script.sh)
FQN of the container registry. If you canβt find your registry here,
add it through the Integrations page
WorkerConfig
π WorkerConfigInputConfigInput Config
π WorkerConfigOutputConfigOutput Config
Number of concurrent workers to spawn for the processor
WorkerConfigInputConfig
Union, One Of:WorkerConfigOutputConfig
Union, One Of:- π SqsOutputConfig
- π NatsOutputConfig
- π CoreNatsOutputConfig
- π KafkaOutputConfig
- π AmqpOutputConfig
Workflow
3 to 32 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
WorkflowAlert
Send an alert when the job completes
Send an alert when the job fails
WorkflowFlyteEntitiesItem
Union, One Of:WorkflowSource
Union, One Of:Workspace
WorkspaceManifest
3 to 25 lower case characters long alphanumeric word, may contain - in between, cannot start with a number
Tag the environment like dev, staging or production. You will need to create environments if you donβt have already.
WorkspacesDeleteResponse
Destroyed the workspace
XgBoostFramework
Type of the framework
Relative path to the model file in the model version contents
XgBoostModelSchema
Name of the method used for inference
Schema of the input
Schema of the output