Get Deployment Specifications

curl --request GET \
  --url 'https://{controlPlaneURL}/api/svc/v1/model-catalogues/deployment-specs?workspaceId=<workspaceId>' \
  --header 'Authorization: Bearer <token>'

{
  "data": [
    {
      "name": "vLLM",
      "displayName": "<string>",
      "description": "<string>",
      "openAPIDescription": "<string>",
      "deployments": [
        {
          "name": "<string>",
          "spec": {},
          "isAvailableInWorkspace": true,
          "cost": {},
          "isTRTLLMEngineAvailable": true
        }
      ]
    }
  ]
}

GET /api/svc/v1/model-catalogues/deployment-specs

Authorizations

Authorization (string, header, required)

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Query Parameters

workspaceId (string, required)

ID of the workspace.

modelVersionFqn (string)

Fully qualified name (FQN) of the model version.

huggingfaceHubUrl (string)

Hugging Face Hub URL of the model.

huggingfaceHubTokenSecretFqn (string)

FQN of the secret that holds the Hugging Face Hub token.

pipelineTagOverride (string)

Pipeline tag of the model for which the deployment spec is being generated.
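
The query parameters above can be assembled into a request as in the following minimal sketch. It uses only the Python standard library and does not send anything over the network. The function name `build_deployment_specs_request` and the example host are hypothetical and not part of the TrueFoundry API; only the endpoint path, the header format, and the parameter names come from this page.

```python
from typing import Optional
from urllib.parse import urlencode
from urllib.request import Request

# Endpoint path as shown in the curl example on this page.
ENDPOINT_PATH = "/api/svc/v1/model-catalogues/deployment-specs"


def build_deployment_specs_request(
    control_plane_url: str,
    token: str,
    workspace_id: str,
    model_version_fqn: Optional[str] = None,
    huggingface_hub_url: Optional[str] = None,
    huggingface_hub_token_secret_fqn: Optional[str] = None,
    pipeline_tag_override: Optional[str] = None,
) -> Request:
    """Build (but do not send) a GET request for deployment specifications.

    Hypothetical helper for illustration; not part of any official SDK.
    """
    params = {
        "workspaceId": workspace_id,  # required
        "modelVersionFqn": model_version_fqn,
        "huggingfaceHubUrl": huggingface_hub_url,
        "huggingfaceHubTokenSecretFqn": huggingface_hub_token_secret_fqn,
        "pipelineTagOverride": pipeline_tag_override,
    }
    # Drop optional parameters that were not supplied, then URL-encode the rest.
    query = urlencode({k: v for k, v in params.items() if v is not None})
    url = f"{control_plane_url.rstrip('/')}{ENDPOINT_PATH}?{query}"
    return Request(url, headers={"Authorization": f"Bearer {token}"}, method="GET")


if __name__ == "__main__":
    # Placeholder host and token; replace with your control plane URL and auth token.
    req = build_deployment_specs_request(
        "https://example.invalid",
        "<token>",
        "<workspaceId>",
        huggingface_hub_url="https://huggingface.co/org/model",
    )
    print(req.get_method(), req.full_url)
```

Passing the returned object to `urllib.request.urlopen` would perform the call. Whether `modelVersionFqn` and `huggingfaceHubUrl` may be combined in one request is not stated on this page, so the sketch leaves that choice to the caller.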

Response

200 (application/json)

Returns deployment specifications for the model version or Hugging Face model URL.

The response is of type object.
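
As a rough illustration of consuming the response, the sketch below walks the structure shown in the example payload on this page (a `data` list whose entries each carry a `deployments` list) and collects the deployments flagged `isAvailableInWorkspace`. The helper name `available_deployments` and the sample values are hypothetical, and the meaning of each field is inferred from its name rather than documented here.

```python
import json
from typing import Any, Dict, List


def available_deployments(payload: Dict[str, Any]) -> Dict[str, List[str]]:
    """Map each spec name to the deployment names marked available in the workspace.

    Hypothetical helper; field semantics are assumed from the field names.
    """
    result: Dict[str, List[str]] = {}
    for entry in payload.get("data", []):
        names = [
            deployment.get("name")
            for deployment in entry.get("deployments", [])
            if deployment.get("isAvailableInWorkspace")
        ]
        result[entry.get("name")] = names
    return result


# Sample payload shaped like the example response; the values are made up.
SAMPLE_RESPONSE = json.loads(
    """
    {
      "data": [
        {
          "name": "vLLM",
          "displayName": "example display name",
          "description": "example description",
          "openAPIDescription": "example OpenAPI description",
          "deployments": [
            {"name": "option-a", "spec": {}, "isAvailableInWorkspace": true,
             "cost": {}, "isTRTLLMEngineAvailable": false},
            {"name": "option-b", "spec": {}, "isAvailableInWorkspace": false,
             "cost": {}, "isTRTLLMEngineAvailable": true}
          ]
        }
      ]
    }
    """
)

if __name__ == "__main__":
    print(available_deployments(SAMPLE_RESPONSE))
```

Using `.get` with defaults keeps the helper tolerant of missing keys, which seems prudent because this page does not state which response fields are always present.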
