GET /api/svc/v1/model-catalogues/nim-deployment-specs
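curl --request GET \
  --url 'https://{controlPlaneURL}/api/svc/v1/model-catalogues/nim-deployment-specs?workspaceId=<workspaceId>&nimModelId=<nimModelId>&nvcrDockerRegistryProviderIntegrationFqn=<fqn>&ngcApiKeySecretFqn=<fqn>' \
  --header 'Authorization: Bearer <token>'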
{
  "data": [
    {
      "name": "vLLM",
      "displayName": "<string>",
      "description": "<string>",
      "openAPIDescription": "<string>",
      "deployments": [
        {
          "name": "<string>",
          "spec": {},
          "isAvailableInWorkspace": true,
          "cost": {},
          "isTRTLLMEngineAvailable": true
        }
      ]
    }
  ]
}

Authorizations

Authorization
string
header
required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Query Parameters

workspaceId
string
required

Workspace ID

nimModelId
string
required

NVIDIA NIM container ID. Format: <org>/<team>/<model>, e.g. nim/meta/llama3-8b-instruct

nvcrDockerRegistryProviderIntegrationFqn
string
required

Fully qualified name (FQN) of the Custom Registry Provider Integration pointing to nvcr.io

ngcApiKeySecretFqn
string
required

FQN of the Secret containing the NGC API key
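
All four query parameters are required, and nimModelId contains "/" characters, so it can help to let curl build and percent-encode the query string. The following is a minimal sketch, not an official client: the control plane URL, workspace ID, and both FQN values are hypothetical placeholders, and only the parameter names and the example model ID come from this page. It prints the command by default and sends the request only when RUN=1 is set.

```shell
#!/bin/sh
# Sketch only: the host, workspace ID, and FQN values are hypothetical placeholders.
CONTROL_PLANE_URL="https://control-plane.example.com"
ENDPOINT="${CONTROL_PLANE_URL}/api/svc/v1/model-catalogues/nim-deployment-specs"
TOKEN="${TOKEN:-<token>}"

# --get tells curl to append each --data-urlencode pair to the URL as a
# query string instead of sending it as a request body; the value part of
# each pair is percent-encoded by curl.
set -- \
  --silent --show-error --fail \
  --get \
  --url "${ENDPOINT}" \
  --header "Authorization: Bearer ${TOKEN}" \
  --data-urlencode "workspaceId=my-workspace-id" \
  --data-urlencode "nimModelId=nim/meta/llama3-8b-instruct" \
  --data-urlencode "nvcrDockerRegistryProviderIntegrationFqn=my-nvcr-integration-fqn" \
  --data-urlencode "ngcApiKeySecretFqn=my-ngc-secret-fqn"

# Print the assembled command for review; set RUN=1 to actually send it.
echo "curl $*"
if [ "${RUN:-0}" = "1" ]; then
  curl "$@"
fi
```

Keeping each parameter on its own --data-urlencode line makes it easy to see that none of the required parameters is missing before the request is sent.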

Response

200
application/json

Returns the deployment specifications for the NIM model.

The response is of type object.