Create Batch
Creates a new batch process.
POST /batches
curl --request POST \
  --url https://{controlPlaneURL}/api/llm/api/inference/openai/batches \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '{
    "input_file_id": "<string>",
    "completion_window": "<string>",
    "endpoint": "<string>",
    "job_name": "<string>"
  }'
{
  "id": "<string>",
  "object": "<string>",
  "endpoint": "<string>",
  "errors": "<any>",
  "input_file_id": "<string>",
  "completion_window": "<string>",
  "status": "<string>",
  "output_file_id": "<string>",
  "error_file_id": "<string>",
  "created_at": 123,
  "in_progress_at": 123,
  "expires_at": 123,
  "finalizing_at": 123,
  "completed_at": 123,
  "failed_at": 123,
  "expired_at": 123,
  "cancelling_at": 123,
  "cancelled_at": 123,
  "request_counts": {
    "total": 123,
    "completed": 123,
    "failed": 123
  },
  "metadata": "<any>"
}
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Body
application/json
Batch processing parameters
The body is of type object.
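As a rough illustration, the request from the curl example could be assembled in Python with only the standard library. This is a sketch, not an official client: the function name `build_create_batch_request` is hypothetical, and the sample values in the usage section (`gateway.example.invalid`, `file-abc123`, `24h`, `/v1/chat/completions`, `nightly-eval`) are assumptions, since this page documents each field only as `<string>`. The request is built but not sent.

```python
import json
import urllib.request


def build_create_batch_request(control_plane_url, token, input_file_id,
                               completion_window, endpoint, job_name):
    """Build (but do not send) the POST request shown in the curl example.

    The helper name and signature are hypothetical; the URL path, headers,
    and body field names are taken from the curl example on this page.
    """
    url = f"https://{control_plane_url}/api/llm/api/inference/openai/batches"
    payload = {
        "input_file_id": input_file_id,
        "completion_window": completion_window,
        "endpoint": endpoint,
        "job_name": job_name,
    }
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    # All values below are placeholders chosen for illustration only.
    req = build_create_batch_request(
        control_plane_url="gateway.example.invalid",
        token="<token>",
        input_file_id="file-abc123",
        completion_window="24h",
        endpoint="/v1/chat/completions",
        job_name="nightly-eval",
    )
    print(req.get_method(), req.full_url)
    # To actually send it: urllib.request.urlopen(req)
```

Keeping construction separate from sending makes the payload easy to inspect or log before any network call is made.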
Response
200
application/json
Batch created successfully
The response is of type object.
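One way to consume the response object is to pull out the fields a caller typically tracks. The sketch below is illustrative only: `summarize_batch` is a hypothetical helper, the sample `status` value is an assumption (this page does not list the possible values), and the `pending` figure assumes `total` counts every request in the batch, which the page does not state.

```python
import json


def summarize_batch(response_body: str) -> dict:
    """Extract a small summary from a Create Batch response body.

    Hypothetical helper. Field names come from the response example on this
    page; "pending" is derived here and is not a field the API returns.
    """
    batch = json.loads(response_body)
    counts = batch.get("request_counts") or {}
    total = counts.get("total", 0)
    completed = counts.get("completed", 0)
    failed = counts.get("failed", 0)
    return {
        "id": batch.get("id"),
        "status": batch.get("status"),
        "output_file_id": batch.get("output_file_id"),
        # Assumption: total = completed + failed + not-yet-finished requests.
        "pending": total - completed - failed,
    }


if __name__ == "__main__":
    # Sample body with made-up values, shaped like the response example.
    sample = json.dumps({
        "id": "batch_123",
        "status": "in_progress",
        "output_file_id": None,
        "request_counts": {"total": 10, "completed": 6, "failed": 1},
    })
    print(summarize_batch(sample))
```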