PATCH /cloud/v3/inference/applications/{project_id}/deployments/{deployment_name}
Update inference application deployment
curl --request PATCH \
  --url https://api.gcore.com/cloud/v3/inference/applications/{project_id}/deployments/{deployment_name} \
  --header 'Authorization: apikey <token>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "api_keys": [
    "key1",
    "key2"
  ],
  "components_configuration": {
    "model": {
      "scale": {
        "max": 2
      }
    }
  },
  "regions": [
    1,
    2
  ]
}
'
{
  "tasks": [
    "d478ae29-dedc-4869-82f0-96104425f565"
  ]
}

Authorizations

Authorization
string
header
required

API key for authentication. Make sure to include the word apikey, followed by a single space and then your token. Example: apikey 1234$abcdef
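The header format described above can be sketched as a small helper. This is an illustration only: `build_auth_header` is a hypothetical name, not part of any Gcore SDK, and the token is the placeholder from the example above.

```python
def build_auth_header(token: str) -> dict[str, str]:
    """Return the Authorization header in the 'apikey <token>' form."""
    token = token.strip()
    # Avoid doubling the prefix if the caller already included it.
    if token.lower().startswith("apikey "):
        token = token[len("apikey "):].strip()
    if not token:
        raise ValueError("token must not be empty")
    # The word 'apikey', a single space, then the token.
    return {"Authorization": f"apikey {token}"}


if __name__ == "__main__":
    print(build_auth_header("1234$abcdef"))
```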

Path Parameters

project_id
integer
required

Project ID

Example:

1

deployment_name
string
required

Name of deployment
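As a rough sketch, the two path parameters can be substituted into the endpoint URL as follows. `deployment_url` is a hypothetical helper. Percent-encoding the deployment name is a defensive assumption, since this page does not say which characters a name may contain.

```python
from urllib.parse import quote

# Host and path are taken from the curl example on this page.
BASE_URL = "https://api.gcore.com/cloud/v3/inference/applications"


def deployment_url(project_id: int, deployment_name: str) -> str:
    """Build the endpoint URL for one deployment."""
    if not deployment_name:
        raise ValueError("deployment_name is required")
    # project_id is an integer; the name is percent-encoded so that any
    # reserved character cannot alter the path structure.
    name = quote(deployment_name, safe="")
    return f"{BASE_URL}/{int(project_id)}/deployments/{name}"


if __name__ == "__main__":
    print(deployment_url(1, "my-deployment"))
```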

Body

application/json
api_keys
string[]

List of API keys for the application

Example:
["key1", "key2"]
components_configuration
Components Configuration · object

Mapping of component names to their configuration (e.g., "model": {...})

Examples:
{ "model": { "scale": { "max": 2 } } }
{
  "model": {
    "flavor": "inference-16vcpu-232gib-1xh100-80gb"
  }
}
regions
integer[]

Geographical regions to be updated for the deployment

Example:
[1, 2]
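None of the three body fields is marked as required, so a request may carry only the fields being changed. The sketch below builds such a body and wraps it in a PATCH request using Python's standard library. Two things are assumptions: that omitted fields are left unchanged (usual for PATCH, but not stated on this page), and the helper names `build_update_body` and `build_patch_request`, which are illustrative only.

```python
import json
import urllib.request


def build_update_body(api_keys=None, components_configuration=None, regions=None) -> dict:
    """Collect only the fields the caller actually supplied."""
    body = {}
    if api_keys is not None:
        body["api_keys"] = list(api_keys)
    if components_configuration is not None:
        body["components_configuration"] = dict(components_configuration)
    if regions is not None:
        body["regions"] = list(regions)
    return body


def build_patch_request(url: str, token: str, body: dict) -> urllib.request.Request:
    """Prepare (but do not send) the PATCH request."""
    return urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"apikey {token}",
            "Content-Type": "application/json",
        },
        method="PATCH",
    )


if __name__ == "__main__":
    body = build_update_body(components_configuration={"model": {"scale": {"max": 2}}})
    print(json.dumps(body))
```

Passing the prepared request to `urllib.request.urlopen` would send it. The sketch stops short of that so it can be inspected without network access.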

Response

200 - application/json

OK

tasks
string[]
required

List of task IDs representing asynchronous operations. Use these IDs to monitor operation progress:

  • GET /v1/tasks/{task_id} - Check individual task status and details

Poll task status until completion (FINISHED/ERROR) before proceeding with dependent operations.
Example:
["d478ae29-dedc-4869-82f0-96104425f565"]
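The polling step described for `tasks` might look like the sketch below. It takes a caller-supplied `fetch_state` function that is expected to call GET /v1/tasks/{task_id} and return the task's state as a string. The shape of that response is not documented on this page, so extracting the state is left to the caller. `wait_for_tasks`, the interval, and the timeout are all illustrative choices.

```python
import time
from typing import Callable, Iterable

# Terminal states named in the response description above.
TERMINAL_STATES = {"FINISHED", "ERROR"}


def wait_for_tasks(
    task_ids: Iterable[str],
    fetch_state: Callable[[str], str],
    interval: float = 2.0,
    timeout: float = 300.0,
    sleep: Callable[[float], None] = time.sleep,
) -> dict[str, str]:
    """Poll every task until each one reaches FINISHED or ERROR."""
    pending = set(task_ids)
    states: dict[str, str] = {}
    deadline = time.monotonic() + timeout
    while pending:
        # sorted() copies the set, so discarding inside the loop is safe.
        for task_id in sorted(pending):
            state = fetch_state(task_id)
            states[task_id] = state
            if state in TERMINAL_STATES:
                pending.discard(task_id)
        if not pending:
            break
        if time.monotonic() >= deadline:
            raise TimeoutError(f"tasks still running: {sorted(pending)}")
        sleep(interval)
    return states
```

A caller would typically check the returned mapping for any `ERROR` entry before starting operations that depend on the update.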