POST
/
cloud
/
v3
/
inference
/
applications
/
{project_id}
/
deployments
Python
from gcore import Gcore

client = Gcore(
    api_key="My API Key",
)
task_id_list = client.cloud.inference.applications.deployments.create(
    project_id=1,
    application_name="demo-app",
    components_configuration={
        "model": {
            "exposed": True,
            "flavor": "inference-16vcpu-232gib-1xh100-80gb",
            "scale": {
                "max": 1,
                "min": 1,
            },
        }
    },
    name="name",
    regions=[1, 2],
)
print(task_id_list.tasks)
{
  "tasks": [
    "d478ae29-dedc-4869-82f0-96104425f565"
  ]
}

Authorizations

Authorization
string
header
required

API key for authentication. Make sure to include the word apikey, followed by a single space and then your token. Example: apikey 1234$abcdef

Path Parameters

project_id
integer
required

Project ID

Examples:

1

Body

application/json
application_name
string
required

Identifier of the application from the catalog

Examples:

"demo-app"

components_configuration
object
required

Mapping of component names to their configuration (e.g., "model": {...})

Examples:
{
"model": {
"exposed": true,
"flavor": "inference-16vcpu-232gib-1xh100-80gb",
"scale": { "max": 1, "min": 1 }
}
}
name
string
required

Desired name for the new deployment

Maximum length: 15
regions
integer[]
required

Geographical regions where the deployment should be created

Examples:
[1, 2]
api_keys
string[]

List of API keys for the application

Examples:
["key1", "key2"]

Response

200 - application/json

OK

tasks
string[]
required

List of task IDs representing asynchronous operations. Use these IDs to monitor operation progress: * GET /v1/tasks/{task_id} - Check individual task status and details Poll task status until completion (FINISHED/ERROR) before proceeding with dependent operations.

Examples:
["d478ae29-dedc-4869-82f0-96104425f565"]