Inference Instances
Stop inference deployment
This operation shuts down an inference deployment, making it unavailable for handling requests. The deployment will scale down to 0 replicas, overriding any minimum replica settings.
- Once stopped, the deployment will not process any inference requests or SQS messages.
- It will not restart automatically and must be started manually.
- While stopped, the deployment will not incur any charges.
POST
Authorizations
API key for authentication.
Path Parameters
Project ID
Examples:
1
Inference instance name.
Required string length:
4 - 30
Examples:
"my-instance"
Response
204
No Content