You can deploy AI models only if they fit within your quota. Because a new Gcore account starts with a quota of zero, you must request an increase before deploying any model.
To create a new quota request, click this direct link or take the following steps:
Navigate to Everywhere Inference > Quotas in the Gcore Customer Portal. This will open the Account Quotas dialog, where you can view and modify your Everywhere Inference quotas.
The Account Quotas dialog shows an overview of your currently configured quotas. Update these values to match what you need for the new request.
Let’s look at the following settings for a model deployment:
The deployment uses:
The deployment will run two pods, one in each region, so you should request at least the following quota:
If you configure autoscaling to go up to two pods per region, you should request this quota:
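The sizing arithmetic in this example can be sketched in Python. This is only an illustration: the `PodResources` fields, the per-pod values, and the region names are hypothetical placeholders rather than Gcore flavor specifications or API objects, and the sketch assumes that quota is counted per region as the resources of one pod multiplied by the maximum pod count. For a real request, use the figures shown in the Account Quotas dialog.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PodResources:
    """Resources consumed by one pod. Hypothetical fields, not a Gcore API type."""
    vcpu: int
    ram_gib: int
    gpu: int


def required_quota(pod, regions, max_pods_per_region):
    """Return the quota needed in each region at the maximum pod count."""
    per_region = {
        "vcpu": pod.vcpu * max_pods_per_region,
        "ram_gib": pod.ram_gib * max_pods_per_region,
        "gpu": pod.gpu * max_pods_per_region,
    }
    # Give every region its own copy so one can be adjusted independently.
    return {region: dict(per_region) for region in regions}


def total_quota(quota_by_region):
    """Sum the per-region quotas into one account-wide figure."""
    totals = {}
    for resources in quota_by_region.values():
        for name, amount in resources.items():
            totals[name] = totals.get(name, 0) + amount
    return totals


# Placeholder values for illustration only.
pod = PodResources(vcpu=4, ram_gib=16, gpu=1)
regions = ["region-a", "region-b"]

fixed = required_quota(pod, regions, max_pods_per_region=1)       # one pod per region
autoscaled = required_quota(pod, regions, max_pods_per_region=2)  # up to two pods per region

print(total_quota(fixed))
print(total_quota(autoscaled))
```

Sizing the request for the autoscaling maximum rather than the initial pod count helps ensure that a scale-up still fits within your quota.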
The Request form is on the right. Fill it out with a description explaining why you need the increase, then click the Send request button.
It can take up to 15 minutes for your quotas to be updated. Once the update is applied, you can deploy AI models or change the autoscaling settings of existing deployments.