Skip to main content
Gclaw uses the DeepSeek V4 Flash model for inference, running on Gcore H200 GPUs.

Model limits

LimitValue
Context window200,000 tokens
Maximum output32,000 tokens
ReasoningEnabled

Rate limits

Rate limits are managed at the Gcore infrastructure level and depend on the account tier. Specific values are not published and may vary. When a rate limit is exceeded, the API returns an error response — reduce request frequency and retry. If errors persist, contact Gcore support.

Instance limits

During the beta period, each account is limited to one Gclaw instance.