Rate Limits & Quotas
Rate limits protect the platform and make usage predictable. Pair exponential backoff with the usage endpoints so you can slow down before you hit hard limits.
Designed for early production traffic, internal tooling, and integration partners.
For multi-tenant products, agency workloads, and heavier scheduled processing.
Read /api/v1/usage/stats and /api/v1/usage/quota to monitor burn rate inside your app.
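As a rough sketch of burn-rate monitoring: the response schema of the usage endpoints is not shown here, so the example below takes the numbers as plain arguments instead of parsing a response. The function names, parameters, and the 90% headroom threshold are all hypothetical and should be adapted to whatever fields /api/v1/usage/stats and /api/v1/usage/quota actually return.

```python
def projected_usage(used_units, elapsed_days, period_days):
    """Linearly extrapolate usage so far to the end of the billing period.

    All three inputs are assumed to come from your own reading of the
    usage endpoints; none of these names are part of the API.
    """
    if elapsed_days <= 0:
        raise ValueError("elapsed_days must be positive")
    daily_rate = used_units / elapsed_days
    return daily_rate * period_days


def should_throttle(used_units, quota_units, elapsed_days, period_days,
                    headroom=0.9):
    """Return True when projected usage reaches `headroom` of the quota.

    `headroom` is an illustrative threshold, not a platform setting.
    """
    projected = projected_usage(used_units, elapsed_days, period_days)
    return projected >= quota_units * headroom
```

For example, 500 units used after 10 days of a 30-day period projects to 1500 units, so a 1000-unit quota would trigger throttling well before the hard limit.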
Backoff guidance
Use jittered exponential backoff on 429 responses. When multiple workers share one key, make sure they do not all retry at the same moment, or the retries themselves become a storm that keeps the key rate limited.
Plan for graceful degradation if you are close to your plan quota late in the billing period.
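The jittered backoff described above could look something like the following minimal sketch. It uses the "full jitter" variant (a random delay between zero and an exponentially growing ceiling), which is one common choice and not necessarily what the platform recommends. The `send` callable, the retry count, and the base and cap values are assumptions for illustration; `send` stands in for whatever function issues your request and returns a `(status, body)` pair.

```python
import random
import time


def backoff_delay(attempt, base=1.0, cap=60.0):
    """Full-jitter delay in seconds for a 0-based retry attempt.

    The ceiling doubles each attempt (base, 2*base, 4*base, ...) up to
    `cap`; the actual delay is drawn uniformly from [0, ceiling] so that
    workers sharing a key do not retry in lockstep.
    """
    ceiling = min(cap, base * (2 ** attempt))
    return random.uniform(0.0, ceiling)


def call_with_backoff(send, max_attempts=5, base=1.0, cap=60.0,
                      sleep=time.sleep):
    """Call `send()` until it returns a non-429 status or attempts run out.

    `send` is a hypothetical callable returning (status_code, body).
    Returns the last (status_code, body) pair it saw.
    """
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")
    for attempt in range(max_attempts):
        status, body = send()
        if status != 429:
            return status, body
        # Sleep only if another attempt will follow.
        if attempt < max_attempts - 1:
            sleep(backoff_delay(attempt, base, cap))
    return status, body
```

Passing `sleep` as a parameter keeps the function easy to test and lets a worker pool substitute its own scheduler. Randomizing the full delay, instead of adding a small jitter on top of a fixed delay, spreads retries from workers that were rate limited at the same instant.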
Quota exceeded
A quota error means the request is valid but your current billing period has exhausted the included units for that workload. Check the dashboard, review overage expectations, or upgrade the plan before resuming high-volume jobs.