Operations

Rate Limits & Quotas

Rate limits protect the platform and make usage predictable. Pair exponential backoff with the usage endpoints so you can slow down before you hit hard limits.

Developer
100 requests / minute

Designed for early production traffic, internal tooling, and integration partners.

Enterprise
500 requests / minute

For multi-tenant products, agency workloads, and heavier scheduled processing.

Usage
Quota visibility

Read /api/v1/usage/stats and /api/v1/usage/quota to monitor burn rate inside your app.

Backoff guidance

Use jittered exponential backoff on 429 responses and avoid parallel retry storms from multiple workers sharing one key.

Plan for graceful degradation if you are close to plan quota late in the billing period.

Quota exceeded

A quota error means the request is valid but your current billing period has exhausted the included units for that workload. Check the dashboard, review overage expectations, or upgrade the plan before resuming high-volume jobs.