Defaults
| Window | Default |
|---|---|
| Per minute | 60 requests |
| Per hour | 1,000 requests |
Response headers
Every response (success or failure) carries the rate-limit headers for both windows:Limit-*is the cap for that window.Remaining-*is how many requests the org has left.Reset-*is the Unix timestamp (seconds) at which the window rolls over.
Remaining-* proactively to slow down before you hit the cap, rather than reacting to 429s after the fact. Because the bucket is shared, the value reflects traffic from every credential in the org.
Hitting a cap
When you exceed either window, you get429 rate_limited. The response includes a standard Retry-After header (in seconds) alongside the per-window rate-limit headers:
Retry-After to know exactly how long to wait. The remaining count for each window tells you which limit you hit — a burst trips the minute window (Remaining-Minute: 0) while sustained over-quota drains the hour window.
Backoff strategy
Honor Retry-After
On a
429, sleep for at least the number of seconds in the Retry-After header before retrying. Do not retry sooner — you will just consume your quota faster as it refills.Exponential backoff on repeats
If a second
429 follows the first, double the next sleep. If a third follows, double again. Cap the backoff at 60 seconds.Burst vs sustained
The minute window absorbs short bursts. If your average is well under 1,000/hour but you batch 60 requests in five seconds, you’ll briefly hit the minute cap. SpreadPOSTs across the minute, or use the Idempotency header so retries are safe.
The hour window protects against sustained over-quota. If you’re hitting it regularly, the integration’s request profile is wrong for the current limits — either reduce the rate or request a raise.
One bucket per org
All of an organization’s credentials share the same counters. If one process saturates the minute cap, every other credential in the org — including OAuth tokens — sees429s until the window resets. Minting additional keys will not give you more throughput; it only changes how traffic is attributed, not how much is allowed. To scale, either smooth your request rate or ask support to raise the org’s limits.
