Docs / Rate Limits & Quota
Rate Limits & Quota
Plan quota is counted by request count, over a 5-hour window and a weekly window; token usage has no separate cap.
5-hour window
Rolling window, auto-resets. Over the limit = temporary throttle until reset; no charge.
Weekly window
7-day fixed window, same behavior.
Concurrency
Each plan has a concurrency cap; agent clients (Claude Code / Cline) are concurrency-heavy and may hit 429.
See exact numbers on the pricing page. To see usage live in Claude Code: Usage in Claude Code.
What a 429 means
- concurrency_limit: concurrency over your plan cap — slow down / reduce concurrency, or upgrade / enable API credit billing.
- window/quota: 5h or weekly window exhausted — wait for reset or upgrade.
- upstream_busy: transient upstream load — usually clears shortly.
Need help? Email [email protected] · Home