Rate Limited — All Models Failed Cooldown
Fix OpenClaw rate limit and cooldown errors when every model fallback is temporarily unavailable because of provider limits, quota, billing, missing auth, or overload.
Quick Fix
Run these commands first:
openclaw status
openclaw gateway status
openclaw models status
openclaw logs --followUse this only when you need a live provider check:
openclaw models status --probeDo not run --probe again and again during a rate limit. It can make real provider checks, and repeated checks may add more pressure instead of helping. OpenClaw docs list openclaw models status, auth checks, and model config checks as useful ways to confirm model availability.
What This Error Means
OpenClaw can fail over in two main steps:
- It can try another auth profile for the same provider.
- It can try the next model in your configured fallback list.
If all available routes fail, OpenClaw shows an “all models failed” or cooldown error.
HTTP 429
Rate limit exceeded
Too many requests
RESOURCE_EXHAUSTED
Quota exceeded
All models failed
No available auth profile
Provider is in cooldown
retry-afterThe final “all models failed” message is usually only a summary. The first real error in the logs tells you the real cause.
Most Common Causes
1. Provider Rate Limit
This is the most common reason.
AI providers limit usage by requests, tokens, projects, organizations, and sometimes model-specific limits.
OpenAI rate limits can include requests per minute, requests per day, tokens per minute, tokens per day, and other model-specific limits.
Anthropic Claude API rate limits are measured by requests per minute, input tokens per minute, and output tokens per minute. If you exceed a limit, Anthropic returns a 429 error with a retry-after value.
Google Gemini rate limits are applied per project and can vary by model. Google also notes that requests per day reset at midnight Pacific time.
openclaw logs --follow
openclaw models status
openclaw gateway status --deep- Wait for the retry or cooldown time.
- Reduce repeated requests.
- Reduce long prompts.
- Stop retry loops.
- Check provider billing and quota.
- Add a fallback from another provider or account.
2. All Fallbacks Use the Same Provider
Fallbacks will not help much if every fallback uses the same exhausted provider, account, or quota pool.
{
"agents": {
"defaults": {
"model": {
"primary": "provider-a/main-model",
"fallbacks": [
"provider-a/backup-model-1",
"provider-a/backup-model-2"
]
}
}
}
}{
"agents": {
"defaults": {
"model": {
"primary": "provider-a/main-model",
"fallbacks": [
"provider-b/available-model",
"provider-c/available-model"
]
}
}
}
}Use real model names from your own OpenClaw setup. Check them with:
openclaw models statusA good fallback should be:
- Already authenticated
- Available in your provider account
- Not blocked by the same quota pool
- Strong enough for the task
- Tested before production use
3. Your Session Is Locked to One Model
If you selected a model inside a chat session, OpenClaw may treat that as a strict choice.
That means fallback may not work the way you expect.
/model status/new/resetOpenClaw docs explain that explicit user session selections can be strict, while configured defaults and fallback policies can use configured fallbacks.
4. Long Chat or Huge Prompt
Long sessions can trigger rate limits faster because OpenClaw may send a lot of previous context.
This can happen with:
- Long conversations
- Large files
- Big logs
- Tool output
- Images
- PDFs
- Repeated automation runs
- Start a new session.
- Split large tasks into smaller tasks.
- Remove unnecessary logs.
- Avoid uploading the same large file again.
- Use a model that can handle the request size.
- Use smaller models for simple background jobs.
5. Too Many Automated Requests
OpenClaw may hit cooldown faster if automations run too often.
- Cron job runs every minute
- Discord bot responds to every message
- Webhook keeps retrying
- Multiple channels trigger the same workflow
- Several long tasks run at the same time
- Increase cron intervals.
- Stop retry loops.
- Use mention-only rules in busy group chats.
- Use cheaper models for background tasks.
- Do not run too many long tasks at once.
6. Billing or Credits Are Exhausted
Some errors look like rate limits but are actually billing problems.
insufficient credits
billing limit
monthly spend limit
quota exceeded
organization spending limit exceeded- Open the provider dashboard.
- Check billing.
- Add credits or payment method.
- Raise spend limits if needed.
- Use another provider/account with available quota.
OpenClaw cannot fix a provider account that has no credits or blocked billing. Annoying, yes. Magic, no.
7. Provider Is Temporarily Overloaded
Sometimes the provider is busy.
overloaded
server overloaded
upstream error
timeout
model not ready
internal server error- Wait a few minutes.
- Try a smaller or faster model.
- Use a fallback provider.
- Check provider status.
- Do not restart OpenClaw repeatedly unless the gateway itself is unhealthy.
openclaw gateway status
openclaw logs --followProvider vs OpenClaw Problem
| What You See | Likely Cause | What To Do |
|---|---|---|
| HTTP 429 | Provider rate limit | Wait, reduce usage, check provider dashboard |
| RESOURCE_EXHAUSTED | Google/Gemini quota | Check Google project quota |
| No available auth profile | Auth profile missing or cooling down | Run openclaw models status |
| insufficient credits | Billing issue | Add credits or update billing |
| Gateway not running | OpenClaw gateway issue | Run openclaw gateway status |
| Long chat fails only | Context too large | Start new session |
| Fallback also fails | Bad fallback setup | Add tested fallback from another provider |
Step-by-Step Fix
Step 1: Check OpenClaw Status
openclaw status
openclaw gateway statusIf the gateway is not running, fix that first.
Step 2: Read Logs
openclaw logs --followLook for the first provider error.
Do not only read the final “all models failed” line. That is usually just the summary.
Step 3: Check Model Status
openclaw models statusCheck:
- Primary model
- Fallback models
- Auth profile status
- Missing auth
- Cooldown state
- Selected session model
Step 4: Check Provider Dashboard
Check the dashboard for the provider shown in the error:
- OpenAI: usage, project limits, org limits
- Anthropic: Claude Console limits and spend limits
- Google Gemini: AI Studio limits and project quota
If the provider dashboard shows no available quota, OpenClaw cannot fix that locally.
Step 5: Add a Real Fallback
Use at least one fallback from another provider or account.
{
"agents": {
"defaults": {
"model": {
"primary": "provider-a/main-model",
"fallbacks": [
"provider-b/available-model",
"provider-c/available-model"
]
}
}
}
}Then verify:
openclaw models statusWhat Not To Do
Avoid these mistakes:
- Do not keep sending the same prompt every few seconds.
- Do not restart the gateway repeatedly if the provider is rate limited.
- Do not use fallbacks that all share the same exhausted provider.
- Do not ignore billing or credit errors.
- Do not run
--proberepeatedly during cooldown. - Do not assume “all models failed” means OpenClaw is broken.
- Do not run high-frequency automations on expensive long-context models.
Prevention Checklist
Before running heavy OpenClaw workflows:
- Run
openclaw models status - Add fallback models from another provider/account
- Keep provider billing active
- Avoid very long sessions
- Split large tasks
- Reduce cron frequency
- Stop retry loops
- Use smaller models for background jobs
- Monitor provider dashboards
- Use managed hosting if uptime matters
Frequently Asked Questions
Why does OpenClaw say all models failed?
Is this always an OpenClaw bug?
How long should I wait?
Why did fallback not work?
Can restarting OpenClaw fix it?
How do I check the real cause?
Does Ampere.sh prevent all rate limits?
Also Read
Stop fighting model cooldowns
Managed OpenClaw hosting helps with uptime, setup, gateway health, and day-to-day operations. Provider rate limits still apply, but the OpenClaw side is easier to manage.
Start 7-Day Free Trial →

