Rate Limited — All Models Failed Cooldown

Fix OpenClaw rate limit and cooldown errors when every model fallback is temporarily unavailable because of provider limits, quota, billing, missing auth, or overload.

Quick Fix

Run these commands first:

openclaw status openclaw gateway status openclaw models status openclaw logs --follow

Use this only when you need a live provider check:

openclaw models status --probe

Do not run --probe again and again during a rate limit. It can make real provider checks, and repeated checks may add more pressure instead of helping. OpenClaw docs list openclaw models status, auth checks, and model config checks as useful ways to confirm model availability.

What This Error Means

OpenClaw can fail over in two main steps:

  • It can try another auth profile for the same provider.
  • It can try the next model in your configured fallback list.

If all available routes fail, OpenClaw shows an “all models failed” or cooldown error.

Common error messages include:
HTTP 429 Rate limit exceeded Too many requests RESOURCE_EXHAUSTED Quota exceeded All models failed No available auth profile Provider is in cooldown retry-after

The final “all models failed” message is usually only a summary. The first real error in the logs tells you the real cause.

Most Common Causes

1. Provider Rate Limit

This is the most common reason.

AI providers limit usage by requests, tokens, projects, organizations, and sometimes model-specific limits.

OpenAI rate limits can include requests per minute, requests per day, tokens per minute, tokens per day, and other model-specific limits.

Anthropic Claude API rate limits are measured by requests per minute, input tokens per minute, and output tokens per minute. If you exceed a limit, Anthropic returns a 429 error with a retry-after value.

Google Gemini rate limits are applied per project and can vary by model. Google also notes that requests per day reset at midnight Pacific time.

How to check
openclaw logs --follow openclaw models status openclaw gateway status --deep
Fix
  • Wait for the retry or cooldown time.
  • Reduce repeated requests.
  • Reduce long prompts.
  • Stop retry loops.
  • Check provider billing and quota.
  • Add a fallback from another provider or account.

2. All Fallbacks Use the Same Provider

Fallbacks will not help much if every fallback uses the same exhausted provider, account, or quota pool.

Weak fallback setup:
{ "agents": { "defaults": { "model": { "primary": "provider-a/main-model", "fallbacks": [ "provider-a/backup-model-1", "provider-a/backup-model-2" ] } } } }
Better fallback setup:
{ "agents": { "defaults": { "model": { "primary": "provider-a/main-model", "fallbacks": [ "provider-b/available-model", "provider-c/available-model" ] } } } }

Use real model names from your own OpenClaw setup. Check them with:

openclaw models status

A good fallback should be:

  • Already authenticated
  • Available in your provider account
  • Not blocked by the same quota pool
  • Strong enough for the task
  • Tested before production use

3. Your Session Is Locked to One Model

If you selected a model inside a chat session, OpenClaw may treat that as a strict choice.

That means fallback may not work the way you expect.

Check the selected model:
/model status
Start a fresh session:
/new
Or reset the session:
/reset

OpenClaw docs explain that explicit user session selections can be strict, while configured defaults and fallback policies can use configured fallbacks.

4. Long Chat or Huge Prompt

Long sessions can trigger rate limits faster because OpenClaw may send a lot of previous context.

This can happen with:

  • Long conversations
  • Large files
  • Big logs
  • Tool output
  • Images
  • PDFs
  • Repeated automation runs
Fix
  • Start a new session.
  • Split large tasks into smaller tasks.
  • Remove unnecessary logs.
  • Avoid uploading the same large file again.
  • Use a model that can handle the request size.
  • Use smaller models for simple background jobs.

5. Too Many Automated Requests

OpenClaw may hit cooldown faster if automations run too often.

Common examples:
  • Cron job runs every minute
  • Discord bot responds to every message
  • Webhook keeps retrying
  • Multiple channels trigger the same workflow
  • Several long tasks run at the same time
Fix
  • Increase cron intervals.
  • Stop retry loops.
  • Use mention-only rules in busy group chats.
  • Use cheaper models for background tasks.
  • Do not run too many long tasks at once.

6. Billing or Credits Are Exhausted

Some errors look like rate limits but are actually billing problems.

Look for:
insufficient credits billing limit monthly spend limit quota exceeded organization spending limit exceeded
Fix
  • Open the provider dashboard.
  • Check billing.
  • Add credits or payment method.
  • Raise spend limits if needed.
  • Use another provider/account with available quota.

OpenClaw cannot fix a provider account that has no credits or blocked billing. Annoying, yes. Magic, no.

7. Provider Is Temporarily Overloaded

Sometimes the provider is busy.

You may see:
overloaded server overloaded upstream error timeout model not ready internal server error
Fix
  • Wait a few minutes.
  • Try a smaller or faster model.
  • Use a fallback provider.
  • Check provider status.
  • Do not restart OpenClaw repeatedly unless the gateway itself is unhealthy.
Check gateway health:
openclaw gateway status openclaw logs --follow

Provider vs OpenClaw Problem

What You SeeLikely CauseWhat To Do
HTTP 429Provider rate limitWait, reduce usage, check provider dashboard
RESOURCE_EXHAUSTEDGoogle/Gemini quotaCheck Google project quota
No available auth profileAuth profile missing or cooling downRun openclaw models status
insufficient creditsBilling issueAdd credits or update billing
Gateway not runningOpenClaw gateway issueRun openclaw gateway status
Long chat fails onlyContext too largeStart new session
Fallback also failsBad fallback setupAdd tested fallback from another provider

Step-by-Step Fix

Step 1: Check OpenClaw Status

openclaw status openclaw gateway status

If the gateway is not running, fix that first.

Step 2: Read Logs

openclaw logs --follow

Look for the first provider error.

Do not only read the final “all models failed” line. That is usually just the summary.

Step 3: Check Model Status

openclaw models status

Check:

  • Primary model
  • Fallback models
  • Auth profile status
  • Missing auth
  • Cooldown state
  • Selected session model

Step 4: Check Provider Dashboard

Check the dashboard for the provider shown in the error:

  • OpenAI: usage, project limits, org limits
  • Anthropic: Claude Console limits and spend limits
  • Google Gemini: AI Studio limits and project quota

If the provider dashboard shows no available quota, OpenClaw cannot fix that locally.

Step 5: Add a Real Fallback

Use at least one fallback from another provider or account.

Example:
{ "agents": { "defaults": { "model": { "primary": "provider-a/main-model", "fallbacks": [ "provider-b/available-model", "provider-c/available-model" ] } } } }

Then verify:

openclaw models status

What Not To Do

Avoid these mistakes:

  • Do not keep sending the same prompt every few seconds.
  • Do not restart the gateway repeatedly if the provider is rate limited.
  • Do not use fallbacks that all share the same exhausted provider.
  • Do not ignore billing or credit errors.
  • Do not run --probe repeatedly during cooldown.
  • Do not assume “all models failed” means OpenClaw is broken.
  • Do not run high-frequency automations on expensive long-context models.

Prevention Checklist

Before running heavy OpenClaw workflows:

  • Run openclaw models status
  • Add fallback models from another provider/account
  • Keep provider billing active
  • Avoid very long sessions
  • Split large tasks
  • Reduce cron frequency
  • Stop retry loops
  • Use smaller models for background jobs
  • Monitor provider dashboards
  • Use managed hosting if uptime matters

Frequently Asked Questions

Why does OpenClaw say all models failed?
It means OpenClaw tried the available model routes, but none could complete the request.
Is this always an OpenClaw bug?
No. It is usually caused by provider rate limits, quota, billing, missing auth, or bad fallback setup.
How long should I wait?
Wait for the retry time or cooldown shown in the error. If billing or quota is exhausted, waiting may not help.
Why did fallback not work?
Fallback may fail if it uses the same provider, same quota pool, missing auth, or unavailable model.
Can restarting OpenClaw fix it?
Restarting can help if the gateway is stuck or config changed. It does not refill provider quota or fix billing.
How do I check the real cause?
Run: openclaw logs --follow, openclaw models status, and openclaw gateway status. Read the first provider error in the logs.
Does Ampere.sh prevent all rate limits?
No. Provider rate limits still apply. Ampere.sh helps with managed OpenClaw hosting, uptime, setup, and operations.

Also Read

OpenClaw Bot Not Responding? Fix API Key & Gateway Issues
Guide

OpenClaw Bot Not Responding? Fix API Key & Gateway Issues

·
How to Change Model in OpenClaw: Complete Guide for Beginners
Guide

How to Change Model in OpenClaw: Complete Guide for Beginners

10 min·May 25, 2026
How to Reduce OpenClaw API Cost Without Losing Workflow Quality
Guide

How to Reduce OpenClaw API Cost Without Losing Workflow Quality

·
Emma Thompson

Written by

Emma Thompson

AI Research Writer

Emma is an AI researcher and technical writer with a PhD in Machine Learning from Stanford. She specializes in large language model evaluation, comparing model capabilities, and explaining complex AI concepts. Her research has been published in NeurIPS and ICML. She makes cutting-edge AI research accessible through clear, practical guides.

Stop fighting model cooldowns

Managed OpenClaw hosting helps with uptime, setup, gateway health, and day-to-day operations. Provider rate limits still apply, but the OpenClaw side is easier to manage.

Start 7-Day Free Trial →