HTTP 429In some cases, those 429 errors are specifically tied to long‑context requests. If you're deep into a long conversation and your prompt context has grown very large, the upstream provider may impose tighter quotas or outright refuse the request.
The fallback label sg-claude-opus-4.7 via custom
This matters because if your primary and fallback are both pointing to the same upstream gateway, API pool, or backend, you'll see the provider switch in the UI but experience no real relief — both paths hit the same exhausted quota. While we don't have enough evidence to confirm they share a pool in your specific case, it's a very high probability worth checking directly in your configuration.
Hermes stores custom endpoints in config.yaml, and the fallback chain lives there too under the fallback_providers key.
Before you start pulling your hair out, check these items in order:
~/.hermes/config.yaml and look at the fallback_providers list.sg-claude-opus-4.7 via customopenclaw gateway probeThis message loops because Hermes is faithfully executing its per‑turn fallback design while your primary model remains rate‑limited. To stop seeing it, you need to fix the underlying cause — whether that's adjusting your provider quotas, reconfiguring your fallback to point to a truly independent provider, or reducing your prompt context size.
If you want a precise diagnosis, you can read your active config.yaml file and trace exactly which primary is failing, where the sg-* fallback actually routes to, and why it insists on retrying the primary on every turn.
Comments
0 comments