fix: Handle INSUFFICIENT_MODEL_CAPACITY ThrottlingException as ModelOverloadedError #2503

Merged
evanliu048 merged 2 commits into main from chat_throttled_error
Aug 6, 2025

Conversation

@evanliu048
Contributor

Issue #, if available:

Description of changes:
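The service team has changed the response fields of ThrottlingException.
This PR updates the CLI to handle the new format, so that a ThrottlingException carrying INSUFFICIENT_MODEL_CAPACITY is surfaced as ModelOverloadedError.
Because throttling and quota breaches both return a 429 status code, the model_unavailable check must run first; otherwise a capacity problem would be reported as a quota breach.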
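
The ordering can be illustrated with a minimal sketch. This is not the code from this PR: `ServiceError`, `ChatError`, `classify`, the `reason` field, and the `QuotaBreach` and `Other` variants are hypothetical names chosen for illustration. Only `INSUFFICIENT_MODEL_CAPACITY`, `ModelOverloadedError`, and the 429 status code come from the description above.

```rust
/// Hypothetical error categories. Only `ModelOverloadedError` is named in this PR.
#[derive(Debug, PartialEq)]
enum ChatError {
    ModelOverloadedError,
    QuotaBreach,
    Other,
}

/// Hypothetical, simplified view of a service error response.
struct ServiceError<'a> {
    status: u16,
    /// Assumed field carrying the throttling reason, if the service sent one.
    reason: Option<&'a str>,
}

/// Classify an error, checking for model capacity before the generic 429 handling.
fn classify(err: &ServiceError<'_>) -> ChatError {
    if err.status != 429 {
        return ChatError::Other;
    }
    // Both cases arrive as 429, so the capacity check has to come first.
    if err.reason == Some("INSUFFICIENT_MODEL_CAPACITY") {
        return ChatError::ModelOverloadedError;
    }
    // Any remaining 429 is treated as a quota breach in this sketch.
    ChatError::QuotaBreach
}
```

If the two checks were swapped, every 429 would match the quota branch and the capacity case would never be reached.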
Before:
[screenshot: before]

After:
[screenshot: after]

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

@evanliu048 evanliu048 merged commit 7660ebe into main Aug 6, 2025
15 checks passed
@evanliu048 evanliu048 deleted the chat_throttled_error branch August 6, 2025 00:53