Implement a retry mechanism for Google GenAI calls #15783

xpomul · 2025-06-06T09:33:47Z

What it does

Google imposes rate limits on its LLMs. Especially on lower tiers (including the free tier, see https://ai.google.dev/gemini-api/docs/rate-limits?hl=de ), an agent that makes many tool calls (such as the Coder agent) can quickly hit the Requests Per Minute (RPM) rate limit, and the agent execution then terminates with an error.
Also, in longer conversations, the LLM sometimes does not return proper JSON, which leads to an error in the GenAI API client, or the API occasionally reports 500 Internal Server Errors.

To make the Google LanguageModel implementation in Theia more robust against these errors, a retry mechanism is now implemented that can resend the last request after a configurable delay in case of an error.

The retry mechanism can be configured using preferences:

maxRetriesOnErrors configures the maximum number of retries per request after which to give up. Defaults to 3. If smaller than 1, then the retry logic is disabled.
retryDelayOnRateLimitError configures the delay in seconds to wait in case of a rate limit error. Defaults to 60 (1 minute). If negative, then no retry is attempted and the error is propagated.
retryDelayOnOtherErrors configures the delay in seconds to wait in case of any other error. Defaults to -1 (disabled). If negative, then no retry is attempted and the error is propagated.
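
The interplay of the three preferences can be sketched roughly as follows. This is a minimal illustration and not the code from this PR: the names `RetrySettings`, `isRateLimitError`, `nextRetryDelayMs` and `withRetry` are hypothetical, and detecting a rate limit error via a `status` property equal to 429 is an assumption about the error shape.

```typescript
// Hypothetical settings shape mirroring the three preferences described above.
interface RetrySettings {
    maxRetriesOnErrors: number;          // retries per request; < 1 disables retrying
    retryDelayOnRateLimitError: number;  // seconds; negative means "do not retry"
    retryDelayOnOtherErrors: number;     // seconds; negative means "do not retry"
}

// Defaults as stated in the PR description.
const DEFAULT_RETRY_SETTINGS: RetrySettings = {
    maxRetriesOnErrors: 3,
    retryDelayOnRateLimitError: 60,
    retryDelayOnOtherErrors: -1
};

// Assumption: a rate limit error carries HTTP status 429 on a `status` property.
// The real client may signal this differently.
function isRateLimitError(error: unknown): boolean {
    return typeof error === 'object' && error !== null
        && (error as { status?: unknown }).status === 429;
}

// Returns the delay in milliseconds before the next attempt,
// or undefined if the error should be propagated to the caller.
// `retriesSoFar` counts the retries already performed for this request.
function nextRetryDelayMs(error: unknown, retriesSoFar: number, settings: RetrySettings): number | undefined {
    if (settings.maxRetriesOnErrors < 1 || retriesSoFar >= settings.maxRetriesOnErrors) {
        return undefined;
    }
    const seconds = isRateLimitError(error)
        ? settings.retryDelayOnRateLimitError
        : settings.retryDelayOnOtherErrors;
    return seconds < 0 ? undefined : seconds * 1000;
}

// Resends the same request after the computed delay until it succeeds
// or the retry decision says to give up.
async function withRetry<T>(request: () => Promise<T>, settings: RetrySettings = DEFAULT_RETRY_SETTINGS): Promise<T> {
    for (let retries = 0; ; retries++) {
        try {
            return await request();
        } catch (error) {
            const delayMs = nextRetryDelayMs(error, retries, settings);
            if (delayMs === undefined) {
                throw error;
            }
            await new Promise<void>(resolve => setTimeout(resolve, delayMs));
        }
    }
}
```

Keeping the retry decision in a pure function separate from the waiting loop makes the preference semantics easy to check without real delays. With the defaults, a rate-limited request is attempted up to four times in total (one initial attempt plus three retries), while any other error is propagated immediately.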

How to test

Get a Google GenAI API key for the free tier.
Set up Theia with that key and configure the Coder agent to use google/gemini-2.0-flash.
Give a complex instruction to the Coder agent. For example, with the Theia codebase loaded, ask it something like:

@Coder Change the Breakpoint widget so that multi-selection of breakpoints is possible.

This means that the TreeWidget of the Breakpoint view should be changed to support multiSelect. If multiple breakpoints are selected, then in the context menu, there should be menu items:

* Enable all selected
* Disable all selected
* Delete all selected

which should behave like the counterpart commands for single breakpoints just applied to multiple ones.

Without the change, you should see a 429 Rate Limit Exceeded message after 10 tool calls.
With the change, the conversation will stall for some time in the middle, but eventually continue after 60s of waiting.

Follow-ups

Breaking changes

This PR introduces breaking changes and requires careful review. If yes, the breaking changes section in the changelog has been updated.

Attribution

Review checklist

As an author, I have thoroughly tested my changes and carefully followed the review guidelines

Reminder for reviewers

As a reviewer, I agree to behave in accordance with the review guidelines

In case of errors (either due to rate limit exceed or other unexpected errors) the Google LanguageModel now resends the last request after a configurable delay. Signed-off-by: Stefan Winkler <stefan@winklerweb.net>

eneufeld

This looks good overall.
I suggest moving the retry settings to the frontend and passing them down from there, as we do with other similar settings (see the anthropic or openai package).

packages/ai-google/src/node/google-language-model.ts

Signed-off-by: Stefan Winkler <stefan@winklerweb.net>

eneufeld · 2025-06-15T21:49:44Z

@sdirix any opinion on this? I think the change is fine to be merged.

sdirix · 2025-06-16T11:02:39Z

Conceptually, this looks good to me; however, I did not test it or review it in detail.

Implement a retry mechanism for Google GenAI calls

70f68d0


github-project-automation bot added this to PR Backlog Jun 6, 2025

github-project-automation bot moved this to Waiting on reviewers in PR Backlog Jun 6, 2025

eneufeld reviewed Jun 10, 2025


Changed retry settings to preferences

e07f3f9

Signed-off-by: Stefan Winkler <stefan@winklerweb.net>

xpomul requested a review from eneufeld June 10, 2025 19:24

Merge branch 'master' into google-ai-retry

57ee99e
