fix: make LLM review resilient to individual question failures by academo · Pull Request #550 · grafana/plugin-validator

academo · 2026-03-26T10:45:03Z

The LLM review was silently dropping all results when any single question failed. Two issues:

AskLLMAboutCode had a retry loop wrapping CallLLM, which itself already retries internally via callLLMWithRetry so a failing question would fire up to 9 API calls before giving up, then discard all previously successful answers
The 20s per-call timeout was too short for large plugin codebases, causing frequent context deadline exceeded errors on Anthropic

Now failed questions are skipped and processing continues, the redundant outer retry is removed, and the timeout is bumped to 90s. Also adds an OK report when LLM review completes clean.

- Increase LLM call timeout from 20s to 90s to handle large code contexts - Remove redundant outer retry loop in AskLLMAboutCode (inner callLLMWithRetry already handles retries) - Skip failed questions instead of aborting the entire review - Report OK status when LLM review completes without concerns

academo · 2026-03-26T10:47:27Z

pkg/llmvalidate/llmvalidate.go

-			}
-			lastErr = nil
-			break
+		agenticAnswers, err := agenticClient.CallLLM(c.ctx, []string{userPrompt}, absCodePath)


the main change here is removing the for retries := 3; retries > 0; retries-- loop

this was retrying 3 times but CallLLM already has retrying handling internally (3 times)

xnyo

LGTM!

github-project-automation bot added this to Grafana Catalog Team Mar 26, 2026

github-project-automation bot moved this to 📬 Triage in Grafana Catalog Team Mar 26, 2026

academo self-assigned this Mar 26, 2026

academo moved this from 📬 Triage to 🔬 In review in Grafana Catalog Team Mar 26, 2026

academo marked this pull request as ready for review March 26, 2026 10:46

academo requested review from a team as code owners March 26, 2026 10:46

academo requested review from Ukochka, oshirohugo, s4kh and xnyo March 26, 2026 10:46

academo commented Mar 26, 2026

View reviewed changes

tolzhabayev approved these changes Mar 26, 2026

View reviewed changes

xnyo approved these changes Mar 26, 2026

View reviewed changes

academo merged commit c653140 into main Mar 26, 2026
11 checks passed

academo deleted the academo/fix-llm-error-return branch March 26, 2026 11:14

github-project-automation bot moved this from 🔬 In review to 🚀 Shipped in Grafana Catalog Team Mar 26, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: make LLM review resilient to individual question failures#550

fix: make LLM review resilient to individual question failures#550
academo merged 1 commit intomainfrom
academo/fix-llm-error-return

academo commented Mar 26, 2026 •

edited

Loading

Uh oh!

academo Mar 26, 2026

Uh oh!

xnyo left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

academo commented Mar 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

academo Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

xnyo left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

academo commented Mar 26, 2026 •

edited

Loading