Fix Flaky TestScripts by kalverra · Pull Request #22838 · smartcontractkit/chainlink

kalverra · 2026-06-12T22:35:20Z

TestScripts is one of the flakiest areas in /chainlink. I ran the fix-flaky-tests skill on it with opus 4.8. It cost ~$2.00 and took ~30 minutes (excluding the time spent rerunning the tests to verify the fixes).

Changes

Potentially breaking: StartUpHealthReport now returns 503 for non-/health paths instead of the old 404. A 503 is closer to what most clients expect while a service is still starting. TestScripts uses curl --retry to keep calling these endpoints until they are ready, but curl does not retry when it gets a 404.
Added tests for tools/txtar/visitor.go and updated it to use more modern Go patterns, which should help performance slightly.

Results

# Before
❯ make test ARGS="diagnose --iterations 100 -- -run TestScripts ."
   Estimated flake rate = 1.6% - 4.2%

# After
❯ make test ARGS="diagnose --iterations 600 -- -run TestScripts ."
   Estimated flake rate = 0% - 0.6%


github-actions · 2026-06-12T22:35:38Z

👋 kalverra, thanks for creating this pull request!

To help reviewers, please consider creating future PRs as drafts first. This allows you to self-review and make any final changes before notifying the team.

Once you're ready, you can mark it as "Ready for review" to request feedback. Thanks!

github-actions · 2026-06-12T22:36:43Z

I see you updated files related to core. Please run make gocs in the root directory to add a changeset, and include at least one of the following tags in its text:

#added For any new functionality added.
#breaking_change For any functionality that requires manual action for the node to boot.
#bugfix For bug fixes.
#changed For any change to the existing functionality.
#db_update For any feature that introduces updates to database schema.
#deprecation_notice For any upcoming deprecation functionality.
#internal For changesets that need to be excluded from the final changelog.
#nops For any feature that is NOP facing and needs to be in the official Release Notes for the release.
#removed For any functionality/config that is removed.
#updated For any functionality that is updated.
#wip For any change that is not ready yet and external communication about it should be held off till it is feature complete.

github-actions · 2026-06-12T22:37:10Z

✅ No conflicts with other open PRs targeting develop

cl-sonarqube-production · 2026-06-12T22:51:33Z

Quality Gate failed

Failed conditions
C Security Rating on New Code (required ≥ A)

See analysis details on SonarQube

Catch issues before they fail your Quality Gate with our IDE extension SonarQube IDE

jmank88 · 2026-06-12T22:55:33Z

-		// If we're not recursing, skip all other directories except the root.
-		if !bool(d.recurse) && !isRootDir {
-			return nil
+		if !bool(d.recurse) && filepath.Clean(path) != root {


Is it safe to use this less strict filepath.Clean comparison instead of os.SameFile?

jmank88 · 2026-06-12T22:57:18Z

+	"github.com/stretchr/testify/require"
+)
+
+func writeTreeFile(t *testing.T, path string) {


Would it make sense to use embedded testdata/ dir files rather than creating them along the way?

trunk-io · 2026-06-12T22:57:51Z

Failed Test	Failure Summary	Logs
`TestCCIPReader_MsgsBetweenSeqNums`	The test failed because it could not find the expected message or log within the specified sequence number range.	Logs ↗︎


jmank88 · 2026-06-12T22:59:43Z

-	t.Parallel()

-	require.NoError(t, os.Setenv("TMPDIR", "/tmp")) // osx default is too long for go-plugin sockets
+	require.NoError(t, os.Setenv("GOTMPDIR", "/tmp")) // keep workspaces in /tmp


Should we use t.TempDir()?
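
As background for this question: on Unix systems os.TempDir returns $TMPDIR (falling back to /tmp when it is unset), whereas GOTMPDIR is read by the go command for its own temporary files. To my knowledge t.TempDir creates its directories under os.TempDir, so it follows TMPDIR as well. The sketch below only demonstrates the os.TempDir part; tempDirWith is a hypothetical helper and the directory values are placeholders.

```go
package main

import (
	"fmt"
	"os"
)

// tempDirWith temporarily sets one environment variable, reports what
// os.TempDir returns under that setting, then restores the old value.
func tempDirWith(key, value string) string {
	old, had := os.LookupEnv(key)
	os.Setenv(key, value)
	defer func() {
		if had {
			os.Setenv(key, old)
		} else {
			os.Unsetenv(key)
		}
	}()
	return os.TempDir()
}

func main() {
	// On Unix this prints the value just assigned to TMPDIR.
	fmt.Println(tempDirWith("TMPDIR", "/tmp"))

	// GOTMPDIR does not influence os.TempDir, so this prints whatever the
	// process's normal temp directory is.
	fmt.Println(tempDirWith("GOTMPDIR", "/tmp/gotmpdir-sketch"))
}
```

In a test, t.Setenv would be the usual way to scope such a change, although it cannot be combined with t.Parallel.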

jmank88 · 2026-06-12T23:00:56Z

-				ContinueOnError: true,
+				Files:               filesToRun,
+				Setup:               commonEnv(t),
+				ContinueOnError:     false,


Why not? If there are multiple failing tests, we would rather know about them all at once.

kalverra added 5 commits June 12, 2026 14:57

f2c3019 tool version
97ee99e Merge
917471e Modern txtar
a52d3c4 modernize TestScripts runner
6bec747 Fix flaky TestScripts

kalverra requested review from a team as code owners June 12, 2026 22:35

5f095db Merge branch 'develop' of github.com:smartcontractkit/chainlink into …testScriptsStable

product-security-plaid-production Bot requested review from jmank88 and patrickhuie19 June 12, 2026 22:35

6b4638f /health test

jmank88 reviewed Jun 12, 2026

kalverra wants to merge 7 commits into develop from testScriptsStable

2 participants
