Skip to content

Conversation

@davidradl
Copy link
Contributor

@davidradl davidradl commented Jan 16, 2026

We were getting an intermittent errors in our CI pipeline,

  1. ExecutionTimeBasedSlowTaskDetectorTest.testBalancedInput : Expected size: 2 but was: 0
  2. ExecutionGraphCoLocationRestartTest.testConstraintsAfterRestart : do not have the symptoms
  3. ExecutionGraphRestartTest.testCancelWhileFailing expected: RUNNING FAILING
  4. ExecutionTimeBasedSlowTaskDetectorTest.testNoFinishedTaskButRatioIsZero: Expected size: 3 but was: 0

The original change was to introduce sleeps in line with the approach the existing test take in this area. This was rightly rejected in the review. I have changed the approach to add job listeners and futures so that the tests continue when they hit the expected state, without using sleeps.

…wTaskDetectorTest

Signed-off-by: davidradl <david_radley@uk.ibm.com>
@flinkbot
Copy link
Collaborator

flinkbot commented Jan 16, 2026

CI report:

Bot commands The @flinkbot bot supports the following commands:
  • @flinkbot run azure re-run the last Azure build

@davidradl davidradl changed the title [hotfix] Add sleep to stop intermittent failure in ExecutionTimeBasedSlowTaskDetectorTest [hotfix] Add 3 sleeps to address intermittent unit test failures we have hit in ExecutionGraph tests Jan 19, 2026
@davidradl davidradl marked this pull request as draft January 19, 2026 14:40
@davidradl davidradl force-pushed the hotfixsleep branch 3 times, most recently from e238c2f to 2613629 Compare January 20, 2026 10:30
@davidradl davidradl changed the title [hotfix] Add 3 sleeps to address intermittent unit test failures we have hit in ExecutionGraph tests [hotfix]Intermittent unit test failures we have hit in ExecutionGraph tests Jan 21, 2026
Signed-off-by: david_radley@uk.ibm.com <david_radley@uk.ibm.com>
Signed-off-by: david_radley@uk.ibm.com <david_radley@uk.ibm.com>
Signed-off-by: david_radley@uk.ibm.com <david_radley@uk.ibm.com>
Signed-off-by: david_radley@uk.ibm.com <david_radley@uk.ibm.com>
@davidradl davidradl marked this pull request as ready for review January 30, 2026 15:13
@davidradl davidradl changed the title [hotfix]Intermittent unit test failures we have hit in ExecutionGraph tests [hotfix] Intermittent unit test failures we have hit in ExecutionGraph tests Jan 30, 2026
@davidradl davidradl requested a review from snuyanzin January 30, 2026 15:14
@davidradl
Copy link
Contributor Author

@snuyanzin Please could you look at this new approach that avoid using sleeps please.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants