Skip to content

DAOS-623 test: add allowed error for FI#17959

Draft
mchaarawi wants to merge 1 commit intomasterfrom
mschaara/fi_allow_err
Draft

DAOS-623 test: add allowed error for FI#17959
mchaarawi wants to merge 1 commit intomasterfrom
mschaara/fi_allow_err

Conversation

@mchaarawi
Copy link
Copy Markdown
Contributor

Skip-func-hw-test: true
Skip-unit-test: true
Skip-unit-test-memcheck: true

Steps for the author:

  • Commit message follows the guidelines.
  • Appropriate Features or Test-tag pragmas were used.
  • Appropriate Functional Test Stages were run.
  • At least two positive code reviews including at least one code owner from each category referenced in the PR.
  • Testing is complete. If necessary, forced-landing label added and a reason added in a comment.

After all prior steps are complete:

  • Gatekeeper requested (daos-gatekeeper added as a reviewer).

@github-actions
Copy link
Copy Markdown

github-actions bot commented Apr 9, 2026

Ticket title is 'Generic ticket for minor code cleanup and improvement'
Status is 'Resolved'
Labels: 'request_for_2.6.5,request_for_2.6.6,request_for_2.8,scrubbed_2.6.5'
Job should run at elevated priority (1)
https://daosio.atlassian.net/browse/DAOS-623

@github-actions github-actions bot added the priority Ticket has high priority (automatically managed) label Apr 9, 2026
@mchaarawi mchaarawi force-pushed the mschaara/fi_allow_err branch 3 times, most recently from 15e151e to 8b2915c Compare April 9, 2026 23:37
@daosbuild3
Copy link
Copy Markdown
Collaborator

@mchaarawi mchaarawi force-pushed the mschaara/fi_allow_err branch 2 times, most recently from 30da5ea to c242234 Compare April 10, 2026 00:09
Skip-func-hw-test: true
Skip-unit-test: true
Skip-unit-test-memcheck: true

Signed-off-by: Mohamad Chaarawi <mohamad.chaarawi@hpe.com>
@mchaarawi mchaarawi force-pushed the mschaara/fi_allow_err branch from c242234 to e436ce2 Compare April 10, 2026 04:03
if 'Sluggish EC boundary report from rank' in log_msg:
return False

if 'The progress callback was not called for too long' in log_msg:
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Would it be better to increase the default value of the swim_prot_period_len parameter for NLT/FI tests using the SWIM_PROTOCOL_PERIOD_LEN environment variable?

Comment on lines +5030 to +5031
if 'Sluggish EC boundary report from rank' in log_msg:
return False
Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Wouldn't it be better to implement a solution similar to SWIM (see the next comment), which allows us to configure the timeout (600) introduced by the #17309 PR using an environment variable?

cur_ts > eph_ldr->cte_server_ephs[i].re_ec_agg_eph_update_ts + 600)

			if (pool->sp_reclaim != DAOS_RECLAIM_DISABLED &&
			    cur_ts > eph_ldr->cte_server_ephs[i].re_ec_agg_eph_update_ts + 600)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

priority Ticket has high priority (automatically managed)

Development

Successfully merging this pull request may close these issues.

3 participants