[BUG-447] Idempotency bug: OOO loop when response is lost but subsequent batches succeed by fresh-borzoni · Pull Request #448 · apache/fluss-rust

fresh-borzoni · 2026-03-20T15:44:10Z

Summary

closes #447

Fix idempotent writer infinite retry on OutOfOrderSequenceException when response is lost

When a batch response is lost (e.g. timeout) but the server committed it, and subsequent higher-sequence batches are acked, retrying the original batch causes the server to return OutOfOrderSequenceException. The client's !is_next retry heuristic incorrectly treats this as retriable, looping until retries exhaust and then failing a successfully committed batch.

Kafka deals with it at the protocol level with epoch bumping, we don't need here, so we fix at the client level.

Fix: before checking can_retry, if the batch sequence <= last_acked_sequence, complete it as success - a higher-sequence ack guarantees the batch was committed (server enforces sequential ordering).

Backport of apache/fluss#2827

…bsequent batches succeed

fresh-borzoni · 2026-03-20T15:52:19Z

@luoyuxia @leekeiabstraction @charlesdong1991 PTAL 🙏

leekeiabstraction

Nice catch and thank you for the PR!

I think this will be challenging to test (manually), is there a way to reproduce this to ensure that the change works? Curious about how this was detected in the first place.

~~Does Java side need the same change given that we mostly base on Java implementation?~~

Nvm: just saw Java side's Pr. TY.

crates/fluss/src/client/write/sender.rs

fresh-borzoni · 2026-03-20T21:37:14Z

@leekeiabstraction Ty for the review, there is unit test that checks this scenario, also updated comment.
PTAL 🙏

luoyuxia

+1

[BUG-447] Idempotency bug: OOO exception when response is lost but su…

36f4ebf

…bsequent batches succeed

fresh-borzoni changed the title ~~[BUG-447] Idempotency bug: OOO exception when response is lost but su…~~ [BUG-447] Idempotency bug: OOO loop when response is lost but subsequent batches succeed Mar 20, 2026

leekeiabstraction reviewed Mar 20, 2026

View reviewed changes

crates/fluss/src/client/write/sender.rs Outdated Show resolved Hide resolved

change comment

2c7cdec

luoyuxia approved these changes Mar 21, 2026

View reviewed changes

luoyuxia merged commit e26702e into apache:main Mar 21, 2026
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BUG-447] Idempotency bug: OOO loop when response is lost but subsequent batches succeed#448

[BUG-447] Idempotency bug: OOO loop when response is lost but subsequent batches succeed#448
luoyuxia merged 2 commits intoapache:mainfrom
fresh-borzoni:fix-ooo-sequence-lost-response

fresh-borzoni commented Mar 20, 2026 •

edited

Loading

Uh oh!

fresh-borzoni commented Mar 20, 2026

Uh oh!

leekeiabstraction left a comment •

edited

Loading

Uh oh!

Uh oh!

fresh-borzoni commented Mar 20, 2026

Uh oh!

luoyuxia left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

fresh-borzoni commented Mar 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Uh oh!

fresh-borzoni commented Mar 20, 2026

Uh oh!

leekeiabstraction left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

fresh-borzoni commented Mar 20, 2026

Uh oh!

luoyuxia left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fresh-borzoni commented Mar 20, 2026 •

edited

Loading

leekeiabstraction left a comment •

edited

Loading