Add RPC provider failover for block ingestor by dimitrovmaksim · Pull Request #6430 · graphprotocol/graph-node

dimitrovmaksim · 2026-03-10T13:41:20Z

Resolves #6213
Related open issues: #5313, #5575

When the block ingestor's polling fails and the current RPC provider is unreachable, the ingestor now automatically switches to a healthy alternative provider. If no alternative providers are configured, the current one is retried indefinitely, as before.

How it works

On do_poll() failure, the current provider is probed via eth_blockNumber first — do_poll() can fail for non-RPC reasons (DB errors, chain reorgs), so switching would not help in those cases.
If the current provider is unreachable, all other validated providers are probed in parallel. The first to respond is selected.
If all providers are unreachable, the ingestor stays on the current provider and re-probes on the next failure.
There is no automatic return to the original provider — the ingestor stays on whatever provider it switched to until that one fails.
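
The decision flow described above can be sketched roughly as follows. This is an illustration only, not the code from this PR: the `Provider` trait, `StaticProvider`, `Decision`, and `decide_after_poll_failure` are hypothetical names, the probe is a synchronous stand-in for an `eth_blockNumber` call, and plain scoped threads stand in for whatever async machinery the ingestor actually uses.

```rust
use std::sync::mpsc;
use std::thread;

/// Hypothetical stand-in for a lightweight provider health check.
trait Provider: Sync {
    fn is_reachable(&self) -> bool;
}

/// Hypothetical test double whose reachability is fixed up front.
struct StaticProvider {
    up: bool,
}

impl Provider for StaticProvider {
    fn is_reachable(&self) -> bool {
        self.up
    }
}

/// What to do after a failed poll.
#[derive(Debug, PartialEq)]
enum Decision {
    /// The current provider answered, so the failure was not an RPC outage.
    KeepCurrent,
    /// The current provider is down; switch to the provider at this index.
    SwitchTo(usize),
    /// Nothing answered; stay on the current provider and re-probe later.
    AllUnreachable,
}

fn decide_after_poll_failure<P: Provider>(providers: &[P], current: usize) -> Decision {
    // Step 1: probe the current provider first to rule out non-RPC failures.
    if providers[current].is_reachable() {
        return Decision::KeepCurrent;
    }

    // Step 2: probe every other provider in parallel.
    let (tx, rx) = mpsc::channel();
    thread::scope(|s| {
        for (idx, provider) in providers.iter().enumerate() {
            if idx == current {
                continue;
            }
            let tx = tx.clone();
            s.spawn(move || {
                if provider.is_reachable() {
                    // Ignore send errors; the receiver outlives the scope anyway.
                    let _ = tx.send(idx);
                }
            });
        }
    });

    // Step 3: the first message in the channel came from the first provider
    // to respond. Scoped threads are joined before we get here, so this
    // sketch waits for the slowest probe before deciding.
    match rx.try_recv() {
        Ok(idx) => Decision::SwitchTo(idx),
        Err(_) => Decision::AllUnreachable,
    }
}
```

Note that nothing in the sketch moves back to the original provider: a caller would store the index from `SwitchTo` as the new current provider and keep it until a later poll failure triggers another probe.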

Other changes

latest_block_ptr retries are now limited to ENV_VARS.request_retries instead of infinite, so failures surface to the failover logic.
retry_strategy uses saturating_sub to avoid underflow when limit is 0.
Added is_reachable() to the EthereumAdapter trait for lightweight provider health checks.
Added all_cheapest() to EthereumNetworkAdapters to expose all validated providers.
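
To illustrate why bounding the latest_block_ptr retries matters, here is a minimal sketch of a retry loop with a limit. It is not graph-node's retry API: `call_with_retries` and its parameters are hypothetical, and `max_retries` merely plays the role that ENV_VARS.request_retries plays in the PR. The point is that once the budget is spent, the error is returned to the caller, which is what gives failover logic a chance to run.

```rust
/// Hypothetical helper: runs `op` once, then retries up to `max_retries`
/// more times. The last error is returned instead of looping forever.
fn call_with_retries<T, E>(
    max_retries: usize,
    mut op: impl FnMut() -> Result<T, E>,
) -> Result<T, E> {
    let mut retries_left = max_retries;
    loop {
        match op() {
            Ok(value) => return Ok(value),
            // Budget exhausted: surface the failure to the caller.
            Err(err) if retries_left == 0 => return Err(err),
            // Otherwise consume one retry and try again.
            Err(_) => retries_left -= 1,
        }
    }
}
```

With an unbounded loop the `Err` arm would never return, and a dead provider would stall the caller indefinitely.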

DaMandal0rian · 2026-03-11T21:01:57Z

@dimitrovmaksim seems related to #6126 ?

dimitrovmaksim · 2026-03-11T22:14:12Z

> @dimitrovmaksim seems related to #6126 ?

@DaMandal0rian Yeah, I noticed that PR today while going through the issues/PRs and started wondering whether the two conflict (I haven't checked myself). I'm okay with this one being closed if the other one handles this issue and gets approved/merged.

When do_poll() fails, the ingestor now probes the current provider before switching. If the current provider is unreachable, all alternatives are probed in parallel and the first healthy one is selected. This avoids unnecessary switches on non-RPC failures (e.g. DB errors, chain reorgs). Also limits latest_block_ptr retries to ENV_VARS.request_retries so failures surface to the failover logic instead of retrying indefinitely.

fordN requested a review from isum March 10, 2026 15:49

dimitrovmaksim self-assigned this Mar 19, 2026

isum approved these changes Mar 26, 2026

dimitrovmaksim added 4 commits March 27, 2026 13:03

graph: Fix retry_strategy underflow when limit is 0

2795ee9

Use saturating_sub to avoid underflow when limit is set to 0.
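
A small sketch of the underflow in question, assuming the strategy subtracts one from an unsigned limit (the actual expression in retry_strategy is not shown on this page, and `retries_after_first_attempt` is a hypothetical name). With a `usize`, `0 - 1` panics in debug builds and wraps to `usize::MAX` in release builds, while `saturating_sub` clamps the result at zero.

```rust
/// Hypothetical helper: how many retries remain once the first attempt
/// has been counted against `limit`.
fn retries_after_first_attempt(limit: usize) -> usize {
    // `limit - 1` would underflow for `limit == 0`; this clamps at 0.
    limit.saturating_sub(1)
}
```

For comparison, `0usize.checked_sub(1)` returns `None` and `0usize.wrapping_sub(1)` returns `usize::MAX`, which shows what an unchecked subtraction would produce in a release build.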

docs: Document block ingestor failover behavior

a63adfa

chain/ethereum: Return provider directly from resolve_provider

4d1e46a

Simplify resolve_provider_idx into resolve_provider by returning a reference to the Arc<A> instead of a usize index, eliminating the separate indexing step at the call site.
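
The shape of this refactor might look like the sketch below. Only the two function names and the `Arc<A>` return type come from the commit message; the `Adapter` struct, its fields, the `healthy` selection rule, and the `Option` wrapper are assumptions made up for illustration.

```rust
use std::sync::Arc;

/// Hypothetical adapter type standing in for the generic `A`.
struct Adapter {
    name: &'static str,
    healthy: bool,
}

/// Before: return an index, leaving the caller to index into the slice.
fn resolve_provider_idx(providers: &[Arc<Adapter>]) -> Option<usize> {
    providers.iter().position(|p| p.healthy)
}

/// After: return a reference to the `Arc` itself.
fn resolve_provider(providers: &[Arc<Adapter>]) -> Option<&Arc<Adapter>> {
    providers.iter().find(|p| p.healthy)
}
```

With the index version a call site needs two steps, `resolve_provider_idx(&providers)` followed by `&providers[idx]`. The reference version collapses that into one call and removes any chance of indexing with a stale or mismatched index.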

dimitrovmaksim force-pushed the rpc-provider-failover branch from 469986d to 4d1e46a Compare March 27, 2026 11:09

dimitrovmaksim merged commit 277f45e into graphprotocol:master Mar 27, 2026
6 checks passed

dimitrovmaksim deleted the rpc-provider-failover branch April 3, 2026 09:29

dimitrovmaksim merged 4 commits into graphprotocol:master from dimitrovmaksim:rpc-provider-failover