feat(cloud-storage): add pluggable cloud storage support for distributed HugeGraph persistence#3061
Draft
vaijosh wants to merge 4 commits into
Draft
feat(cloud-storage): add pluggable cloud storage support for distributed HugeGraph persistence#3061vaijosh wants to merge 4 commits into
vaijosh wants to merge 4 commits into
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Why
This PR introduces a unified cloud storage capability for HugeGraph’s distributed stack, so store data can be synchronized to cloud object storage for stronger durability and recovery options.
The focus is not a provider-specific implementation detail, but a cloud-capable architecture with a default S3-compatible provider and extension points for additional providers.
What this PR delivers
1) Cloud storage support in distributed runtime
backend=hstoreflow).2) Pluggable provider architecture
ServiceLoaderdiscovery so new cloud providers can be added via plugin JARs.s3provider as default (S3-compatible API model), while allowing future providers without core rewrites.3) Cloud-neutral configuration model
cloud_*).4) Operational docs and examples
User Impact
backend=hstore) can choose their optimal trade-off between throughput and durability:sync): Guarantees zero data loss by ensuring local Raft commits inline with cloud storage flushes before returning success to the caller.async): Minimizes latency by performing background reconciliation to the cloud storage bucket within a bounded time horizon.Validation
Reviewer focus areas