feat: Add S3 support for artifacts #29
Conversation
Summary of Changes

Hello @wmsnp, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed! This pull request significantly enhances the artifact storage capabilities by integrating S3-compatible object storage. It provides a robust, self-hosted option for managing application artifacts, ensuring data persistence and version control. The new service is designed for high performance through asynchronous operations and includes comprehensive testing to guarantee reliability.
Code Review
This pull request introduces an S3ArtifactService to support artifact storage on S3-compatible services. The implementation is robust, featuring asynchronous operations and atomic versioning with a retry mechanism. The accompanying unit tests are comprehensive and effectively mock the S3 interactions. I've identified a few areas for improvement, including a performance optimization for listing artifact versions, adding a test case for the version conflict retry logic, and some minor code cleanup in the project dependencies and tests. Overall, this is a solid contribution.
…equests and add version conflicts test for save_artifact
Can we get this PR or the other one referenced merged? Is there anything I can do to help speed this along?
Hey @wmsnp — I've opened #115 which supersedes both this PR and my earlier #36. Your implementation here was a big influence — I adopted both the
Thanks for pioneering the async approach — it clearly belongs in the final version. 🙏
Thanks — really appreciate that. One thing I found later, though, is that not all S3-compatible implementations support
@gemini-cli /review
🤖 Hi @DeanChensj, I've received your request, and I'm working on it now! You can track my progress in the logs for more details.
Thanks for the contribution! The S3ArtifactService implementation is solid and follows the established patterns. I've left a few comments regarding:
- Performance: The `list_artifact_versions` method performs a `head_object` call for every version, which could be a bottleneck.
- Metadata Limits: S3 has a 2 KB limit on metadata that we should be aware of when flattening custom metadata.
- Robustness: A finite default for retries in `save_artifact` might be safer than infinite.
Overall, great work!
```python
async def _client(self):
    session = await self._session()
    async with session.client(service_name="s3", **self.aws_configs) as s3:
        yield s3
```
S3 metadata has a total size limit of 2 KB (including keys and values). Since custom_metadata is flattened into JSON strings here, large metadata dictionaries might cause put_object to fail. It might be worth adding a check or a note about this limit.
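One way to surface this failure mode early, sketched here with hypothetical names (the PR's actual flattening helper isn't shown in this thread):

```python
import json

# S3 caps user-defined object metadata at 2 KB total (keys plus values,
# measured as UTF-8 bytes); exceeding it causes put_object to fail.
S3_METADATA_LIMIT_BYTES = 2 * 1024

def flatten_custom_metadata(custom_metadata: dict) -> dict:
    """Flatten values to JSON strings and validate the total size up front."""
    flattened = {key: json.dumps(value) for key, value in custom_metadata.items()}
    total = sum(
        len(k.encode("utf-8")) + len(v.encode("utf-8"))
        for k, v in flattened.items()
    )
    if total > S3_METADATA_LIMIT_BYTES:
        raise ValueError(
            f"Flattened metadata is {total} bytes; S3 allows at most "
            f"{S3_METADATA_LIMIT_BYTES} bytes of user-defined metadata."
        )
    return flattened
```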
```python
elif artifact.file_data:
    raise NotImplementedError(
        "Saving artifact with file_data is not supported yet in"
        " S3ArtifactService."
    )
```
With save_artifact_max_retries set to -1 (infinite), this loop could theoretically run forever if there's a consistent race condition or a logic error in version calculation. A high but finite default might be safer.
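A bounded loop keeps the same optimistic-concurrency behavior while guaranteeing termination. A sketch under assumed names (`VersionConflictError` and the single-attempt callable are placeholders, not the PR's actual API):

```python
class VersionConflictError(Exception):
    """Raised when another writer claims the version we computed."""

async def save_with_bounded_retries(attempt_save, max_retries: int = 5):
    """Run one atomic save attempt up to `max_retries` times.

    `attempt_save` is an async callable that computes the next version,
    performs a conditional write, and raises VersionConflictError on a race.
    """
    last_error = None
    for _ in range(max_retries):
        try:
            return await attempt_save()
        except VersionConflictError as err:
            last_error = err  # another writer won; recompute and try again
    raise RuntimeError(
        f"save_artifact gave up after {max_retries} version conflicts"
    ) from last_error
```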
```python
metadata = head.get("Metadata", {})
# ...
canonical_uri = f"s3://{self.bucket_name}/{obj['Key']}"
```
Calling head_object for every version in a loop will be very slow if an artifact has many versions (O(N) network calls). S3 doesn't return custom metadata in list_objects_v2, so this might be necessary if metadata is required, but we should consider if there's a way to cache or avoid this for large version sets.
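Even if the per-version `head_object` can't be avoided, fanning the calls out concurrently would hide most of the latency. A sketch with illustrative names, assuming an already-open async S3 client:

```python
import asyncio

async def head_all_versions(s3, bucket, keys, max_concurrency=16):
    """Issue head_object calls concurrently with a bounded fan-out.

    Still O(N) requests overall, but wall-clock time drops from N round
    trips to roughly N / max_concurrency.
    """
    semaphore = asyncio.Semaphore(max_concurrency)

    async def head_one(key):
        async with semaphore:
            return await s3.head_object(Bucket=bucket, Key=key)

    return await asyncio.gather(*(head_one(key) for key in keys))
```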
Link to Issue or Description of Change
Description
Introduce `S3ArtifactService` to provide a self-hosted artifact storage solution.

Solution

- `aioboto3` for asynchronous access to S3-compatible object storage

Testing Plan
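The testing plan isn't spelled out in this excerpt. As a hedged illustration of what a local end-to-end check could look like, here is a smoke test against an S3-compatible endpoint; the MinIO address, credentials, and bucket name are all assumptions, not part of the PR:

```python
import asyncio

import aioboto3

async def smoke_test():
    session = aioboto3.Session(
        aws_access_key_id="minioadmin",      # assumption: default MinIO creds
        aws_secret_access_key="minioadmin",
    )
    # endpoint_url points at a local S3-compatible store instead of AWS.
    async with session.client("s3", endpoint_url="http://localhost:9000") as s3:
        await s3.create_bucket(Bucket="artifacts")
        await s3.put_object(Bucket="artifacts", Key="demo/v0", Body=b"hello")
        head = await s3.head_object(Bucket="artifacts", Key="demo/v0")
        assert head["ContentLength"] == 5

asyncio.run(smoke_test())
```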