fix: preserve cache_metadata in VertexAiSessionService event round-trip #4714

OiPunk wants to merge 1 commit into google:main
Conversation
`VertexAiSessionService` was dropping `cache_metadata` and `usage_metadata` fields during `Event` serialization/deserialization. This caused `ContextCacheRequestProcessor` to never find previous cache metadata, creating a new cache on every LLM call instead of reusing existing ones.

The fix adds `cache_metadata` and `usage_metadata` to the `event_metadata` dict during `append_event` (write path) and restores them in `_from_api_event` (read path), matching the behavior of other session service implementations.

Fixes google#4698
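The write/read symmetry described above can be sketched with plain dicts standing in for the Pydantic metadata models; the function names below are illustrative assumptions, not the actual source:

```python
# Hypothetical sketch of the round-trip fix. Plain dicts stand in for the
# Pydantic metadata models; only the field names come from the PR.

_METADATA_FIELDS = ('grounding_metadata', 'cache_metadata', 'usage_metadata')

def event_to_api_metadata(event: dict) -> dict:
    """Write path (append_event): serialize metadata fields, now including
    cache_metadata and usage_metadata so they are no longer dropped."""
    return {f: event[f] for f in _METADATA_FIELDS if event.get(f) is not None}

def api_event_to_event(api_metadata: dict) -> dict:
    """Read path (_from_api_event): restore the same fields, defaulting to
    None when the API response omits them."""
    return {f: api_metadata.get(f) for f in _METADATA_FIELDS}

# Round-trip: cache_metadata survives append -> get_session.
event = {'cache_metadata': {'fingerprint': 'abc123hash'}, 'usage_metadata': None}
restored = api_event_to_event(event_to_api_metadata(event))
assert restored['cache_metadata'] == {'fingerprint': 'abc123hash'}
```

Before the fix, the write path only serialized fields like `grounding_metadata`, so the read path had nothing to restore for caching.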
Activity
Code Review
This pull request correctly addresses the issue where cache_metadata and usage_metadata were not being preserved during the event round-trip in VertexAiSessionService. The changes to serialize and deserialize these fields are implemented correctly, following the existing pattern for other metadata. The addition of a new unit test, test_append_event_with_cache_and_usage_metadata, is great for verifying the fix. I have one suggestion to make the new test more concise and maintainable.
```python
# cache_metadata is preserved
assert appended_event.cache_metadata is not None
assert appended_event.cache_metadata.cache_name == (
    'projects/123/locations/us-central1/cachedContents/456'
)
assert appended_event.cache_metadata.fingerprint == 'abc123hash'
assert appended_event.cache_metadata.invocations_used == 3
assert appended_event.cache_metadata.contents_count == 10
assert appended_event.cache_metadata.created_at == 1700000000.0
# usage_metadata is preserved
assert appended_event.usage_metadata is not None
assert appended_event.usage_metadata.prompt_token_count == 100
assert appended_event.usage_metadata.candidates_token_count == 50
assert appended_event.usage_metadata.total_token_count == 150
assert appended_event.usage_metadata.cached_content_token_count == 80
```
To improve the readability and maintainability of this test, you can directly compare the cache_metadata and usage_metadata objects with the originals (cache_meta and usage_meta) instead of asserting each field individually. Pydantic models, which are used here, support equality comparison out of the box. This makes the test more concise and robust against future changes to these models.
```python
# cache_metadata is preserved
assert appended_event.cache_metadata == cache_meta
# usage_metadata is preserved
assert appended_event.usage_metadata == usage_meta
```
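The reviewer's suggestion relies on models generating field-wise equality. Python dataclasses provide the same semantics as Pydantic here; this dependency-free sketch (with an assumed, simplified `CacheMetadata` shape) shows why one assertion covers every field:

```python
from dataclasses import dataclass

# Dataclasses, like Pydantic models, auto-generate a field-wise __eq__,
# so a single equality assertion compares every field at once.
# This CacheMetadata is a simplified stand-in, not the real model.
@dataclass
class CacheMetadata:
    cache_name: str
    fingerprint: str
    invocations_used: int

a = CacheMetadata(
    'projects/123/locations/us-central1/cachedContents/456', 'abc123hash', 3)
b = CacheMetadata(
    'projects/123/locations/us-central1/cachedContents/456', 'abc123hash', 3)

assert a == b  # distinct instances, identical fields
assert a != CacheMetadata(a.cache_name, 'other', 3)  # any mismatch fails
```

The trade-off: a whole-object comparison is terser and stays correct as fields are added, while per-field assertions give more precise failure messages.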
Summary
`VertexAiSessionService` drops `cache_metadata` and `usage_metadata` fields when serializing/deserializing `Event` objects. This causes `ContextCacheRequestProcessor` to never find previous cache metadata from session events, so it creates a new context cache on every LLM call instead of reusing existing ones.

Root cause

- Write path (`append_event`): `cache_metadata` and `usage_metadata` were not included in the `event_metadata` dict sent to the Vertex AI API.
- Read path (`_from_api_event`): `cache_metadata` and `usage_metadata` were not extracted from the API response when reconstructing the `Event` object.

Other session service implementations (`InMemorySessionService`, database-backed `StorageEvent`) preserve these fields correctly.

Fix

- Serialize `cache_metadata` and `usage_metadata` into `event_metadata` during `append_event`.
- Restore them in `_from_api_event` using `_session_util.decode_model`, consistent with how `grounding_metadata` is already handled.

Test plan

- Added `test_append_event_with_cache_and_usage_metadata` that verifies both fields survive a full round-trip (append -> get_session).

Fixes #4698