Gemini Live Speech-to-Text and Text-to-Speech Evaluations with Auditing Audio Session with ADK #1984

Open
KVishnuVardhanR wants to merge 4 commits into google:main from KVishnuVardhanR:eval_audit

Conversation

KVishnuVardhanR commented on May 25, 2026

This PR adds a Gemini Live Speech-to-Text (STT) and Text-to-Speech (TTS) evaluation and audio session recording framework built on the Agent Development Kit (ADK), the Gemini Live API (gemini-live-2.5-flash-native-audio), and FastAPI.

The sample is intended for developers building production voice-native conversational AI systems that need post-session compliance audits, automated archival workflows, and real-time Automatic Speech Recognition (ASR) performance benchmarking against ground-truth datasets.
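
ASR output is commonly benchmarked against a ground-truth transcript with word error rate (WER): the word-level edit distance divided by the number of reference words. The description above does not say which metric the sample uses, so the following is only an illustrative sketch. The function name `word_error_rate` and the lowercase, whitespace-split normalization are assumptions, not code from this PR.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER = word-level edit distance / number of reference words.

    Hypothetical helper for illustration; normalization here is only
    lowercasing and whitespace tokenization.
    """
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                      # deletion (reference word missed)
                curr[j - 1] + 1,                  # insertion (extra hypothesis word)
                prev[j - 1] + substitution_cost,  # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One substituted word out of four reference words.
    print(word_error_rate("turn on the lights", "turn on the light"))
```

A production evaluation would likely also normalize punctuation and numerals before scoring, since those differences inflate WER without reflecting recognition quality.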

KVishnuVardhanR changed the title from "Gemini Live Speech-to-Text Evaluation & Auditing Audio Session Sample with FastAPI" to "Gemini Live Speech-to-Text Evaluation & Auditing Audio Session Sample with ADK" on May 25, 2026
KVishnuVardhanR changed the title from "Gemini Live Speech-to-Text Evaluation & Auditing Audio Session Sample with ADK" to "Gemini Live Speech-to-Text Evaluation & Auditing Audio Session with ADK" on May 25, 2026
KVishnuVardhanR changed the title from "Gemini Live Speech-to-Text Evaluation & Auditing Audio Session with ADK" to "Gemini Live Speech-to-Text and Text-to-Speech Evaluations with Auditing Audio Session with ADK" on May 31, 2026