Findings

Official Documentation

Google's cookbook examples (Get_started_LiveAPI.py, Get_started_LiveAPI_NativeAudio.py) explicitly state:

Important: Use headphones. This script uses the system default audio input and output, which often won't include echo cancellation. So to prevent the model from interrupting itself it is important that you use headphones.

The root cause: VAD (Voice Activity Detection) on the server can't distinguish between the user speaking and the model's own audio leaking back through the mic. It treats echo as a user interruption and cancels the ongoing generation.

Solutions (ranked by practicality for our iOS app)

iOS AEC via .voiceChat mode — Use AVAudioSession with .voiceChat mode + .defaultToSpeaker. Check isEchoCancelledInputAvailable at runtime. This is the native platform solution.
Client-side mic suppression — Stop sending audio frames to the WebSocket while playback is active. Resume ~200-500ms after playback stops. Simple half-duplex approach, but prevents user barge-in.
NO_INTERRUPTION activity handling — Set activityHandling: NO_INTERRUPTION in the setup config. Model continues speaking even if VAD fires. Downside: user can't interrupt at all.
Disable auto-VAD + manual control — Set automaticActivityDetection.disabled: true, then send ActivityStart/ActivityEnd manually. Since we know when playback is happening, we can suppress activity signals during echo.
Tune VAD sensitivity — Set startOfSpeechSensitivity: LOW to raise the trigger threshold. Reduces false positives but community reports this alone is insufficient for speakerphone.
Proactive Audio (preview) — New feature where the model distinguishes speech directed at the device vs background audio. Could help ignore echo, but unconfirmed and in preview.

Key Community Links

Recommended Layered Strategy for Heard

Ensure we're using .voiceChat audio session mode (enables hardware AEC)
Tune VAD sensitivity to LOW as a baseline
Consider disabling auto-VAD and implementing echo-aware manual turn detection — we already know when playback is active
Fall back to mic suppression during playback if AEC proves insufficient on speakerphone

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Findings

Official Documentation

Solutions (ranked by practicality for our iOS app)

Key Community Links

Recommended Layered Strategy for Heard

FilesExpand file tree

interruption.md

Latest commit

History

interruption.md

File metadata and controls

Findings

Official Documentation

Solutions (ranked by practicality for our iOS app)

Key Community Links

Recommended Layered Strategy for Heard