feat(voice-server): Add Google Cloud TTS as alternative provider#687
Open
fayerman-source wants to merge 1 commit intodanielmiessler:mainfrom
Open
feat(voice-server): Add Google Cloud TTS as alternative provider#687fayerman-source wants to merge 1 commit intodanielmiessler:mainfrom
fayerman-source wants to merge 1 commit intodanielmiessler:mainfrom
Conversation
Adds Google Cloud Text-to-Speech as a second TTS backend alongside
ElevenLabs. Provider is selected via settings.json daidentity.ttsProvider
("elevenlabs" or "google-cloud"). Defaults to ElevenLabs for backwards
compatibility.
Google Cloud TTS supports WaveNet, Neural2, and Standard voice types,
configurable via daidentity.googleCloudVoice in settings.json. Uses the
REST API directly (no SDK dependency) with GOOGLE_CLOUD_API_KEY from
~/.env.
Free tier comparison: Google Cloud offers 4M chars/month (Standard) vs
ElevenLabs' 10K chars/month.
Closes danielmiessler#682
Collaborator
|
Nice addition of Google Cloud TTS! This has merge conflicts with recent VoiceServer changes (commit 95d65cc). Could you rebase on main? We'll merge once the conflicts are resolved. Thanks! 🙏 |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
Adds Google Cloud Text-to-Speech as a second TTS backend alongside ElevenLabs:
settings.json→daidentity.ttsProvider("elevenlabs"or"google-cloud")ttsProvideris not setfetch(no SDK)daidentity.googleCloudVoice(language, voice name, type, rate, pitch)Why Google Cloud TTS
Configuration
Add to
~/.env:Add to
~/.claude/settings.json:{ "daidentity": { "ttsProvider": "google-cloud", "googleCloudVoice": { "languageCode": "en-US", "voiceName": "en-US-Neural2-D", "voiceType": "NEURAL2", "speakingRate": 1.0, "pitch": 0.0 } } }Or keep using ElevenLabs by not setting
ttsProvider(or setting it to"elevenlabs").Files Changed
Releases/v3.0/.claude/VoiceServer/server.ts— Multi-provider TTS routing, Google Cloud TTS implementationContext
This is a re-implementation of PR #285 (merged 2026-01-01, lost in v3.0 restructuring) targeting the current v3.0 architecture. The original code lived in
Packs/kai-voice-system/which no longer exists.Closes #682