Feat/speak module by 74th · Pull Request #15 · 74th/websocket-control-stackchan

74th · 2026-03-09T12:49:04Z

This pull request adds support for speech synthesis (text-to-speech, TTS) to the StackChan server, enabling both VoiceVox and Google Gemini (Vertex AI) TTS backends. It introduces a new SpeakHandler for managing TTS streaming over WebSockets, updates the application and type system to support speech synthesizers, and improves logging and dependency management.

Speech Synthesis Integration:

Added the SpeakHandler class in stackchan_server/speak.py to handle text-to-speech streaming, segmenting, and playback over WebSockets, supporting both standard and streaming synthesizers.
Introduced SpeechSynthesizer and StreamingSpeechSynthesizer protocols, along with the AudioFormat dataclass, to standardize TTS interfaces in stackchan_server/types.py.
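For readers who have not opened the diff, the following is a minimal sketch of how a one-shot and a streaming synthesizer protocol could sit side by side. It is not the code from stackchan_server/types.py: the AudioFormat field names, the method names synthesize and synthesize_stream, the SilenceSynthesizer backend, and the iter_audio helper are all assumptions made for illustration.

```python
from __future__ import annotations

from collections.abc import AsyncIterator
from dataclasses import dataclass
from typing import Protocol, runtime_checkable


@dataclass(frozen=True)
class AudioFormat:
    # Hypothetical fields; the real dataclass may differ.
    sample_rate_hz: int
    channels: int
    sample_width_bytes: int


@runtime_checkable
class SpeechSynthesizer(Protocol):
    # One-shot interface (assumed name): return the whole utterance at once.
    async def synthesize(self, text: str) -> bytes: ...


@runtime_checkable
class StreamingSpeechSynthesizer(Protocol):
    # Streaming interface (assumed name): yield chunks as they become available.
    def synthesize_stream(self, text: str) -> AsyncIterator[bytes]: ...


class SilenceSynthesizer:
    """Toy backend that emits 10 ms of silent PCM per input character."""

    def __init__(self, fmt: AudioFormat) -> None:
        self.fmt = fmt

    def _bytes_per_char(self) -> int:
        frames = self.fmt.sample_rate_hz // 100  # 10 ms worth of frames
        return frames * self.fmt.channels * self.fmt.sample_width_bytes

    async def synthesize(self, text: str) -> bytes:
        return bytes(self._bytes_per_char() * len(text))

    async def synthesize_stream(self, text: str) -> AsyncIterator[bytes]:
        for _ in text:
            yield bytes(self._bytes_per_char())


async def iter_audio(synth: object, text: str) -> AsyncIterator[bytes]:
    """Prefer the streaming interface; fall back to one-shot synthesis."""
    if isinstance(synth, StreamingSpeechSynthesizer):
        async for chunk in synth.synthesize_stream(text):
            yield chunk
    elif isinstance(synth, SpeechSynthesizer):
        yield await synth.synthesize(text)
    else:
        raise TypeError("object implements neither synthesizer protocol")
```

A handler built this way only needs to consume iter_audio and forward each chunk to the socket, so a backend that cannot stream still works and simply produces a single large chunk.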

TTS Backend Implementations:

Added VoiceVoxSpeechSynthesizer for VoiceVox-based TTS in stackchan_server/speech_synthesis/voicevox.py, and GoogleCloudTextToSpeech for Google Gemini TTS in stackchan_server/speech_synthesis/google_cloud.py.
Created a factory function create_speech_synthesizer in stackchan_server/speech_synthesis/__init__.py to instantiate the default TTS backend.
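As a rough illustration of the factory idea, the sketch below selects a backend by name and falls back to a default. It does not reproduce create_speech_synthesizer: the function name make_synthesizer, the registry keys, the choice of "voicevox" as the default, and the two stand-in backend classes are hypothetical, and no network calls are made.

```python
from __future__ import annotations

from typing import Callable, Protocol


class SpeechSynthesizer(Protocol):
    # Assumed one-shot interface, for illustration only.
    async def synthesize(self, text: str) -> bytes: ...


class FakeVoiceVox:
    """Stand-in for a VoiceVox-backed synthesizer (no network calls)."""

    async def synthesize(self, text: str) -> bytes:
        return b"voicevox:" + text.encode("utf-8")


class FakeGoogleTts:
    """Stand-in for a Google-backed synthesizer (no network calls)."""

    async def synthesize(self, text: str) -> bytes:
        return b"google:" + text.encode("utf-8")


# Hypothetical registry; the backend keys are illustrative only.
_BACKENDS: dict[str, Callable[[], SpeechSynthesizer]] = {
    "voicevox": FakeVoiceVox,
    "google": FakeGoogleTts,
}


def make_synthesizer(backend: str = "voicevox") -> SpeechSynthesizer:
    """Return a synthesizer for `backend`, raising on unknown names."""
    try:
        factory = _BACKENDS[backend]
    except KeyError:
        raise ValueError(f"unknown TTS backend: {backend!r}") from None
    return factory()
```

Keeping construction behind one function means callers depend only on the protocol, so swapping the default backend does not touch application code.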

Application and WebSocket Proxy Updates:

Updated StackChanApp and WsProxy to accept and use a SpeechSynthesizer for TTS, passing it through the WebSocket handler for use in sessions.
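The wiring described above amounts to constructor injection: the application owns one synthesizer and hands it to each session object. A minimal sketch under that assumption follows. DemoApp, DemoProxy, open_session, speak, and UppercaseSynthesizer are hypothetical names and do not reflect the actual StackChanApp or WsProxy signatures.

```python
from __future__ import annotations

import asyncio
from typing import Awaitable, Callable, Protocol


class SpeechSynthesizer(Protocol):
    # Assumed one-shot interface, for illustration only.
    async def synthesize(self, text: str) -> bytes: ...


# Callable that writes one binary message to the client.
Send = Callable[[bytes], Awaitable[None]]


class DemoProxy:
    """Per-session object holding the send function and a synthesizer."""

    def __init__(self, send: Send, synthesizer: SpeechSynthesizer) -> None:
        self._send = send
        self._synthesizer = synthesizer

    async def speak(self, text: str) -> None:
        audio = await self._synthesizer.synthesize(text)
        await self._send(audio)


class DemoApp:
    """Holds the shared synthesizer and passes it to each new session."""

    def __init__(self, synthesizer: SpeechSynthesizer) -> None:
        self._synthesizer = synthesizer

    def open_session(self, send: Send) -> DemoProxy:
        return DemoProxy(send, self._synthesizer)


class UppercaseSynthesizer:
    """Toy backend: the 'audio' is just the upper-cased text as bytes."""

    async def synthesize(self, text: str) -> bytes:
        return text.upper().encode("utf-8")


async def demo() -> list[bytes]:
    sent: list[bytes] = []

    async def send(payload: bytes) -> None:
        sent.append(payload)  # a real handler would write to the WebSocket

    session = DemoApp(UppercaseSynthesizer()).open_session(send)
    await session.speak("hello")
    return sent
```

Calling asyncio.run(demo()) exercises the whole path with an in-memory send function, which is also a convenient shape for unit tests.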

Dependency and Logging Improvements:

Added google-genai as a required dependency and cleaned up extras in pyproject.toml.
Improved logging configuration in example_apps/echo.py for better debugging and observability.
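The exact logging changes in example_apps/echo.py are not shown here. As a hedged sketch of what a debugging-friendly setup for an example app might look like, the helper below uses only the standard library; the function name configure_logging, the format string, and the quiet parameter are assumptions.

```python
import logging
from typing import Iterable


def configure_logging(level: str = "INFO", quiet: Iterable[str] = ()) -> None:
    """Set up root logging with timestamps, levels, and logger names."""
    logging.basicConfig(
        level=getattr(logging, level.upper()),
        format="%(asctime)s %(levelname)-8s %(name)s: %(message)s",
        force=True,  # replace handlers installed by earlier calls (Python 3.8+)
    )
    for name in quiet:
        # Cap chatty third-party loggers so application messages stay readable.
        logging.getLogger(name).setLevel(logging.WARNING)
```

For example, configure_logging("DEBUG", quiet=["some.noisy.library"]) enables debug output for the application while keeping the named logger at WARNING; the logger name here is a placeholder.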

74th added 6 commits March 9, 2026 20:27

feat: implement SpeakHandler for audio synthesis and streaming

6f2d659

feat: add SpeechSynthesizer support and refactor SpeakHandler for audio synthesis

ef342de

feat: add GoogleCloudTextToSpeech integration for audio synthesis

6e8a9e4

feat: enhance SpeakHandler with debug recording and logging improvements

2fa9162

feat: refactor speech synthesis to support streaming and enhance audio format handling

74db05d

fix: ruff

0ebce90

74th merged commit 4008987 into main Mar 9, 2026
1 check passed

74th deleted the feat/speak-module branch March 9, 2026 12:52

74th merged 6 commits into main from feat/speak-module
