A Survey of Spoken Dialogue Models (60 pages)
-
Updated
Nov 28, 2024
A Survey of Spoken Dialogue Models (60 pages)
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM
Survey spoken dialogue models with speech input and output, and explore their designs, timelines, and intermediate representations
Add a description, image, and links to the wavtokenizer topic page so that developers can more easily learn about it.
To associate your repository with the wavtokenizer topic, visit your repo's landing page and select "manage topics."