A Survey of Spoken Dialogue Models (60 pages)
-
Updated
Nov 28, 2024
A Survey of Spoken Dialogue Models (60 pages)
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM
Survey spoken dialogue models with speech input and output, and explore their designs, timelines, and intermediate representations
Add a description, image, and links to the mini-omni topic page so that developers can more easily learn about it.
To associate your repository with the mini-omni topic, visit your repo's landing page and select "manage topics."