wavtokenizer

Here are 3 public repositories matching this topic...

A Survey of Spoken Dialogue Models (60 pages)

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

audio text-to-speech streaming transformers tts codec omni voice-assistant neural-speech-synthesis mbzuai llm multimodal-large-language-models mini-omni wavtokenizer audiollm llmvox

Survey spoken dialogue models with speech input and output, and explore their designs, timelines, and intermediate representations
