[TTS][Magpietts] Added CFG distillation #15568

Open: artem-gorodetskii wants to merge 1 commit into main from magpietts_online_cfg_distillation

@artem-gorodetskii (Collaborator) commented Mar 31, 2026

Summary

This PR adds support for online classifier-free guidance (CFG) distillation for MagpieTTS.

What’s included

  • Added a new OnlineCFGDistillation model that extends MagpieTTSModel for online CFG distillation training.
  • Added MagpieTTS distillation loss implementations, including KL-divergence, cross-entropy, and normalized RMSE losses.
  • Added a new training mode, online_cfg_distillation_train, to examples/tts/magpietts.py.
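For readers unfamiliar with the three loss families named above, here is a minimal, framework-free sketch of what each one typically computes for a single prediction step. It is not the code from this PR: the function names, the direction of the KL term (teacher to student), and the choice to normalize the RMSE by the teacher's root-mean-square value are all assumptions made for illustration.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of floats."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]


def kl_divergence(teacher_logits, student_logits):
    """KL(teacher || student) for one categorical distribution.

    Assumption: the teacher distribution is the reference; the PR may
    use a different direction or a temperature.
    """
    t = log_softmax(teacher_logits)
    s = log_softmax(student_logits)
    return sum(math.exp(tl) * (tl - sl) for tl, sl in zip(t, s))


def cross_entropy(student_logits, target_index):
    """Negative log-likelihood of the target token under the student."""
    return -log_softmax(student_logits)[target_index]


def normalized_rmse(teacher_values, student_values, eps=1e-8):
    """RMSE between two vectors, divided by the teacher's RMS value.

    Assumption: "normalized" means scaling by the teacher's magnitude.
    """
    n = len(teacher_values)
    rmse = math.sqrt(
        sum((t - s) ** 2 for t, s in zip(teacher_values, student_values)) / n
    )
    scale = math.sqrt(sum(t * t for t in teacher_values) / n)
    return rmse / (scale + eps)
```

In this sketch the KL and normalized RMSE terms compare the student against the teacher's outputs, while the cross-entropy term scores the student against a discrete target token.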

Training flow

  • The teacher model is loaded from a checkpoint and frozen.
  • The teacher performs autoregressive CFG rollout generation.
  • The generated rollout is fed into the student in teacher-forced mode.
  • The student is optimized using a weighted combination of KL-divergence, cross-entropy, normalized RMSE, and optional MoE loss.
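The steps above can be sketched in plain Python as follows. This is a toy illustration under stated assumptions, not the OnlineCFGDistillation implementation: `teacher_step` and `student_step` are hypothetical callables that map a token prefix to a list of logits, the guidance formula is the standard CFG interpolation between conditional and unconditional logits, and the student is assumed to run a single pass per step with no guidance at inference time.

```python
import math
import random


def cfg_combine(cond_logits, uncond_logits, scale):
    """Standard CFG mix: uncond + scale * (cond - uncond)."""
    return [u + scale * (c - u) for c, u in zip(cond_logits, uncond_logits)]


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    z = sum(exps)
    return [e / z for e in exps]


def teacher_rollout(teacher_step, num_steps, cfg_scale, rng):
    """Autoregressive rollout from a frozen teacher using CFG.

    teacher_step(prefix, conditioned) -> logits is a hypothetical
    interface; it is called twice per step (with and without context).
    """
    tokens, guided_logits = [], []
    for _ in range(num_steps):
        cond = teacher_step(tokens, True)
        uncond = teacher_step(tokens, False)
        logits = cfg_combine(cond, uncond, cfg_scale)
        probs = softmax(logits)
        token = rng.choices(range(len(probs)), weights=probs, k=1)[0]
        guided_logits.append(logits)
        tokens.append(token)
    return tokens, guided_logits


def student_teacher_forced(student_step, tokens):
    """Run the student on the teacher's prefixes (one pass per step)."""
    return [student_step(tokens[:t]) for t in range(len(tokens))]


def total_loss(losses, weights):
    """Weighted sum of named loss terms, e.g. kl, ce, nrmse, moe."""
    return sum(weights[name] * value for name, value in losses.items())
```

The per-step student logits from `student_teacher_forced` would then be compared with `guided_logits` and the sampled `tokens` to produce the individual loss terms that `total_loss` combines; the weight names and values are placeholders.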

@artem-gorodetskii force-pushed the magpietts_online_cfg_distillation branch 4 times, most recently from 199e69e to 6500845, on March 31, 2026 15:19
@artem-gorodetskii self-assigned this on Mar 31, 2026
@artem-gorodetskii force-pushed the magpietts_online_cfg_distillation branch 2 times, most recently from 6fabaa8 to d7efe7d, on April 3, 2026 10:57
@github-actions bot added the CI label on Apr 3, 2026
@artem-gorodetskii force-pushed the magpietts_online_cfg_distillation branch 5 times, most recently from 7b08f50 to 5f08f95, on April 3, 2026 16:47

2 participants