May I ask whether the modality CFG mentioned in the LTX-2 paper is supported?
I notice that there is a related implementation in the inference code of the official repository (https://github.com/Lightricks/LTX-2), but I could not find it in this repository.
It seems that None is being passed directly here (perturbations=None):
|
vx, ax = self._forward(video=video, audio=audio, perturbations=None, use_gradient_checkpointing=use_gradient_checkpointing, use_gradient_checkpointing_offload=use_gradient_checkpointing_offload) |
Are there any plans to support modality CFG?
Thank you.
May I ask whether the modality CFG mentioned in the LTX-2 paper is supported?
I notice that there is a related implementation in the inference code of the official repository (https://github.com/Lightricks/LTX-2), but I could not find it in this repository.
It seems that None is being passed directly here (perturbations=None):
DiffSynth-Studio/diffsynth/models/ltx2_dit.py
Line 1682 in 84a75de
Are there any plans to support modality CFG?
Thank you.