Taming Hallucinations

Taming Hallucinations: Boosting MLLMs’ Video Understanding via Counterfactual Video Generation

CVPR 2026 Findings

🏠 Project Page | Paper | Dataset

TL;DR: Taming Hallucinations introduces DualityForge, a controllable diffusion-based framework that turns real videos into counterfactual ones, automatically generating paired videos and QA data for contrastive training. Based on the large-scale DualityVidQA dataset and the proposed DNA-Train SFT–RL regime with ℓ1-normalized advantages, our approach reduces hallucinations in multimodal LLMs by 24% and shows strong generalization across benchmarks. Dataset and code will be released.

Method Overview

Update

📎 Citation

If you find this repository useful, please consider citing:

@article{huang2025taming,
  title={Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation},
  author={Huang, Zhe and Wen, Hao and Hao, Aiming and Song, Bingze and Wu, Meiqi and Wu, Jiahong and Chu, Xiangxiang and Lu, Sheng and Wang, Haoqian},
  journal={arXiv preprint arXiv:2512.24271},
  year={2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
assets		assets
static		static
LICENSE		LICENSE
README.md		README.md
index.html		index.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Taming Hallucinations

🏠 Project Page | Paper | Dataset

Method Overview

Update

📎 Citation

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Taming Hallucinations

🏠 Project Page | Paper | Dataset

Method Overview

Update

📎 Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages