Model compression toolkit engineered for enhanced usability, comprehensiveness, and efficiency.
audio eagle quantization diffusion vlm llm qwen speculative-decoding llm-compression hunyuan deepseek fp4 dflash
-
Updated
Mar 16, 2026 - Python