Skip to content

Pull requests: intel/auto-round

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix merge error of llmc bug
#1817 opened May 14, 2026 by n1ck-guo Contributor Loading…
4 tasks
Fix forwarding of ExtraConfig overrides
#1816 opened May 14, 2026 by dhruvil237 Loading…
2 of 4 tasks
0.13.0
Add XPU MoE decode kernel (FP16/BF16 + INT4 sym/asym)
#1813 opened May 14, 2026 by Copilot AI Draft
7 tasks done
add mimo-audio, Qwen-TTS model backbone quantization
#1810 opened May 13, 2026 by WeiweiZhang1 Contributor Loading…
4 tasks
0.13.0
add auto_round_rtn cli and remove fast
#1808 opened May 13, 2026 by n1ck-guo Contributor Loading…
4 tasks
0.13.0
add smooth K for sagev1
#1806 opened May 13, 2026 by luoyu-intel Contributor Loading…
support AR format FP8 in vLLM
#1798 opened May 11, 2026 by Zhenzhong1 Contributor Loading…
support quarot/spinquant rotation before quantization
#1797 opened May 11, 2026 by lkk12014402 Contributor Loading… 0.13.0
Refine device by vibecoding
#1790 opened May 8, 2026 by wenhuach21 Contributor Loading…
4 tasks
Reduce VRAM usage of quantizing VLM models
#1777 opened May 4, 2026 by lvliang-intel Contributor Loading…
1 of 4 tasks
0.13.0
Fix QDQ inference OOM issue.
#1763 opened Apr 29, 2026 by changwangss Loading…
Awq algorithm
#1749 opened Apr 28, 2026 by WeiweiZhang1 Contributor Loading…
3 of 4 tasks
0.13.0
fix qwen3.6 vllm infer bug ready only add when the PR is ready to merge
#1746 opened Apr 27, 2026 by n1ck-guo Contributor Loading…
4 tasks
0.13.0
Fix rotation
#1724 opened Apr 23, 2026 by wenhuach21 Contributor Loading…
2 of 9 tasks
feat: support Nemotron-H / Nemotron-Cascade-2 (#1711)
#1712 opened Apr 20, 2026 by michael-rabe Loading…
4 of 9 tasks
Continuously optimize AutoScheme RAM consumption
#1703 opened Apr 17, 2026 by lvliang-intel Contributor Loading…
2 of 9 tasks
chore: add shared agent config layout
#1700 opened Apr 17, 2026 by yiliu30 Contributor Loading…
Fix Qwen Omni quantization model issue for long form audio generation
#1698 opened Apr 17, 2026 by lvliang-intel Contributor Loading…
2 of 9 tasks
Feats: Quantize/save/evaluate the Wan-AI/WAN2.2 models in w4a16 format
#1678 opened Apr 14, 2026 by lvliang-intel Contributor Loading…
2 of 9 tasks
refine alg_ext code to better support torch compile
#1649 opened Apr 2, 2026 by wenhuach21 Contributor Loading…
2 of 9 tasks
0.13.0
ProTip! Filter pull requests by the default branch with base:main.