Pull requests: vllm-project/llm-compressor
Pull requests list
[AWQ] explicitly include identity scales for duo_scaling=True grid search
bug
Something isn't working
ready
When a PR is ready for review
#2640
opened Apr 21, 2026 by
brian-dellabetta
Collaborator
Add SmoothQuant layer mappings for Cohere, DeepSeek V3, and Phi3
#2639
opened Apr 21, 2026 by
jayakumarpujar
Draft
2 of 3 tasks
[Example] Add Qwen3.6-35B-A3B W4A4 FP4 quantization example
documentation
Improvements or additions to documentation
enhancement
New feature or request
nvfp4
For any PR / issue related to NVFP4 support
qwen
For any PR / issue related to Qwen support
ready
When a PR is ready for review
#2638
opened Apr 21, 2026 by
dsikka
Collaborator
[AWQ] Per-output-slice grid search for fused q_proj (Qwen3.5 attn_output_gate)
awq
For any issue / PR related to AWQ support
enhancement
New feature or request
quality-failed
qwen
For any PR / issue related to Qwen support
Refactor
Code cleanup and/or improvements to existing features
transforms
Related to transforms-based modifiers like SpinQuant and Quip
[AWQ] Seed grid search with identity baseline + fail fast on non-finite loss
awq
For any issue / PR related to AWQ support
enhancement
New feature or request
Refactor
Code cleanup and/or improvements to existing features
#2635
opened Apr 21, 2026 by
juju812
2 of 3 tasks
[Deprecation] [Offload] [Tracing] Remove legacy offloading logic in tracing
Refactor
Code cleanup and/or improvements to existing features
tracing
Issues related to model tracing
#2633
opened Apr 20, 2026 by
kylesayrs
Collaborator
[Deprecation] Replace deprecated function usage
autoround
For any PR / issue related to autoround support
quality-failed
Refactor
Code cleanup and/or improvements to existing features
#2632
opened Apr 20, 2026 by
kylesayrs
Collaborator
add example of w8a8fp8 for qwen3.5
documentation
Improvements or additions to documentation
enhancement
New feature or request
fp8
For any issue / PR related to FP8 support
qwen
For any PR / issue related to Qwen support
#2631
opened Apr 20, 2026 by
zhangxin81
Adding test_group to lm-eval configs
enhancement
New feature or request
fp8
For any issue / PR related to FP8 support
nvfp4
For any PR / issue related to NVFP4 support
w4a16
#2623
opened Apr 16, 2026 by
debroy-rh
Defer weight qparams to epoch end, unify calibration lifecycle
#2621
opened Apr 15, 2026 by
HDCharles
Collaborator
2 of 5 tasks
test gptq issue [not for land]
enhancement
New feature or request
gptq
For any PR / issue related to GPTQ support
nvfp4
For any PR / issue related to NVFP4 support
quality-failed
#2617
opened Apr 14, 2026 by
HDCharles
Collaborator
Add actorder support for GPTQ block quantization
enhancement
New feature or request
fp8
For any issue / PR related to FP8 support
gptq
For any PR / issue related to GPTQ support
ready
When a PR is ready for review
Refactor
Code cleanup and/or improvements to existing features
#2616
opened Apr 14, 2026 by
rk119
[Tests] Add transformers v5 modeling tests and clean up import guards
qwen
For any PR / issue related to Qwen support
Refactor
Code cleanup and/or improvements to existing features
#2614
opened Apr 13, 2026 by
dsikka
Collaborator
[not for land] DDP regression tests
awq
For any issue / PR related to AWQ support
documentation
Improvements or additions to documentation
enhancement
New feature or request
llama
For any PR / issue related to Llama herd support
quality-failed
qwen
For any PR / issue related to Qwen support
#2613
opened Apr 13, 2026 by
HDCharles
Collaborator
4 tasks done
refactor: modernize observers module with Python 3.10+ type hints
Refactor
Code cleanup and/or improvements to existing features
#2607
opened Apr 12, 2026 by
elwhyjay
Contributor
3 tasks done
[oneshot] clean offload_dir during post-processing
#2605
opened Apr 10, 2026 by
brian-dellabetta
Collaborator
Draft
3 tasks
[docs] deepseek v3.2 docs
documentation
Improvements or additions to documentation
ready
When a PR is ready for review
#2602
opened Apr 10, 2026 by
brian-dellabetta
Collaborator
fix: correct TOKENIZERS_PARALLELISM_ENV constant value
ready
When a PR is ready for review
#2596
opened Apr 10, 2026 by
kuishou68
[Refactor] Refactor splits to only use the "calibration" split (#2551)
ready
When a PR is ready for review
Refactor
Code cleanup and/or improvements to existing features
#2589
opened Apr 8, 2026 by
arpitkh101
[save_pretrained] UX improvement for save_compressed=False
needs-rebase
#2588
opened Apr 8, 2026 by
brian-dellabetta
Collaborator
1 task
[Refactor] Consolidate Intermediate Offloading
needs-rebase
#2583
opened Apr 8, 2026 by
menogrey
Contributor
[AWQ] [gemma3] remove input layernorm mapping
#2571
opened Apr 6, 2026 by
brian-dellabetta
Collaborator
1 task