Pull requests: ml-explore/mlx-lm

Pull requests list

Auto-discover tool-call markers from tokenizer config fields
#1163 opened Apr 18, 2026 by michaelstingl
6 tasks done
Add reasoning → tool state machine transition
#1160 opened Apr 16, 2026 by christiangenco
Fix Gemma 4 KV-shared layers creating unused projections
#1158 opened Apr 15, 2026 by glyphVault
5 tasks done
feature: dynamic quantized model support
#1155 opened Apr 15, 2026 by dsrenesanse
Feat/mamba mlx kernels
#1153 opened Apr 15, 2026 by Gal-bloch
9 of 11 tasks
feat: Add KL Divergence command
#1146 opened Apr 13, 2026 by spicyneuron Contributor
[Fix] Add alias for served model
#1140 opened Apr 9, 2026 by austin362667
perf: reduce peak memory during model quantization
#1102 opened Apr 3, 2026 by matteocelani Contributor
5 tasks done
perf: reduce peak memory when loading AutoAWQ/GPTQ models
#1098 opened Apr 2, 2026 by matteocelani Contributor
4 tasks done
perf: reduce GPU sync frequency in GPTQ quantization
#1094 opened Apr 2, 2026 by matteocelani Contributor
3 tasks done
Thread local generation stream
#1090 opened Apr 2, 2026 by angeloskath Member
feat: add KV cache quantization args to server
#1073 opened Mar 30, 2026 by deceptech-packet-ninja
4 tasks done