Skip to content

Pull requests: InternLM/lmdeploy

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[WIP] Support long-context and MTP prefix-cache hits
#4688 opened Jun 17, 2026 by grimoire Collaborator Draft
fix: gate multimodal preprocessing concurrency
#4687 opened Jun 17, 2026 by CUHKSZzxy Collaborator Loading…
[WIP]: Remove dlblas from lmdeploy
#4682 opened Jun 16, 2026 by RunningLeon Collaborator Loading…
bump version to v0.14.0
#4681 opened Jun 16, 2026 by lvhan028 Collaborator Loading…
fix: parse multimodal tool messages Bug:P1
#4680 opened Jun 16, 2026 by CUHKSZzxy Collaborator Loading…
Add /get_ppl endpoint enhancement New feature or request
#4679 opened Jun 15, 2026 by irexyc Collaborator Loading…
Pading one more block for fa3 prefill
#4674 opened Jun 11, 2026 by RunningLeon Collaborator Draft
Batch invariant support PART1
#4666 opened Jun 10, 2026 by grimoire Collaborator Draft
refactor: unify interleaved MRoPE rotary embedding
#4644 opened Jun 3, 2026 by CUHKSZzxy Collaborator Draft
Add multimodal and preemption metrics
#4640 opened Jun 1, 2026 by CUHKSZzxy Collaborator Loading…
TEST: Improve tool test
#4632 opened May 28, 2026 by littlegy Contributor Loading…
Interleave long-context prefill chunks with decode
#4631 opened May 28, 2026 by grimoire Collaborator Loading…
1 task done
modify save model in lite module improvement
#4624 opened May 26, 2026 by 43758726 Contributor Loading…
feat(turbomind): support priority schedule policy
#4614 opened May 22, 2026 by 4mengy Loading…
3 of 4 tasks
perf: optimize guided decoding with xgrammar upgrade, batched API, and async D2H overlap
#4605 opened May 21, 2026 by windreamer Collaborator Loading…
1 of 4 tasks
Intern s2 preview lite awq fix bug
#4600 opened May 19, 2026 by 43758726 Contributor Loading…
[WIP]: Support reuse routed experts on eviction
#4599 opened May 19, 2026 by RunningLeon Collaborator Loading…
update anthropic endpoint test
#4594 opened May 18, 2026 by littlegy Contributor Loading…
docs(advance): add Add a New Speculative Decoding Method guide documentation Improvements or additions to documentation
#4589 opened May 17, 2026 by SuperMarioYL Loading…
4 tasks done
refactor ascend multinode
#4588 opened May 15, 2026 by yao-fengchen Collaborator Draft
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.