-
Notifications
You must be signed in to change notification settings - Fork 700
Pull requests: InternLM/lmdeploy
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix: gate multimodal preprocessing concurrency
#4687
opened Jun 17, 2026 by
CUHKSZzxy
Collaborator
Loading…
[Bugfix] Fix double-counted max_q_seqlen in decode delta kv_seqlens
#4685
opened Jun 16, 2026 by
waynehacking8
Loading…
[Bugfix] Fix InternVL/InternVL3 LoRA loading TypeError in adapter fallback
#4684
opened Jun 16, 2026 by
waynehacking8
Loading…
fix: parse multimodal tool messages
Bug:P1
#4680
opened Jun 16, 2026 by
CUHKSZzxy
Collaborator
Loading…
Add /get_ppl endpoint
enhancement
New feature or request
#4679
opened Jun 15, 2026 by
irexyc
Collaborator
Loading…
Add usage.prompt_tokens_details.cached_tokens for prefix caching
improvement
#4670
opened Jun 10, 2026 by
lvhan028
Collaborator
Loading…
refactor(proxy): split monolithic proxy into modular serve/proxy package
improvement
#4647
opened Jun 4, 2026 by
lvhan028
Collaborator
Loading…
Interleave long-context prefill chunks with decode
#4631
opened May 28, 2026 by
grimoire
Collaborator
Loading…
1 task done
modify save model in lite module
improvement
#4624
opened May 26, 2026 by
43758726
Contributor
Loading…
feat(turbomind): support priority schedule policy
#4614
opened May 22, 2026 by
4mengy
Loading…
3 of 4 tasks
perf: optimize guided decoding with xgrammar upgrade, batched API, and async D2H overlap
#4605
opened May 21, 2026 by
windreamer
Collaborator
Loading…
1 of 4 tasks
[WIP]: Support reuse routed experts on eviction
#4599
opened May 19, 2026 by
RunningLeon
Collaborator
Loading…
docs(advance): add Add a New Speculative Decoding Method guide
documentation
Improvements or additions to documentation
#4589
opened May 17, 2026 by
SuperMarioYL
Loading…
4 tasks done
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.