Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[None][feat] Merge back DeepSeek V4 cache manager updates
#15633 opened Jun 25, 2026 by jiaganc Collaborator Draft
1 task done
[TRTLLM-12950][perf] DSv4 follow-up: DeepGEMM and MegaMoE
#15632 opened Jun 25, 2026 by lfr-0531 Collaborator Loading…
1 task done
[TRTLLM-12622][feat] Reland: Add native post-processing hook to trtllm-serve api-compatible Accepted LLM API contract change that is backwards-compatible
#15631 opened Jun 25, 2026 by xwang233 Collaborator Loading…
1 task done
[None][infra] AutoDeploy: Add trtllm runner for standalone llm-c
#15630 opened Jun 25, 2026 by bmarimuthu-nv Collaborator Loading…
1 task done
[TRTLLM-12982][chore] improve multi-item scoring request validation
#15627 opened Jun 25, 2026 by ixlmar Collaborator Loading…
1 task done
[None][perf] DSv4 follow-up: autotuner updates
#15626 opened Jun 25, 2026 by lfr-0531 Collaborator Loading…
1 task done
[None][perf] DSv4 follow-up: disagg routing improvements
#15625 opened Jun 25, 2026 by lfr-0531 Collaborator Loading…
1 task done
[TRTLLM-13629][test] Optimize MoE CI test-db
#15624 opened Jun 25, 2026 by xxi-nv Collaborator Loading…
[#15565][fix] AutoDeploy: Fix Super MTP IMA introduced by checkpointing replay
#15622 opened Jun 25, 2026 by galagam Collaborator Loading…
1 task done
[https://nvbugs/6242591][fix] Fix bugs in Beam Search kernels
#15621 opened Jun 25, 2026 by wili-65535 Collaborator Draft
1 task done
[None][feat] Disaggregated KV-cache bounce transfer
#15618 opened Jun 25, 2026 by Shixiaowei02 Collaborator Loading…
1 task done
[None][test] Add Kimi-K2.5 disaggregated GSM8K accuracy test
#15617 opened Jun 25, 2026 by Shixiaowei02 Collaborator Loading…
1 task done
[None][infra] Test cw-dfw cluster
#15616 opened Jun 25, 2026 by yiqingy0 Collaborator Draft
1 task
[None][infra] take test durations into account to determine cbts splits num
#15614 opened Jun 25, 2026 by crazydemo Collaborator Loading…
1 task done
[None][chore] Update .gitattributes
#15606 opened Jun 25, 2026 by ziyixiong-nv Collaborator Loading…
1 task
[None][fix] Align GPTOSS router tokenization and disagg draft scheduling
#15605 opened Jun 24, 2026 by SimengLiu-nv Collaborator Loading…
1 task done
[None][infra] add error hints to PR title check
#15604 opened Jun 24, 2026 by tburt-nv Collaborator Loading…
1 task done
[None][feat] VisualGen: enable CUDA graph capture with torch.compile
#15603 opened Jun 24, 2026 by chang-l Collaborator Loading…
6 tasks done
[None][infra] Add support to run NGC container scanning in pre-merge
#15602 opened Jun 24, 2026 by yuanjingx87 Collaborator Loading…
1 task
ProTip! Follow long discussions with comments:>50.