Skip to content

Pull requests: NVIDIA/TensorRT-LLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[TRTLLM-12228][fix] ADP dummy type mismatch in disagg CTX server
#13460 opened Apr 25, 2026 by Shixiaowei02 Collaborator Draft
1 task done
[None][test] Waive 2 failed cases for main in QA CI
#13459 opened Apr 25, 2026 by xinhe-nv Collaborator Draft
[None][test] Waive 4 failed cases for main in QA CI
#13458 opened Apr 25, 2026 by xinhe-nv Collaborator Draft
[None][test] Waive 6 failed cases for main in QA CI
#13457 opened Apr 25, 2026 by xinhe-nv Collaborator Draft
[None][test] Waive 4 failed cases for main in QA CI
#13456 opened Apr 25, 2026 by xinhe-nv Collaborator Draft
[None][fix] Deduce default max_tokens for LLMAPI SamplingParams
#13455 opened Apr 25, 2026 by indrajit96 Collaborator Loading…
1 task done
[None][feat] Use a replay method for state rollback in Mamba-2 speculative decoding
#13453 opened Apr 24, 2026 by hnover-nv Collaborator Loading…
1 task done
[TRTLLM-11285][perf] Force enable TF32 tensor cores for DSA indexer fused GEMM
#13452 opened Apr 24, 2026 by peihu-nv Collaborator Loading…
1 task done
[None][fix] Avoid unfinished_test race condition in Pytest hooks
#13451 opened Apr 24, 2026 by tburt-nv Collaborator Loading…
1 task done
Add Qwen image support Community want to contribute PRs initiated from Community VisualGen
#13449 opened Apr 24, 2026 by pst2154 Loading…
[None][feat] Assert attention DP disabled when KV connector is in use
#13448 opened Apr 24, 2026 by jthomson04 Collaborator Loading…
2 tasks done
[None][feat] Add Llama sequence classification / reward-model support Community want to contribute PRs initiated from Community
#13444 opened Apr 24, 2026 by pst2154 Loading…
1 task done
[None][feat] WIP AutoDeploy: Add DeepSeek v4 support
#13443 opened Apr 24, 2026 by bmarimuthu-nv Collaborator Loading…
1 task
[None][feat] AutoDeploy: onboard DeepSeek V4 (Flash-Base, Pro) + KV-cache decode
#13442 opened Apr 24, 2026 by suyoggupta Collaborator Loading…
8 tasks done
[https://nvbugs/6111076][fix] ulysses+sage
#13440 opened Apr 24, 2026 by xrq-phys Collaborator Loading…
1 task done
[Draft] Harden disagg transceiver request lifetime Community want to contribute PRs initiated from Community
#13439 opened Apr 24, 2026 by yifjiang Contributor Draft
[None][perf] Extend customMoeRouting kernel to support Qwen3.5
#13433 opened Apr 24, 2026 by nv-guomingz Collaborator Loading…
1 task done
[TRTLLM-12092][infra] Add PR Base Freshness Check Action
#13430 opened Apr 24, 2026 by crazydemo Collaborator Loading…
1 task done
[None][feat] Add FlashInfer MLA attention backend support
#13428 opened Apr 24, 2026 by Tracin Collaborator Loading…
1 task
[None][perf] EAGLE3 dynamic tree kernel optimizations
#13426 opened Apr 24, 2026 by sunnyqgg Collaborator Loading…
[TRTLLM-12128][feat] enable SageAttention for Wan/FLUX
#13425 opened Apr 24, 2026 by o-stoner Collaborator Loading…
1 task done
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.