Pull requests: NVIDIA/TensorRT-LLM
[None][perf] DSv4 follow-up: autotuner updates
#15626
opened Jun 25, 2026 by
lfr-0531
Collaborator
1 task done
[None][perf] DSv4 follow-up: disagg routing improvements
#15625
opened Jun 25, 2026 by
lfr-0531
Collaborator
1 task done
[TRTLLM-13629][test] Optimize MoE CI test-db
#15624
opened Jun 25, 2026 by
xxi-nv
Collaborator
[None][infra] Fix node list query failing on tcsh login nodes
#15623
opened Jun 25, 2026 by
yiqingy0
Collaborator
1 task
[#15565][fix] AutoDeploy: Fix Super MTP IMA introduced by checkpointing replay
#15622
opened Jun 25, 2026 by
galagam
Collaborator
1 task done
[https://nvbugs/6242591][fix] Fix bugs in Beam Search kernels
#15621
opened Jun 25, 2026 by
wili-65535
Collaborator
Draft
1 task done
[None][feat] Disaggregated KV-cache bounce transfer
#15618
opened Jun 25, 2026 by
Shixiaowei02
Collaborator
1 task done
[None][test] Add Kimi-K2.5 disaggregated GSM8K accuracy test
#15617
opened Jun 25, 2026 by
Shixiaowei02
Collaborator
1 task done
[TRTLLM-13613][test] Trim duplicated and dead multimodal accuracy tests from pre-merge CI
#15615
opened Jun 25, 2026 by
Wanli-Jiang
Collaborator
1 task done
[None][infra] take test durations into account to determine cbts splits num
#15614
opened Jun 25, 2026 by
crazydemo
Collaborator
1 task done
[TRTLLM-13409][feat] hard-exit on HangDetector fire + cross-rank propagation
#15612
opened Jun 25, 2026 by
JunyiXu-nv
Collaborator
Draft
1 task
[https://nvbugs/6368480][fix] Cache the SM count once in FmhaDispatcher's constructor and reuse the cached…
#15611
opened Jun 25, 2026 by
chenfeiz0326
Collaborator
2 tasks done
[None][feat] add in-process NeMo-Skills benchmarks and Nemotron-3-Super guards
#15608
opened Jun 25, 2026 by
Wanli-Jiang
Collaborator
Draft
1 task done
[https://nvbugs/6293536][fix] Stage v2 KV block offsets through fresh host buffers
#15607
opened Jun 25, 2026 by
thorjohnsen
Collaborator
Draft
[None][chore] Update .gitattributes
#15606
opened Jun 25, 2026 by
ziyixiong-nv
Collaborator
1 task
[None][fix] Align GPTOSS router tokenization and disagg draft scheduling
#15605
opened Jun 24, 2026 by
SimengLiu-nv
Collaborator
1 task done
[None][infra] add error hints to PR title check
#15604
opened Jun 24, 2026 by
tburt-nv
Collaborator
1 task done
[None][feat] VisualGen: enable CUDA graph capture with torch.compile
#15603
opened Jun 24, 2026 by
chang-l
Collaborator
6 tasks done
[None][infra] Add support to run NGC container scanning in pre-merge
#15602
opened Jun 24, 2026 by
yuanjingx87
Collaborator
1 task
[None][fix] Honor Qwen Image quant ignore list
#15599
opened Jun 24, 2026 by
pst2154
Contributor
[None][perf] Improve Qwen3-VL Preprocessing Perf
#15598
opened Jun 24, 2026 by
aswinvisva
Collaborator
1 task done
[https://nvbugs/6020038][feat] Add NCCL-EP v0.1 MoE communication support
#15597
opened Jun 24, 2026 by
nv-lschneider
Collaborator
Draft
1 task done