-
Notifications
You must be signed in to change notification settings - Fork 2.3k
Pull requests: NVIDIA/TensorRT-LLM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
[TRTLLM-12228][fix] ADP dummy type mismatch in disagg CTX server
#13460
opened Apr 25, 2026 by
Shixiaowei02
Collaborator
•
Draft
1 task done
[None][fix] Deduce default
max_tokens for LLMAPI SamplingParams
#13455
opened Apr 25, 2026 by
indrajit96
Collaborator
Loading…
1 task done
[None][feat] Use a replay method for state rollback in Mamba-2 speculative decoding
#13453
opened Apr 24, 2026 by
hnover-nv
Collaborator
Loading…
1 task done
[TRTLLM-11285][perf] Force enable TF32 tensor cores for DSA indexer fused GEMM
#13452
opened Apr 24, 2026 by
peihu-nv
Collaborator
Loading…
1 task done
[None][fix] Avoid unfinished_test race condition in Pytest hooks
#13451
opened Apr 24, 2026 by
tburt-nv
Collaborator
Loading…
1 task done
[https://nvbugs/6110326][fix] Guard output_dir None in both test_list_parsers, add tempfile fallback in PerfSa
#13450
opened Apr 24, 2026 by
tensorrt-cicd
Collaborator
Loading…
2 tasks done
Add Qwen image support
Community want to contribute
PRs initiated from Community
VisualGen
#13449
opened Apr 24, 2026 by
pst2154
Loading…
[None][feat] Assert attention DP disabled when KV connector is in use
#13448
opened Apr 24, 2026 by
jthomson04
Collaborator
Loading…
2 tasks done
[https://nvbugs/6106174][fix] Add @pytest.mark.skip_less_device_memory(80000) to the 5 affected VSWA tests to
#13445
opened Apr 24, 2026 by
tensorrt-cicd
Collaborator
Loading…
2 tasks done
[None][feat] Add Llama sequence classification / reward-model support
Community want to contribute
PRs initiated from Community
#13444
opened Apr 24, 2026 by
pst2154
Loading…
1 task done
[None][feat] WIP AutoDeploy: Add DeepSeek v4 support
#13443
opened Apr 24, 2026 by
bmarimuthu-nv
Collaborator
Loading…
1 task
[None][feat] AutoDeploy: onboard DeepSeek V4 (Flash-Base, Pro) + KV-cache decode
#13442
opened Apr 24, 2026 by
suyoggupta
Collaborator
Loading…
8 tasks done
[https://nvbugs/6111076][fix] ulysses+sage
#13440
opened Apr 24, 2026 by
xrq-phys
Collaborator
Loading…
1 task done
[Draft] Harden disagg transceiver request lifetime
Community want to contribute
PRs initiated from Community
[https://nvbugs/6111076][fix] Pass
attention_metadata_state={"metadata": None, "capacity": (0, 0)} to both `
#13436
opened Apr 24, 2026 by
tensorrt-cicd
Collaborator
Loading…
2 tasks done
[None][feat] Add support for DeepSeek-V4 in PyTorch backend
Community want to contribute
PRs initiated from Community
#13434
opened Apr 24, 2026 by
weikuo0506
Loading…
[None][perf] Extend customMoeRouting kernel to support Qwen3.5
#13433
opened Apr 24, 2026 by
nv-guomingz
Collaborator
Loading…
1 task done
[TRTLLM-12092][infra] Add PR Base Freshness Check Action
#13430
opened Apr 24, 2026 by
crazydemo
Collaborator
Loading…
1 task done
[None][feat] Add FlashInfer MLA attention backend support
#13428
opened Apr 24, 2026 by
Tracin
Collaborator
Loading…
1 task
[None][perf] EAGLE3 dynamic tree kernel optimizations
#13426
opened Apr 24, 2026 by
sunnyqgg
Collaborator
Loading…
[TRTLLM-12128][feat] enable SageAttention for Wan/FLUX
#13425
opened Apr 24, 2026 by
o-stoner
Collaborator
Loading…
1 task done
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.