Pull requests: SemiAnalysisAI/InferenceX

Pull requests list

Update DSv4 B200 TRT image to 2dd03e6 (non-MTP + MTP) [label: full-sweep-enabled]
#1664 opened Jun 4, 2026 by Oseltamivir Collaborator
3 tasks
[WIP] Initial work to add llm-d-vllm framework with H200
#1660 opened Jun 4, 2026 by ezrasilvera Collaborator
Throwaway: conc-64 gsm8k eval for DEP8+MTP3 dispatch token bug [label: non-canary-full-sweep-enabled (Run the full sweep without the canary gate; full search space, no trim)]
#1659 opened Jun 3, 2026 by Oseltamivir Collaborator
[WIP] Update Dsv4 B300 configs [label: full-sweep-enabled]
#1656 opened Jun 3, 2026 by wzhao18 Collaborator Draft
[DNM][AMD] agentx-v0.4
#1654 opened Jun 3, 2026 by seungrokj Collaborator
[NV] Add GitHub Action to collect SPEED-Bench AL matrix
#1650 opened Jun 2, 2026 by qiching
3 tasks done
fix(power): classify zero-decode-GPU multinode runs as aggregated
#1646 opened Jun 2, 2026 by arygupt Collaborator
[WIP] agentX v0.4
#1640 opened Jun 2, 2026 by cquil11 Collaborator Draft
feat(power): vendor-agnostic GPU power/telemetry aggregation core
#1635 opened Jun 1, 2026 by arygupt Collaborator
2 of 3 tasks
Update new fixed-AR-MTP CI workflow for kimik2.5_int4, kimik2.5_fp4, …
#1633 opened Jun 1, 2026 by haic0 Collaborator