Pull requests: InfiniTensor/InfiniLM
feat: support chunkprefill and prefill cuda graph
#371
opened May 12, 2026 by
Simon12345777
48 tasks
refactor: use processor in infer backup
#368
opened May 12, 2026 by
wooway777
Collaborator
29 of 48 tasks
refactor: unify linear/quantization architecture and remove deprecate…
#366
opened May 12, 2026 by
qinyiqun
Contributor
38 of 48 tasks
feat: add prefix hashing for mm_data
#364
opened May 11, 2026 by
PanZezhong1725
Collaborator
1 of 48 tasks
refactor: inline InfiniCore into InfiniLM, switch xmake -> CMake
#324
opened Apr 25, 2026 by
zhangyue207
Collaborator
Draft
6 tasks
issue/296 - feat: add Worker and ModelRunner for PD disaggregation
#304
opened Apr 15, 2026 by
spike-zhu
Collaborator
enable FA and fix total_kv_lengths check in infer_engine.
#256
opened Mar 6, 2026 by
gongchensu
Collaborator
Issue/224: add warmup before InfiniLM generation, and use muDNN silu_and_mul to replace elementwise swiglu in moore gpu
#225
opened Feb 11, 2026 by
spike-zhu
Collaborator
feat: add operator fusion support with dynamic scheduling
#212
opened Jan 30, 2026 by
hootandy321
[Fall 2025][2-2-3 MLA module] feat: implement Absorb mode optimization
#182
opened Jan 12, 2026 by
goog00
Issue/136 - use the add_rms_norm fused operator
enhancement
#162
opened Dec 25, 2025 by
gongchensu
Collaborator