Skip to content

Pull requests: InfiniTensor/InfiniLM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

feat: support chunkprefill and prefill cuda graph
#371 opened May 12, 2026 by Simon12345777 Loading…
48 tasks
issue/349 - Support GLM4 model
#370 opened May 12, 2026 by rubik-hua Loading…
refactor: use processor in infer backup
#368 opened May 12, 2026 by wooway777 Collaborator Loading…
29 of 48 tasks
refactor: unify linear/quantization architecture and remove deprecate…
#366 opened May 12, 2026 by qinyiqun Contributor Loading…
38 of 48 tasks
feat: add prefix hashing for mm_data
#364 opened May 11, 2026 by PanZezhong1725 Collaborator Loading…
1 of 48 tasks
issue/296 - feat: add Worker and ModelRunner for PD disaggregation
#304 opened Apr 15, 2026 by spike-zhu Collaborator Loading…
issue/294: minicpm-sala model
#295 opened Apr 8, 2026 by Ceng23333 Contributor Loading…
【比赛2025秋】T2-1-3
#268 opened Mar 16, 2026 by PanZezhong1725 Collaborator Draft
enable FA and fix total_kv_lengths check in infer_engine.
#256 opened Mar 6, 2026 by gongchensu Collaborator Loading…
Kv compression 0.2.0
#201 opened Jan 23, 2026 by Ringssss Loading…
LLaDA实现
#197 opened Jan 22, 2026 by MoringLotus Contributor Loading…
issue/143 支持计算图预先编译
#192 opened Jan 19, 2026 by PanZezhong1725 Collaborator Loading…
[2025秋季][T-2-1-1] https://github.com/zincjay
#185 opened Jan 12, 2026 by zincjay Loading…
[2025秋季][T2-2-1] lviy
#181 opened Jan 11, 2026 by lviy Loading…
[2025秋季][2-1-4] CearX
#179 opened Jan 11, 2026 by CearX Loading…
issue/175: 增加QY机器测试example/jiuge.py
#176 opened Jan 6, 2026 by xgqdut2016 Contributor Loading…
issue/155: 服务端支持repetition_penalty
#164 opened Dec 25, 2025 by Ceng23333 Contributor Loading…
Issue/136 - 使用add_rms_norm融合算子 enhancement New feature or request
#162 opened Dec 25, 2025 by gongchensu Collaborator Loading…
ProTip! Add no:assignee to see everything that’s not assigned.