feat(moe): add MoE inference and expert parallel support #444

Open
qinyiqun wants to merge 3 commits into InfiniTensor:main from qinyiqun:moe

Fix DeepSeek MLA dense prefill attention

83102c6