Commit fe9813b
Skip redundant decode FFN operators when swiglu_fused is active
When use_aie_ffn_swiglu_fused is True, the fused operator handles all
of gate+up+silu+mul+down in one design. Skip creating the 3 separate
decode GEMVs (aie_fc1_gemv, aie_fc2_gemv, aie_fc3_gemv) which would
waste compilation time and device memory.
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>1 parent 563812f commit fe9813b
2 files changed
Lines changed: 9 additions & 2 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
123 | 123 | | |
124 | 124 | | |
125 | 125 | | |
126 | | - | |
| 126 | + | |
| 127 | + | |
| 128 | + | |
| 129 | + | |
| 130 | + | |
| 131 | + | |
127 | 132 | | |
128 | 133 | | |
129 | 134 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
94 | 94 | | |
95 | 95 | | |
96 | 96 | | |
97 | | - | |
| 97 | + | |
| 98 | + | |
| 99 | + | |
98 | 100 | | |
99 | 101 | | |
100 | 102 | | |
| |||
0 commit comments