Commit bc35df3
[xla:cpu] Optimize ThunkExecutor::Execute part #1
| name | old cpu/op | new cpu/op | delta |
|---|---|---|---|
| BM_SelectAndScatterF32/128/process_time | 889µs ± 1% | 740µs ± 3% | -16.70% |
| BM_SelectAndScatterF32/256/process_time | 3.64ms ± 2% | 3.00ms ± 1% | -17.64% |
| BM_SelectAndScatterF32/512/process_time | 15.3ms ± 1% | 13.1ms ± 3% | -14.61% |
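
As a rough sanity check on the delta column, the sketch below recomputes the relative change from the rounded means shown above. It assumes delta is `(new - old) / old`, which is the usual convention for this style of benchmark comparison but is not stated in the commit. The helper name `relative_delta` is hypothetical. The recomputed values land within about a quarter of a percentage point of the reported ones; the small gap is presumably because the reported deltas were computed from unrounded measurements.

```python
def relative_delta(old: float, new: float) -> float:
    """Percent change from old to new; negative means the new run is faster."""
    return (new - old) / old * 100.0


# (old, new) mean cpu/op in microseconds, copied from the rounded table values.
rows = {
    "BM_SelectAndScatterF32/128": (889.0, 740.0),
    "BM_SelectAndScatterF32/256": (3640.0, 3000.0),
    "BM_SelectAndScatterF32/512": (15300.0, 13100.0),
}

# Assumed convention: delta = (new - old) / old, expressed as a percentage.
deltas = {name: relative_delta(old, new) for name, (old, new) in rows.items()}

for name, delta in deltas.items():
    print(f"{name}: {delta:+.2f}%")
```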
PiperOrigin-RevId: 658063846
1 parent 2556f9f commit bc35df3
1 file changed
Lines changed: 6 additions & 0 deletions
Diff content not captured. Six lines were added at new-file lines 165-170, between original lines 164 and 165; the surrounding context lines (original 162-167) are unchanged.