Fix k_norms dimension index for sequence length in quantized SDPA

fbd6385
Open

Add TurboQuant KV cache compression with native Metal SDPA kernel #3328

Workflow runs completed with no jobs