Cache JIT-compiled CUDA kernels by zcbenz · Pull Request #3587 · ml-explore/mlx

zcbenz · 2026-05-24T23:53:10Z

This reduces test run time to 7m, it is a must when we start JIT-compiling CUTLASS/CuTe kernels which would spend over an hour on running tests in CI.

The cache key is the hash of all files under mlx/backend/cuda, which is not 100% bullet-proof but should be robust enough to dodge most cache invalidation problems.

Cache JIT-compiled CUDA kernels

183667f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cache JIT-compiled CUDA kernels#3587

Cache JIT-compiled CUDA kernels#3587
zcbenz wants to merge 1 commit into
mainfrom
test/compile-cache

zcbenz commented May 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

zcbenz commented May 24, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant