Skip to content

Cache JIT-compiled CUDA kernels#3587

Open
zcbenz wants to merge 1 commit into
mainfrom
test/compile-cache
Open

Cache JIT-compiled CUDA kernels#3587
zcbenz wants to merge 1 commit into
mainfrom
test/compile-cache

Conversation

@zcbenz
Copy link
Copy Markdown
Collaborator

@zcbenz zcbenz commented May 24, 2026

This reduces test run time to 7m, it is a must when we start JIT-compiling CUTLASS/CuTe kernels which would spend over an hour on running tests in CI.

The cache key is the hash of all files under mlx/backend/cuda, which is not 100% bullet-proof but should be robust enough to dodge most cache invalidation problems.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant