llama.cpp b8665 with CUDA Support
Pre-built llama.cpp binaries with CUDA support, packaged as one tarball per supported CUDA version.
Source: https://github.com/ggml-org/llama.cpp/releases/tag/b8665
Commit: b8635075ffe27b135c49afb9a8b5c434bd42c502
CUDA Versions
- CUDA 12.8 - Architectures: 7.5, 8.0, 8.6, 8.9, 9.0, 10.0, 12.0
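To pick the right tarball, check which CUDA version your driver supports. A minimal check, assuming nvidia-smi is on your PATH:

# The "CUDA Version" field in the header line is the highest CUDA runtime the driver supports
nvidia-smi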
Architecture Reference
- 7.5: Tesla T4, RTX 20xx series, Quadro RTX
- 8.0: A100
- 8.6: RTX 30xx series
- 8.9: RTX 40xx series, L4, L40
- 9.0: H100, H200
- 10.0: B200
- 12.0: RTX 50xx series, RTX PRO series
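To map a machine onto this table, you can query the GPU's compute capability directly. A hedged example, assuming a driver recent enough to expose the compute_cap query field:

# Prints e.g. "NVIDIA L4, 8.9"
nvidia-smi --query-gpu=name,compute_cap --format=csv,noheader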
Usage
Download the tarball matching your CUDA version, extract it, and check that the binary runs:
# Unpack the CUDA 12.8 build (substitute your CUDA version)
tar -xzf llama.cpp-b8665-cuda-12.8.tar.gz
# Confirm the binary starts and lists its options
./llama-cli --help
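For a first real run, the sketch below loads a GGUF model and offloads all layers to the GPU. The model path and prompt are placeholders, and the LD_LIBRARY_PATH line is only needed if the tarball ships libggml/libllama as shared libraries next to the binaries:

# Make any bundled .so files findable (skip if the build is statically linked)
export LD_LIBRARY_PATH="$PWD:$LD_LIBRARY_PATH"
# -m is the model path, -p the prompt; -ngl 99 offloads all layers to the GPU
./llama-cli -m ./model.gguf -p "Hello" -ngl 99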