Releases: PrismML-Eng/llama.cpp
Releases · PrismML-Eng/llama.cpp
prism-b8846-d104cf1
Pre-built binaries (PrismML fork with Q1_0 1-bit quantization support).
macOS/iOS:
- macOS Apple Silicon (arm64)
- macOS Apple Silicon (arm64, KleidiAI enabled)
- macOS Intel (x64)
- iOS XCFramework
Linux (CPU):
Linux (CUDA):
Windows (CPU):
Linux (Vulkan):
Linux (AMD):
Windows (CUDA):
prism-b8796-e2d6742
Pre-built binaries (PrismML fork with Q1_0 1-bit quantization support).
macOS/iOS:
- macOS Apple Silicon (arm64)
- macOS Apple Silicon (arm64, KleidiAI enabled)
- macOS Intel (x64)
- iOS XCFramework
Linux (CPU):
Linux (CUDA):
Windows (CPU):
Linux (Vulkan):
Linux (AMD):
Windows (CUDA):
prism-b8201-ba7e817
Pre-built binaries (PrismML fork with Q1_0 1-bit quantization support).
macOS:
Linux:
Linux (AMD):
Windows:
prism-b8196-f5dda72
Pre-built binaries (PrismML fork with Q1_0 1-bit quantization support).
macOS:
Linux:
Windows:
prism-b8194-1179bfc
Pre-built binaries (PrismML fork with Q1_0 1-bit quantization support).
macOS:
Linux:
Windows: