vulkan: fix 32-bit integer overflow in CEIL_DIV by hokanosekai · Pull Request #25245 · ggml-org/llama.cpp

hokanosekai · 2026-07-02T17:23:47Z

Overview

Mobile Vulkan drivers (Mali, Adreno, CIX) report maxComputeWorkGroupCount = UINT32_MAX. With an operand that large, CEIL_DIV's numerator (M + N - 1) wraps in 32-bit arithmetic and the macro evaluates to 0. As a result, ggml_vk_matmul requests 0 descriptor sets for batched matmuls while still dispatching one, which trips GGML_ASSERT(descriptor_set_idx < descriptor_sets.size()) at model load.
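The wrap described above can be reproduced in isolation. The following is a minimal sketch, not code from the repository: the macro name CEIL_DIV_ADDITIVE and the constants are hypothetical, and the macro body is assumed to be the usual additive ceiling-division form implied by the "(M + N - 1)" numerator. It relies only on the fact that unsigned 32-bit arithmetic wraps modulo 2^32.

```cpp
#include <cstdint>
#include <limits>

// Assumed additive form: the numerator (M + N - 1) is evaluated in the
// operands' type, so with uint32_t operands it can wrap modulo 2^32.
// (Hypothetical name, for illustration only.)
#define CEIL_DIV_ADDITIVE(M, N) (((M) + (N) - 1) / (N))

// Value the overview says mobile drivers report for maxComputeWorkGroupCount.
constexpr uint32_t kMaxCount = std::numeric_limits<uint32_t>::max();

// Illustrative operands (assumptions, not values taken from the issue).
constexpr uint32_t kSmallM = 5;
constexpr uint32_t kTen    = 10;
constexpr uint32_t kFour   = 4;

// With small operands the additive form is correct: ceil(10 / 4) == 3.
static_assert(CEIL_DIV_ADDITIVE(kTen, kFour) == 3, "small operands are fine");

// With N == UINT32_MAX the numerator wraps:
//   5 + 4294967295 wraps to 4, then 4 - 1 == 3, and 3 / 4294967295 == 0.
// The mathematically correct result is ceil(5 / 4294967295) == 1.
static_assert(CEIL_DIV_ADDITIVE(kSmallM, kMaxCount) == 0, "wrapped to zero");
```

A result of 0 where 1 is expected matches the symptom in the overview: a count of zero is requested while one dispatch is still issued.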

This rewrites the macro with division-based math that has no overflowing intermediate, as suggested by @jeffbolznv in the issue.
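This page does not show the macro text that was merged, so the following is only a sketch of what a division-based ceiling division can look like. The names CEIL_DIV_DIVISION and ceil_div_promoted are hypothetical, and the 64-bit function is an assumed rendering of the "64-bit promotion variant" mentioned in the testing note, included for comparison.

```cpp
#include <cstdint>
#include <limits>

// Hypothetical division-based form: the quotient, plus one if there is a
// remainder. No intermediate value exceeds M, so nothing can wrap.
// Note: M and N are each evaluated twice, and N must be non-zero.
#define CEIL_DIV_DIVISION(M, N) ((M) / (N) + (((M) % (N)) != 0 ? 1 : 0))

// Assumed shape of a 64-bit promotion alternative: widen before adding so the
// numerator cannot wrap, then narrow the result (which always fits in 32 bits).
constexpr uint32_t ceil_div_promoted(uint32_t m, uint32_t n) {
    return static_cast<uint32_t>((static_cast<uint64_t>(m) + n - 1) / n);
}

constexpr uint32_t kMax = std::numeric_limits<uint32_t>::max();
constexpr uint32_t kFive = 5, kTen = 10, kEight = 8, kFour = 4, kOne = 1;

// Ordinary cases: ceil(10 / 4) == 3, ceil(8 / 4) == 2.
static_assert(CEIL_DIV_DIVISION(kTen, kFour) == 3, "rounds up");
static_assert(CEIL_DIV_DIVISION(kEight, kFour) == 2, "exact division");

// The case that wraps in the additive form: ceil(5 / UINT32_MAX) == 1.
static_assert(CEIL_DIV_DIVISION(kFive, kMax) == 1, "no wrap with huge divisor");

// Largest possible result: ceil(UINT32_MAX / 1) == UINT32_MAX.
static_assert(CEIL_DIV_DIVISION(kMax, kOne) == kMax, "no wrap with huge dividend");

// Both variants agree on these inputs.
static_assert(ceil_div_promoted(kFive, kMax) == CEIL_DIV_DIVISION(kFive, kMax), "");
static_assert(ceil_div_promoted(kMax, kOne) == CEIL_DIV_DIVISION(kMax, kOne), "");
```

The division-based form stays entirely in 32-bit arithmetic, at the cost of a remainder operation and of evaluating each macro argument twice, so arguments with side effects should be avoided.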

Tested on a Mali-G68 (MediaTek Dimensity 900): Gemma 3n E2B Q4_K_M with full Vulkan offload went from a 100% reproducible abort at warmup to generating normally, with the same throughput as a 64-bit promotion variant.

Additional information

Full root cause analysis and instrumentation logs are in #23057.

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure: YES, I used an AI assistant to help debug and instrument the descriptor set accounting on my device, which located the overflow (details in #23057, "Vulkan: GGML_ASSERT(descriptor_set_idx < descriptor_sets.size()) crash on ARM UMA (Mali-G720-Immortalis, CIX CP8180)"). The fix itself is the one-line macro rewrite suggested by @jeffbolznv; I applied it and validated it on my hardware. I can retest on the affected device if follow-up is needed.

0cc4m · 2026-07-03T12:24:40Z

Thank you!

vulkan: fix 32-bit integer overflow in CEIL_DIV

56a10ef

hokanosekai requested a review from a team as a code owner July 2, 2026 17:23

github-actions bot added the Vulkan (issues specific to the Vulkan backend) and ggml (changes relating to the ggml tensor library for machine learning) labels Jul 2, 2026

jeffbolznv approved these changes Jul 2, 2026

0cc4m approved these changes Jul 3, 2026

hokanosekai wants to merge 1 commit into ggml-org:master from hokanosekai:fix/vulkan-ceil-div-overflow
