vulkan: for small AMD GPUs, reduce submission threshold based on CU count #25240

Open
0cc4m wants to merge 1 commit into master from 0cc4m/vulkan-submission-threshold-flops-cu-count

0cc4m commented Jul 2, 2026

Contributor

Overview

There are still driver timeouts on Vega 8 iGPUs with Stable Diffusion workloads, so this heuristic lowers the submission FLOPs thresholds even further, based on CU count and architecture. @wbruna, please check whether this works for you.
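The diff itself is not shown in this conversation, so the following is only a minimal sketch of what a CU-count-based threshold heuristic could look like, not the code in this PR. Every name and constant here (`GpuArch`, `kSmallGpuCuCount`, `submission_flops_threshold`, the halving for the older architecture) is a hypothetical assumption chosen for illustration.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>

// Hypothetical architecture tags; NOT the backend's real enum.
enum class GpuArch { AmdGcn, AmdRdna, Other };

// Hypothetical cutoff: AMD GPUs with fewer CUs than this count as "small".
constexpr uint32_t kSmallGpuCuCount = 16;

// Returns the FLOPs budget to accumulate before submitting work to the GPU.
// Assumption: with fewer compute units the same amount of work runs longer,
// so the budget is scaled down with the CU count to keep one submission
// short enough to stay under the driver's timeout.
uint64_t submission_flops_threshold(uint64_t base_flops, GpuArch arch, uint32_t cu_count) {
    assert(base_flops > 0);

    const bool is_amd = arch == GpuArch::AmdGcn || arch == GpuArch::AmdRdna;
    if (!is_amd || cu_count == 0 || cu_count >= kSmallGpuCuCount) {
        // Not a small AMD GPU, or CU count unknown: keep the base threshold.
        return base_flops;
    }

    // Divide first so the multiplication cannot overflow for large budgets.
    uint64_t scaled = base_flops / kSmallGpuCuCount * cu_count;

    if (arch == GpuArch::AmdGcn) {
        // Assumed extra reduction for the older architecture (Vega is GCN).
        scaled /= 2;
    }

    // Never return zero, which would force a submission after every node.
    return std::max<uint64_t>(scaled, 1);
}
```

Under these assumed constants, a base budget of 1,600,000,000 FLOPs on an 8-CU GCN part would shrink to 400,000,000, while a 32-CU GPU or a non-AMD device would keep the base value. The real thresholds and scaling rule are whatever the commit defines.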

Requirements

0cc4m requested a review from a team as a code owner on July 2, 2026, 12:42
github-actions bot added the Vulkan (Issues specific to the Vulkan backend) and ggml (changes relating to the ggml tensor library for machine learning) labels on Jul 2, 2026

wbruna commented Jul 2, 2026

Contributor

Seems to be working fine with my 3400G.

Labels

ggml: changes relating to the ggml tensor library for machine learning
Vulkan: Issues specific to the Vulkan backend

2 participants