[ET-VK][qlinear] Add bias support to q4gsw and dq8ca_q4gsw quantized linear ops by SS-JIA · Pull Request #18061 · pytorch/executorch

SS-JIA · 2026-03-10T17:01:35Z

Stack from ghstack (oldest at bottom):

Wire bias through the q4gsw and dq8ca_q4gsw quantized linear operators.
Add add_bias_to_out_tile() helper in the output tile computation header and call
it from all three shader variants (tiled, coop, dq8ca_tiled). Remove the bias
guard in the pattern matcher to allow biased linear layers.
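
To illustrate what the bias step does, here is a minimal Python sketch of adding a per-output-channel bias to one output tile. It is not the PR's shader code: the real helper lives in a shader header, its signature is not shown in this description, and the argument names, the tile layout (rows by output columns), and the `linear_tile` wrapper are all assumptions for illustration. Weights are assumed to be already dequantized.

```python
from typing import List, Optional

Tile = List[List[float]]


def add_bias_to_out_tile(out_tile: Tile, bias: List[float], col_start: int) -> None:
    """Add bias[col_start + j] to column j of every row in the tile, in place.

    Hypothetical Python stand-in for the shader helper named in the PR.
    """
    for row in out_tile:
        for j in range(len(row)):
            col = col_start + j
            # Skip columns that fall past the end of the bias vector
            # (e.g. a partially filled tile at the edge of the output).
            if col < len(bias):
                row[j] += bias[col]


def linear_tile(
    x_rows: Tile,
    w_cols: Tile,
    bias: Optional[List[float]],
    col_start: int,
) -> Tile:
    """Compute one output tile of x @ W^T, then optionally add the bias.

    x_rows: input rows, each of length K.
    w_cols: dequantized weight vectors, one per output column, each of length K.
    """
    out = [[sum(a * b for a, b in zip(x, w)) for w in w_cols] for x in x_rows]
    if bias is not None:
        add_bias_to_out_tile(out, bias, col_start)
    return out
```

In this sketch, a tile covering output columns 2 and 3 reads `bias[2]` and `bias[3]`, so each tile only touches the bias entries for the columns it owns.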

Differential Revision: D95970172

pytorch-bot · 2026-03-10T17:01:40Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18061

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 4 New Failures, 1 Pending, 1 Unrelated Failure

As of commit eb4c833 with merge base 8a285b7:

NEW FAILURES - The following jobs have failed:

- pull / unittest / linux / linux-job (gh)
  export/tests/test_target_recipes.py::TestTargetRecipes::test_mv2_model
- pull / unittest-arm-backend-with-no-deps (test_pytest_ops_tosa) / linux-job (gh)
  RuntimeError: Command docker exec -t a61611012b21b7aa8eb4086704084d2cbab161a974e3a09b208d12b02269a8e0 /exec failed with exit code 1
- Test CUDA Builds / test-model-cuda-e2e (mistralai, Voxtral-Mini-4B-Realtime-2602, quantized-int4-tile-packed) / linux-job (gh)
  Internal Server Error occurred while resolving "actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683". Internal Server Error occurred while resolving "actions/download-artifact@d3f86a106a0bac45b974a628896c90dbdf5c8093". Internal Server Error occurred while resolving "actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02". Internal Server Error occurred while resolving "pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1". Internal Server Error occurred while resolving "pytorch/pytorch/.github/actions/setup-rocm@main"
- Test CUDA Builds / test-model-cuda-e2e (nvidia, parakeet-tdt, quantized-int4-weight-only) / linux-job (gh)
  Internal Server Error occurred while resolving "nick-fields/retry@3e91a01664abd3c5cd539100d10d33b9c5b68482"

BROKEN TRUNK - The following job failed but was also failing on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

- pull / unittest-editable / macos / macos-job (gh) (trunk failure)
  export/tests/test_target_recipes.py::TestTargetRecipes::test_linear_model

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2026-03-10T17:02:23Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Mar 10, 2026

meta-codesync bot added fb-exported meta-exported labels Mar 10, 2026

ssjia added 3 commits March 11, 2026 09:52

SS-JIA wants to merge 4 commits into gh/SS-JIA/478/base from gh/SS-JIA/478/head
