Cortex-M backend: Refactor quantized_op_fusion_pass by AdrianLundell · Pull Request #20179 · pytorch/executorch

AdrianLundell · 2026-06-10T09:42:19Z

Move to use the AtenToCortexMPass instead.

cc @digantdesai @freddan80 @per @zingo @oscarandersson8218 @mansnils @Sebastian-Larsson @robell @rascani

Move to use the AtenToCortexMPass instead. Signed-off-by: Adrian Lundell <adrian.lundell@arm.com> Change-Id: I00020beb4248b24f344c47a1728c06925f1b7c7a

Signed-off-by: Adrian Lundell <adrian.lundell@arm.com> Change-Id: I9f94f018d95ec58a4a12022679ddd66340344fa0

pytorch-bot · 2026-06-10T09:42:23Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/20179

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 1 New Failure, 78 Pending, 4 Unrelated Failures

As of commit d107ddb with merge base 2b9e9bf ():

NEW FAILURE - The following job has failed:

pull / test-multimodal-linux (gemma3-4b) / linux-job (gh)
RuntimeError: Command docker exec -t f418c26acd9e63cd25a438a1c5d7535f406d1a6afd72682dcdb137ba7d75adb1 /exec failed with exit code 139

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

periodic / test-models-linux (cmake, mv3, portable, linux.2xlarge, 90) / linux-job (gh) (matched linux rule in flaky-rules.json)
The process '/usr/bin/git' failed with exit code 128
pull / test-arm-backend-no-driver (test_run_tosa) / linux-job (gh) (detected as infra flaky with no log or failing log classifier)
trunk / test-arm-backend-vkml (test_pytest_models_vkml) / linux-job (gh) (matched linux rule in flaky-rules.json)
The process '/usr/bin/git' failed with exit code 128
trunk / test-torchao-huggingface-checkpoints (phi_4_mini, linux.2xlarge, executorch-ubuntu-22.04-clang12,... / linux-job (gh) (matched linux rule in flaky-rules.json)
The process '/usr/bin/git' failed with exit code 128

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Copilot

Pull request overview

This PR refactors the Cortex-M backend pass pipeline by removing the standalone QuantizedOpFusionPass and moving its operator replacement/fusion responsibilities into AtenToCortexMPass, simplifying pass management and consolidation of Cortex-M dialect substitutions.

Changes:

Deleted quantized_op_fusion_pass.py and removed it from imports/build rules and the Cortex-M pass list.
Extended AtenToCortexMPass with additional dialect substitutions for quantized add/mul/softmax/maxpool/min/max/permute/pad (and quantize/dequantize per-tensor).
Updated BUCK targets and __init__.py exports to reflect the refactor.

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 5 comments.

Show a summary per file

File	Description
`backends/cortex_m/passes/quantized_op_fusion_pass.py`	Removed the old ExportPass-based fusion/replacement implementation.
`backends/cortex_m/passes/cortex_m_pass_manager.py`	Dropped `QuantizedOpFusionPass` from the pass list and imports.
`backends/cortex_m/passes/BUCK`	Removed the deleted file from build sources.
`backends/cortex_m/passes/aten_to_cortex_m_pass.py`	Added/ported substitution logic that previously lived in `QuantizedOpFusionPass`.
`backends/cortex_m/passes/__init__.py`	Removed export/import of `QuantizedOpFusionPass`.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

+    del dialect_pass
+    input_tensor = _get_input_tensor_data(node)
+    if input_tensor.dtype != torch.int8:
+        return None


rascani

LGTM, just a couple minor nits.

rascani · 2026-06-10T17:46:14Z

+
+
+@AtenToCortexMPass.register_dialect_substitution(
+    exir_ops.edge.quantized_decomposed.quantize_per_tensor.default


Should we also remove the ReplaceQuantNodePass?

It is used in aot_arm_compiler so it would be a slightly larger fix and should perhaps be treated as an API which should be deprecated properly, so maybe in another PR?

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Copilot

Pull request overview

Copilot reviewed 5 out of 5 changed files in this pull request and generated 1 comment.

Signed-off-by: Adrian Lundell <adrian.lundell@arm.com> Change-Id: Ibf36386cff49fe0fc16f60b8f582d0d49e061095

AdrianLundell added 2 commits June 10, 2026 09:47

Cortex-M backend: Refactor quantized_op_fusion_pass

4fb243a

Move to use the AtenToCortexMPass instead. Signed-off-by: Adrian Lundell <adrian.lundell@arm.com> Change-Id: I00020beb4248b24f344c47a1728c06925f1b7c7a

Mypy fixes

dd00104

Signed-off-by: Adrian Lundell <adrian.lundell@arm.com> Change-Id: I9f94f018d95ec58a4a12022679ddd66340344fa0

Copilot AI review requested due to automatic review settings June 10, 2026 09:42

AdrianLundell requested a review from rascani as a code owner June 10, 2026 09:42

AdrianLundell added partner: arm For backend delegation, kernels, demo, etc. from the 3rd-party partner, Arm ciflow/trunk release notes: none Do not include this in the release notes labels Jun 10, 2026

meta-cla Bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Jun 10, 2026

Copilot started reviewing on behalf of AdrianLundell June 10, 2026 09:42 View session

Copilot AI reviewed Jun 10, 2026

View reviewed changes

rascani approved these changes Jun 10, 2026

View reviewed changes

Potential fix for pull request finding

28479c7

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Copilot AI review requested due to automatic review settings June 11, 2026 14:05

Copilot started reviewing on behalf of AdrianLundell June 11, 2026 14:05 View session

Copilot AI reviewed Jun 11, 2026

View reviewed changes

Comment thread backends/cortex_m/passes/aten_to_cortex_m_pass.py

Lintrunner fix

d107ddb

Signed-off-by: Adrian Lundell <adrian.lundell@arm.com> Change-Id: Ibf36386cff49fe0fc16f60b8f582d0d49e061095

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cortex-M backend: Refactor quantized_op_fusion_pass#20179

Cortex-M backend: Refactor quantized_op_fusion_pass#20179
AdrianLundell wants to merge 4 commits into
pytorch:mainfrom
AdrianLundell:change-1257545

AdrianLundell commented Jun 10, 2026 •

edited by pytorch-bot Bot

Loading

Uh oh!

pytorch-bot Bot commented Jun 10, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

rascani left a comment

Uh oh!

Uh oh!

rascani Jun 10, 2026

Uh oh!

AdrianLundell Jun 11, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants



		@AtenToCortexMPass.register_dialect_substitution(
		exir_ops.edge.quantized_decomposed.quantize_per_tensor.default

Conversation

AdrianLundell commented Jun 10, 2026 • edited by pytorch-bot Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot Bot commented Jun 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/20179

❌ 1 New Failure, 78 Pending, 4 Unrelated Failures

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

rascani left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

rascani Jun 10, 2026

Choose a reason for hiding this comment

Uh oh!

AdrianLundell Jun 11, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

AdrianLundell commented Jun 10, 2026 •

edited by pytorch-bot Bot

Loading

pytorch-bot Bot commented Jun 10, 2026 •

edited

Loading