Improve ELU support by AdrianLundell · Pull Request #19694 · pytorch/executorch

AdrianLundell · 2026-05-20T14:00:55Z

Arm backend: Improve ELU support

Adds support for different scale and input_scale values
Adds support for related SELU and CELU operators, corresponding
to ELU with particular scales.
Use initial float values for alpha, scale and input_scale
rather than rounded values.

- Adds support for different scale and input_scale values - Adds support for related SELU and CELU operators, corresponding to ELU with particular scales. - Use initial float values for alpha, scale and input_scale rather than rounded values. Signed-off-by: Adrian Lundell <adrian.lundell@arm.com> Change-Id: I34dc97ce661213ffdcc9cf122028a115211b56e2

Signed-off-by: Adrian Lundell <adrian.lundell@arm.com> Change-Id: I541b65ce989d2dff5a0af3f8605e946b1055a899

pytorch-bot · 2026-05-20T14:01:00Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19694

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

Run pull request jobs on OSDC runners in shadow mode

❌ 2 New Failures, 4 Unrelated Failures

As of commit cc9d7e2 with merge base 7724fd7 ():

NEW FAILURES - The following jobs have failed:

pull / test-coreml-bc-macos (macos-m1-stable) / macos-job (gh)
RuntimeError: Command bash /Users/ec2-user/runner/_work/_temp/exec_script failed with exit code 1
Test Vulkan Backend / test-vulkan / package-golden-artifacts (gh)
Unable to download artifact(s): Failed to ListArtifacts: Received non-retryable error: Failed request: (403) Forbidden: Error from intermediary with HTTP status code 403 "Forbidden"

FLAKY - The following job failed but was likely due to flakiness present on trunk:

pull / test-qnn-testsuite-linux / test-backend-linux (qnn, models) / linux-job (gh) (detected as infra flaky with no log or failing log classifier)

BROKEN TRUNK - The following jobs failed but was present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / unittest / macos / macos-job (gh) (trunk failure)
##[error]The operation was canceled.
pull / unittest-editable / macos / macos-job (gh) (trunk failure)
##[error]The operation was canceled.
trunk / unittest-release / macos / macos-job (gh) (trunk failure)
##[error]The operation was canceled.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

Copilot

Pull request overview

This PR extends Arm backend handling of the ELU-family activations by adding explicit support for SELU and CELU through conversion/decomposition passes, updating operator support/partitioning metadata, and expanding the ELU test suite accordingly.

Changes:

Add pass to convert SELU/CELU to parameterized ELU, and update ELU decomposition to respect alpha/scale/input_scale.
Extend quantization annotation and TOSA operator support lists to include selu/celu, and adjust partitioner “do-not-decompose” policy for float.
Refactor/expand ELU tests to cover ELU parameters as well as nn.SELU and nn.CELU, and update related docs.

Reviewed changes

Copilot reviewed 11 out of 11 changed files in this pull request and generated 4 comments.

Show a summary per file

File	Description
backends/arm/tosa/partitioner.py	Prevents decomposition of `selu`/`celu` in FP flows so backend passes can handle them.
backends/arm/test/ops/test_elu.py	Refactors test structure and adds SELU/CELU + additional ELU parameter coverage across pipelines.
backends/arm/scripts/docgen/vgf/vgf-getting-started-tutorial.md.in	Clarifies compiler script positioning (quick tests vs production API).
backends/arm/scripts/docgen/ethos-u/ethos-u-getting-started-tutorial.md.in	Same documentation clarification as VGF tutorial.
backends/arm/quantizer/quantization_annotator.py	Adds `selu`/`celu` to one-to-one quantization annotation set.
backends/arm/operator_support/tosa_profile_supported_op_lists.py	Declares `selu`/`celu` as supported ops in relevant TOSA profiles.
backends/arm/_passes/insert_table_ops.py	Updates ELU table-op computation to use stored float params; adds SELU/CELU to special set.
backends/arm/_passes/decompose_elu_pass.py	Adds ELU-family conversion pass and extends ELU decomposition to handle SELU/CELU + extra params.
backends/arm/_passes/convert_elu_params.py	Adjusts quantized ELU parameter handling to avoid int8 kernel crashes while preserving float params via metadata.
backends/arm/_passes/arm_pass_manager.py	Inserts the new ELU-family conversion pass into both backend and annotation pipelines.
backends/arm/_passes/init.py	Exposes the new `ConvertEluFamilyToEluPass` for pipeline construction.

Comments suppressed due to low confidence (1)

backends/arm/_passes/decompose_elu_pass.py:33

get_elu_decomposition’s docstring still describes the old ELU rewrite (and doesn’t mention scale/input_scale or expm1). Please update the docstring so it matches the implementation and the updated decomposition used in DecomposeEluPass.

def get_elu_decomposition(op) -> tuple:
    """Returns the decomposition of the given aten.elu operation into its
    equivalent TOSA-supported operations.

    This handles both edge dialect ops and core PyTorch ops. The decomposition strategy
    is:
        elu(x, y) → where(greater_or_eq(x, 0), (exp(x)-1), x)

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

+def _get_elu_parameter(args, kwargs, index, name):
+    if len(args) > index:
+        return args[index]
+
+    return kwargs.get(name, 1.0)
+
+


+                )
                replace_node.kwargs = updated_kwargs

+                # Save corret parameters


        ops_to_not_decompose_if_fp = {
+            torch.ops.aten.celu.default,
            torch.ops.aten.eye.default,
            torch.ops.aten.logit.default,
            torch.ops.aten.linear.default,
            torch.ops.aten.linspace.default,
            torch.ops.aten.pad.default,
+            torch.ops.aten.selu.default,
        }


Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Copilot

Pull request overview

Copilot reviewed 11 out of 11 changed files in this pull request and generated 3 comments.

Comments suppressed due to low confidence (1)

backends/arm/_passes/decompose_elu_pass.py:33

The get_elu_decomposition() docstring no longer matches the implementation: it claims to handle core PyTorch ops, and the decomposition formula shown is outdated (it omits scale/input_scale and uses a different branch ordering). Please update the docstring to reflect the current supported ops and decomposition used by DecomposeEluPass.

def get_elu_decomposition(op) -> tuple:
    """Returns the decomposition of the given aten.elu operation into its
    equivalent TOSA-supported operations.

    This handles both edge dialect ops and core PyTorch ops. The decomposition strategy
    is:
        elu(x, y) → where(greater_or_eq(x, 0), (exp(x)-1), x)

 from executorch.backends.arm._passes import ArmPass
 from executorch.backends.arm._passes.arm_pass_utils import create_node
+from executorch.backends.arm._passes.insert_table_ops import InsertTableOpsPass
 from executorch.backends.arm.constants import DQ_OPS


+def _get_elu_parameter(args, kwargs, index, name):
+    if len(args) > index:
+        return args[index]
+
+    return kwargs.get(name, 1.0)


+                )
                replace_node.kwargs = updated_kwargs

+                # Save corret parameters


AdrianLundell added 2 commits May 20, 2026 15:49

Fix stale docgen generation pt.3

2c87c6c

Signed-off-by: Adrian Lundell <adrian.lundell@arm.com> Change-Id: I541b65ce989d2dff5a0af3f8605e946b1055a899

Copilot AI review requested due to automatic review settings May 20, 2026 14:00

AdrianLundell requested a review from digantdesai as a code owner May 20, 2026 14:00

meta-cla Bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label May 20, 2026

AdrianLundell added the partner: arm For backend delegation, kernels, demo, etc. from the 3rd-party partner, Arm label May 20, 2026

github-actions Bot added ciflow/trunk module: arm Issues related to arm backend and removed partner: arm For backend delegation, kernels, demo, etc. from the 3rd-party partner, Arm labels May 20, 2026

AdrianLundell added release notes: none Do not include this in the release notes release notes: arm Changes to the ARM backend delegate and removed release notes: none Do not include this in the release notes labels May 20, 2026

Copilot started reviewing on behalf of AdrianLundell May 20, 2026 14:01 View session

Copilot AI reviewed May 20, 2026

View reviewed changes

Potential fix for pull request finding

cc9d7e2

Co-authored-by: Copilot Autofix powered by AI <175728472+Copilot@users.noreply.github.com>

Copilot AI review requested due to automatic review settings May 20, 2026 14:29

Copilot started reviewing on behalf of AdrianLundell May 20, 2026 14:30 View session

Copilot AI reviewed May 20, 2026

View reviewed changes

zingo approved these changes May 20, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve ELU support#19694

Improve ELU support#19694
AdrianLundell wants to merge 3 commits into
pytorch:mainfrom
AdrianLundell:change-1262485

AdrianLundell commented May 20, 2026 •

edited

Loading

Uh oh!

pytorch-bot Bot commented May 20, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

AdrianLundell commented May 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot Bot commented May 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/19694

❗ 1 Active SEVs

❌ 2 New Failures, 4 Unrelated Failures

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

AdrianLundell commented May 20, 2026 •

edited

Loading

pytorch-bot Bot commented May 20, 2026 •

edited

Loading