Conversation


@justinchuby justinchuby commented Dec 31, 2025

Fix aten__native_batch_norm_legit_functional, where the running mean/var were returned directly as graph outputs without creating new values, making the graph invalid.

Fixes pytorch/pytorch#171471

Signed-off-by: Justin Chu <justinchuby@users.noreply.github.com>

codecov bot commented Dec 31, 2025

Codecov Report

❌ Patch coverage is 0% with 1 line in your changes missing coverage. Please review.
✅ Project coverage is 70.09%. Comparing base (519ef5a) to head (a9f8ff0).
⚠️ Report is 2 commits behind head on main.
✅ All tests successful. No failed tests found.

Files with missing lines Patch % Lines
onnxscript/function_libs/torch_lib/ops/core.py 0.00% 1 Missing ⚠️
Additional details and impacted files
@@           Coverage Diff           @@
##             main    #2753   +/-   ##
=======================================
  Coverage   70.09%   70.09%           
=======================================
  Files         228      228           
  Lines       27382    27382           
  Branches     2783     2783           
=======================================
  Hits        19194    19194           
  Misses       7229     7229           
  Partials      959      959           

☔ View full report in Codecov by Sentry.

@justinchuby justinchuby added the module: torchlib Related to the torch/aten function lib in development label Dec 31, 2025
onnxscript/function_libs/torch_lib/ops/core.py:

      running_mean_fp32 = op.Cast(running_mean, to=FLOAT.dtype)
      invstd = op.Cast(invstd, to=FLOAT.dtype)
  -   return norm, running_mean_fp32, invstd, running_mean, running_var
  +   return norm, running_mean_fp32, invstd, op.Identity(running_mean), op.Identity(running_var)
Collaborator

Do you know what was happening? I assume running_mean/var were valid values already ... but presumably some other requirement was violated (like an input value cannot also be an output value)?

In other words, what is the requirement for torchlib functions from a dev's perspective? Are they required to never return an input value as an output value without wrapping it in an Identity? Seems like something the underlying infrastructure could take care of without burdening the torchlib developer.

Collaborator Author

@justinchuby justinchuby Dec 31, 2025

Here the running_mean and running_var are treated as mutable buffers in the PyTorch model, so an initializer ended up being used directly as a graph output.

This only happens with the training graph, so we did not see the case in our testing.

From torchlib's perspective, yes: an input should not be returned directly as an output. It is probably true that we could detect this externally and wrap the output in an Identity.
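The invariant under discussion can be sketched with a toy check (this is an illustration, not the real ONNX checker or the torchlib infrastructure; the dictionary-based "node" shape here is invented for the example): a graph output is only valid if some node actually produces it, so passing an input like running_mean straight through violates the constraint, while inserting an Identity node satisfies it.

```python
# Toy illustration of the graph invariant: every graph output name must
# be produced by some node in the graph. Returning an input (e.g.
# running_mean) directly as an output violates this; wrapping it in an
# Identity node creates a fresh value that satisfies it.
def outputs_are_produced(nodes, graph_outputs):
    produced = {name for node in nodes for name in node["outputs"]}
    return all(name in produced for name in graph_outputs)

# Before the fix: running_mean is passed straight through -> invalid.
nodes_before = [{"op": "BatchNormalization", "outputs": ["norm"]}]
assert not outputs_are_produced(nodes_before, ["norm", "running_mean"])

# After the fix: an Identity node produces a new value -> valid.
nodes_after = nodes_before + [
    {"op": "Identity", "outputs": ["running_mean_out"]},
]
assert outputs_are_produced(nodes_after, ["norm", "running_mean_out"])
```

This is also why an automatic external pass could plausibly do the wrapping: the condition is purely structural and detectable from the graph alone.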

@justinchuby justinchuby merged commit a571309 into main Dec 31, 2025
33 checks passed
@justinchuby justinchuby deleted the justinchu/fix-native-batch-norm branch December 31, 2025 17:32

Labels

module: torchlib Related to the torch/aten function lib in development

Development

Successfully merging this pull request may close these issues.

[ONNX] Exporter fails some torchvision models

3 participants