[ONNX] Update 'Functionalize' pass to support pre-decomp graph; Drop 'aten_graph' arg for 'DynamoExporter' #99667
Conversation
🔗 Helpful Links: 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/99667. ✅ No failures as of commit 3fc55a6. This comment was automatically generated by Dr. CI and updates every 15 minutes.
Summary

- Previously this was required by `tracing_mode=symbolic` for `dynamic` tracing. That argument will be dropped by #99555.
- The later decomposition pass will do graph lowering, so this step is duplicated.
- Functionalization currently cannot work properly on an aten-level graph, so it must happen before lowering & decompositions (a two-pass sketch follows below).
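To make the ordering constraint concrete, here is a minimal two-pass sketch using the public `functionalize` and `make_fx` utilities. This illustrates the constraint only; it is not the exporter's actual code, and the toy `fn` and variable names are made up for the example.

```python
import torch
from torch.func import functionalize
from torch.fx.experimental.proxy_tensor import make_fx
from torch._decomp import core_aten_decompositions

def fn(x):
    y = x.clone()
    y.add_(1)  # in-place op that functionalization rewrites out-of-place
    return y

x = torch.randn(3)

# Pass 1: functionalize the pre-decomposition graph.
gm_functional = make_fx(functionalize(fn, remove="mutations_and_views"))(x)

# Pass 2: lower/decompose the now mutation-free graph. Swapping the order
# would hand functionalization an already-lowered aten graph, which is
# exactly the case it cannot handle properly.
gm_core = make_fx(gm_functional, decomposition_table=core_aten_decompositions())(x)
print(gm_core.graph)
```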
Summary

- Previously this was required by `tracing_mode=symbolic` for `dynamic` tracing. That argument will be dropped by #99555.
- The later decomposition pass will do graph lowering, so this step is duplicated.
- Functionalization currently cannot work properly on an aten-level graph, so it must happen before lowering & decompositions.
- Introduce a `ReplaceInplacePostFunctionalization` pass that replaces in-place variant ops with their out-of-place versions (a rough sketch follows below). These ops are created by aten graph lowering and decomposition after functionalization; they won't perform any real mutation, since mutation is expected to have been handled by functionalization. Workaround to unblock #99662.
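The pass itself lives in the exporter internals and is not shown in this thread. As a rough, hypothetical sketch only (the function name and matching strategy are assumptions, not the PR's implementation), such a pass can walk the FX graph and swap each in-place aten overload, whose schema name ends in `_`, for the out-of-place overload of the same name:

```python
import torch
import torch.fx


def replace_inplace_with_outplace(gm: torch.fx.GraphModule) -> torch.fx.GraphModule:
    """Hypothetical sketch: swap in-place aten ops for out-of-place variants.

    Only sound after functionalization, when any remaining in-place ops no
    longer perform observable mutation. A real pass would need a curated
    mapping for ops whose names end in '_' but are not in-place variants.
    """
    for node in gm.graph.nodes:
        if node.op == "call_function" and isinstance(node.target, torch._ops.OpOverload):
            schema = node.target._schema
            if schema.name.endswith("_"):  # e.g. "aten::add_"
                packet = getattr(torch.ops.aten, schema.name.split("::")[1][:-1], None)
                if packet is not None:
                    # Keep the same overload, e.g. aten.add_.Tensor -> aten.add.Tensor.
                    outplace = getattr(packet, schema.overload_name or "default", None)
                    if outplace is not None:
                        node.target = outplace
    gm.graph.lint()
    gm.recompile()
    return gm
```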
Summary

- Previously this was required by, and entangled with, `tracing_mode=symbolic` for `dynamic` tracing. That is resolved by #99555 and its follow-ups.
- The later decomposition pass will do graph lowering, so this step is duplicated.
- Updated `Functionalization` to work around #99774 (comment).

Todo

- Training vs eval in `dynamo_export`: we are effectively exporting all models in training mode by default, but for the sake of this export we are only interested in eval mode. The question is whether we should call `model.eval()` in `dynamo_export`. Tests with models containing batch norm fail 'functionalization' in training mode; we explicitly call `model.eval()` for these models for now.
- Merge the decomp and functionalize passes. Both call into `make_fx`; merging potentially improves performance, but it is unclear whether it would change behavior (see the sketch below).

Workaround to unblock #99662.
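For the merge Todo above, a minimal sketch of what a combined pass could look like, assuming both passes keep using the public `make_fx`/`functionalize` utilities (whether this is behaviorally identical to two separate retraces is exactly the open question):

```python
from torch._decomp import core_aten_decompositions
from torch.func import functionalize
from torch.fx.experimental.proxy_tensor import make_fx


def functionalize_and_decompose(fn, *args):
    # A single make_fx call: functionalize while tracing with the
    # decomposition table, instead of retracing the graph a second time.
    return make_fx(
        functionalize(fn, remove="mutations_and_views"),
        decomposition_table=core_aten_decompositions(),
    )(*args)
```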
This seems to be getting complicated. You might consider moving 'Functionalize' ahead of dropping 'aten_graph' in the title.
@pytorchbot merge
Merge started. Your change will be merged once all checks pass (ETA 0-4 hours). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
```python
for inpt, input_functional in zip(flat_inputs, flat_inputs_functional):
    if isinstance(input_functional, torch.Tensor):
        torch._sync(input_functional)
        inpt_new = torch._from_functional_tensor(input_functional)
```
Why is `inpt_new` assigned?
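For context on the question: in the wrapper that `torch.func.functionalize` itself uses, the tensor pulled out with `torch._from_functional_tensor` is compared against the original input so that any mutation recorded during the functionalized run can be copied back. A hedged sketch of that pattern, continuing the loop quoted above (this is an assumption about the intent, not necessarily what this PR does with `inpt_new`):

```python
for inpt, input_functional in zip(flat_inputs, flat_inputs_functional):
    if isinstance(input_functional, torch.Tensor):
        torch._sync(input_functional)
        inpt_new = torch._from_functional_tensor(input_functional)
        if inpt_new is not inpt:
            # The functionalized run mutated this input; propagate the
            # mutation back to the original tensor.
            inpt.copy_(inpt_new)
```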
Stack from ghstack (oldest at bottom):

Summary

- Previously this was required by, and entangled with, `tracing_mode=symbolic` for `dynamic` tracing. That is resolved by "Delete tracing_mode argument to export" (#99555) and its follow-ups.
- The later decomposition pass will do graph lowering, so this step is duplicated.
- Updated `Functionalization` to work around "RuntimeError: Cannot call sizes() on tensor with symbolic sizes/strides w/ dynamo.export, make_fx and functionalize" (#99774 (comment)).

Todo

- Training vs eval in `dynamo_export`: we are effectively exporting all models in training mode by default, but for the sake of this export we are only interested in eval mode. The question is whether we should call `model.eval()` in `dynamo_export` (see the sketch below). Tests with models containing batch norm fail 'functionalization' in training mode; we explicitly call `model.eval()` for these models for now.
- Merge the decomp and functionalize passes. Both call into `make_fx`; merging potentially improves performance, but it is unclear whether it would change behavior.

Fixes #99662. (For the functionalization issue; missing op support is still needed.)
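To make the training-vs-eval point concrete, here is a minimal sketch, assuming the `torch.onnx.dynamo_export` entry point this stack builds toward (the toy module is made up for the example): a model containing batch norm is switched to eval mode before export, because batch norm mutates its running stats in training mode and that mutation currently trips functionalization.

```python
import torch


class Net(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.bn = torch.nn.BatchNorm2d(3)

    def forward(self, x):
        return self.bn(x)


model = Net()
# In training mode, batch norm updates running_mean/running_var in place,
# which functionalization currently cannot handle; export in eval mode.
model.eval()
export_output = torch.onnx.dynamo_export(model, torch.randn(1, 3, 8, 8))
```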