Support both train / eval modes for ModuleInfo by jbschlosser · Pull Request #78735 · pytorch/pytorch · GitHub

Support both train / eval modes for ModuleInfo #78735


Closed
jbschlosser wants to merge 5 commits

Conversation

jbschlosser (Contributor) commented Jun 2, 2022

Stack from ghstack:

This PR enhances the tests in `test/test_modules.py` to be run across train / eval modes. It makes the following changes:

  • Adds a required `training` arg to `ModuleInfo.module_inputs_func` - this allows those functions to generate sample module inputs based on the training mode.
  • Updates the `@modules` decorator to additionally pass a `training` arg to test functions.
    • The `training` arg values to pass can be customized via `TrainEvalMode`, e.g. `@modules(..., train_eval_mode=TrainEvalMode.train_only)`. Supported modes are `train_only`, `eval_only`, and `train_and_eval`.
    • Tests that use the decorator in `test/test_modules.py` have been updated. They now ingest the `training` arg, pass it to the `module_inputs_func`, and set the training mode for instantiated module instances with `m.train(training)`. There's almost certainly a way to do this with less boilerplate. A simplified sketch of this pattern follows the list.
  • Adds a new `train_and_eval_differ` option to `ModuleInfo` for indicating whether a specific module cares about the training mode.
    • If this is `False`, tests are generated for just a single mode. This avoids pointlessly running tests twice when the training mode setting doesn't matter.
    • Adds a new test in `test/test_modules.py` to verify that `train_and_eval_differ` is set correctly. It does this by instantiating each module, deleting the `training` attribute, and then running the forward pass. If the forward pass references `training`, it will throw an `AttributeError`, indicating that `train_and_eval_differ=True` should be set. A sketch of this check also follows the list.
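Below is a minimal, self-contained sketch of the first pattern. It is illustrative only: `module_inputs_dropout` and `run_forward_test` are invented stand-ins for this sketch, not the actual `ModuleInfo` / `@modules` machinery in `torch.testing._internal.common_modules`.

```python
# Illustrative sketch only; not the real ModuleInfo / @modules implementation.
import torch
import torch.nn as nn

def module_inputs_dropout(training):
    # With this PR, a module_inputs_func receives the training mode and can tailor
    # its samples, e.g. only emitting a sample that is meaningful in eval mode.
    samples = [torch.randn(3, 4)]
    if not training:
        samples.append(torch.randn(16, 4))  # eval-only sample
    return samples

def run_forward_test(module_cls, module_inputs_func, training):
    # Mirrors what the updated tests in test/test_modules.py do: ingest the training
    # arg, pass it to module_inputs_func, and set the mode via m.train(training).
    m = module_cls(p=0.5)
    m.train(training)
    for inp in module_inputs_func(training):
        out = m(inp)
        assert out.shape == inp.shape

# train_and_eval: the test body runs once per mode.
for training in (True, False):
    run_forward_test(nn.Dropout, module_inputs_dropout, training)
```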

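And a minimal sketch of the `train_and_eval_differ` verification idea: delete the `training` attribute and see whether the forward pass trips over it. Again simplified; `forward_references_training` is an invented name, not the actual test in `test/test_modules.py`.

```python
# Simplified version of the train_and_eval_differ check described above.
import torch
import torch.nn as nn

def forward_references_training(m, sample_input):
    # Delete the training flag; any forward pass that reads self.training will now
    # raise AttributeError, indicating train_and_eval_differ=True should be set.
    del m.training
    try:
        m(sample_input)
    except AttributeError:
        return True
    return False

x = torch.randn(2, 3)
print(forward_references_training(nn.Dropout(p=0.5), x))  # True: Dropout reads self.training
print(forward_references_training(nn.Linear(3, 3), x))    # False: Linear is mode-agnostic
```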
facebook-github-bot (Contributor) commented Jun 2, 2022

✅ No Failures (46 Pending)

As of commit 6d46096 (more details on the Dr. CI page):

💚 💚 Looks good so far! There are no failures yet. 💚 💚


This comment was automatically generated by Dr. CI.


jbschlosser added a commit that referenced this pull request Jun 2, 2022
ghstack-source-id: 160cd20
Pull Request resolved: #78735
This PR enhances the tests in `test/test_modules.py` to be run across train / eval modes. It allows sample inputs to be generated based on the mode under test (useful, e.g., if a certain sample should only be used during eval mode).

[ghstack-poisoned]
jbschlosser requested review from albanD and removed request for ngimel June 7, 2022 22:18
jbschlosser added a commit that referenced this pull request Jun 7, 2022
ghstack-source-id: 2204243
Pull Request resolved: #78735
albanD (Collaborator) left a comment:

SGTM!
Just minor comments

jbschlosser (Contributor, Author) left a comment:

@albanD Before I can merge this, it looks like I need a way to skip a test for eval mode only.

I have a working fix that can skip based on anything passed to the test; I'm going to add a commit earlier in the PR stack that adds this.
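For illustration only, a hedged sketch of what such a skip could look like; `skip_for_eval` and its plumbing are invented here and are not necessarily what the added commit actually implements.

```python
# Hypothetical sketch: skip a generated test variant based on its parametrization,
# e.g. skipping only the eval-mode variant. Names are invented for illustration.
import unittest

def skip_for_eval(reason):
    def decorator(test_fn):
        def wrapper(self, *args, training=True, **kwargs):
            if not training:  # eval-mode variant of the generated test
                raise unittest.SkipTest(reason)
            return test_fn(self, *args, training=training, **kwargs)
        return wrapper
    return decorator
```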

jbschlosser added a commit that referenced this pull request Jun 8, 2022
ghstack-source-id: e4dc89b
Pull Request resolved: #78735
jbschlosser (Contributor, Author) commented:

@pytorchbot merge -g

pytorchmergebot (Collaborator) commented:

@pytorchbot successfully started a merge job. Check the current status here

github-actions bot commented Jun 8, 2022

Hey @jbschlosser.
You've committed this PR, but it does not have both a 'release notes: ...' and 'topics: ...' label. Please add one of each to the PR. The 'release notes: ...' label should represent the part of PyTorch that this PR changes (fx, autograd, distributed, etc) and the 'topics: ...' label should represent the kind of PR it is (not user facing, new feature, bug fix, perf improvement, etc). The list of valid labels can be found here for the 'release notes: ...' and here for the 'topics: ...'.
For changes that are 'topic: not user facing' there is no need for a release notes label.

malfet (Contributor) commented Jun 9, 2022

@pytorchbot revert -m "Broke eval tests on Win, 10.2 and ROCM, see https://hud.pytorch.org/pytorch/pytorch/commit/12658fcd5bdf4d2437754633b3fa39ab15d213b9 " -c missingsignal

pytorch-bot bot commented Jun 9, 2022

❌ 🤖 pytorchbot command failed:

@pytorchbot revert: error: argument -c/--classification: invalid choice: 'missingsignal' (choose from 'nosignal', 'ignoredsignal', 'landrace', 'weird', 'ghfirst')

usage: @pytorchbot revert -m MESSAGE
                          [-c {nosignal,ignoredsignal,landrace,weird,ghfirst}]

Try @pytorchbot help for more info.

malfet (Contributor) commented Jun 9, 2022

@pytorchbot revert -m "Broke eval tests on Win, 10.2 and ROCM, see https://hud.pytorch.org/pytorch/pytorch/commit/12658fcd5bdf4d2437754633b3fa39ab15d213b9 " -c nosignal

pytorchmergebot (Collaborator) commented:

@pytorchbot successfully started a revert job. Check the current status here

pytorchmergebot added a commit that referenced this pull request Jun 9, 2022
jbschlosser reopened this Jun 9, 2022
jbschlosser added the ciflow/trunk label Jun 9, 2022
jbschlosser added a commit that referenced this pull request Jun 9, 2022
ghstack-source-id: 58125b7
Pull Request resolved: #78735
jbschlosser added the 'release notes: nn' and 'topic: not user facing' labels Jun 9, 2022
jbschlosser (Contributor, Author) commented:

@pytorchbot merge -a

pytorchmergebot (Collaborator) commented:

@pytorchbot successfully started a merge job. Check the current status here

pytorchmergebot (Collaborator) commented:

Merge failed due to Refusing to merge as mandatory check(s) linux-docs / build-docs (cpp) are pending/not yet run for rule superuser
Raised by https://github.com/pytorch/pytorch/actions/runs/2470378831

jbschlosser (Contributor, Author) commented:

@pytorchbot merge -a

pytorchmergebot (Collaborator) commented:

@pytorchbot successfully started a merge job. Check the current status here

facebook-github-bot pushed a commit that referenced this pull request Jun 10, 2022
Summary:
Pull Request resolved: #78735

Approved by: https://github.com/albanD

Test Plan: contbuild & OSS CI, see https://hud.pytorch.org/commit/pytorch/pytorch/12658fcd5bdf4d2437754633b3fa39ab15d213b9

Reviewed By: osalpekar

Differential Revision: D37025769

Pulled By: jbschlosser

fbshipit-source-id: 4c9c842f4bc6dbed3a37fd464dfbad2fe48bad17
facebook-github-bot pushed a commit that referenced this pull request Jun 10, 2022
Summary:
This reverts commit 12658fc.

Reverted #78735 on behalf of https://github.com/malfet due to Broke eval tests on Win, 10.2 and ROCM, see https://hud.pytorch.org/pytorch/pytorch/commit/12658fcd5bdf4d2437754633b3fa39ab15d213b9

Test Plan: contbuild & OSS CI, see https://hud.pytorch.org/commit/pytorch/pytorch/854c833f8101b62ccda9198dd0a66555e7caaa69

Reviewed By: osalpekar

Differential Revision: D37030370

Pulled By: osalpekar

fbshipit-source-id: 77810affd0ac9acefd8ea96a3d39781a0d5b9eaa
facebook-github-bot pushed a commit that referenced this pull request Jun 10, 2022
Summary:
Pull Request resolved: #78735

Approved by: https://github.com/albanD

Test Plan: contbuild & OSS CI, see https://hud.pytorch.org/commit/pytorch/pytorch/70d6446a3def513b8082d243d7996ef86c2787a6

Reviewed By: osalpekar

Differential Revision: D37059361

Pulled By: osalpekar

fbshipit-source-id: a179f93c722112a70bdcecc67f6c5d1c9234eb50
facebook-github-bot deleted the gh/jbschlosser/39/head branch June 13, 2022 14:18
Labels
ciflow/trunk · cla signed · Merged · release notes: nn · topic: not user facing