-
Notifications
You must be signed in to change notification settings - Fork 24.4k
[chalf] enable testing for multiple ops #77405
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[chalf] enable testing for multiple ops #77405
Conversation
🔗 Helpful links
✅ No Failures (0 Pending)As of commit f22d099 (more details on the Dr. CI page): Expand to see more💚 💚 Looks good so far! There are no failures yet. 💚 💚 This comment was automatically generated by Dr. CI (expand for details).Please report bugs/suggestions to the (internal) Dr. CI Users group. |
@@ -18467,6 +18466,12 @@ def __init__( | |||
PythonRefInfo( | |||
"_refs.permute", | |||
torch_opinfo_name="permute", | |||
skips=( | |||
DecorateInfo(unittest.expectedFailure, 'TestCommon', |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What's going on here?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Oops forgot to add the error as comment
RuntimeError: "index_select_cuda" not implemented for 'ComplexHalf'
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice changes -- just add that comment, please
…lop/chalf/enable-testing-3
…shitij12345/pytorch into develop/chalf/enable-testing-3
@pytorchbot merge this please |
Hey @kshitij12345. |
@pytorchbot revert this please, as it caused torch_nn to fail with SIGIOT, see https://hud.pytorch.org/pytorch/pytorch/commit/fff560cb6e4232778cefe9b1a6ed78463b4b9e54 |
This reverts commit fff560c. Reverted #77405 on behalf of https://github.com/malfet
From the log it looks like it was triggered by SIGIOT while running
And since coredumps are not preserved as artifacts, one can get a backtrace by installing wheel package and running gdb as shown below:
Core file can be downloaded from https://gha-artifacts.s3.amazonaws.com/pytorch/pytorch/2321687135/1/coredumps-default-1-4-linux.4xlarge.nvidia.gpu/test/core.936 and offending whl package from https://gha-artifacts.s3.amazonaws.com/pytorch/pytorch/2321687135/1/linux-xenial-cuda11.3-py3.7-gcc7/artifacts.zip, which are mentioned among the artifacts lists at https://hud.pytorg.org/pr/77405 |
This PR didn't touch the failing test. @lezcano have you seen such failure previously? |
Magma strikes again, this time with a new one cc @IvanYashchuk @xwang233 . It looks like a memory corruption or insufficient resources? @malfet does the test fail consistnetly? |
@malfet looks like it was a one off issue. Can you approve this again so that I can land it? Thanks! |
As discussed above, the failure looks unrelated (but recurrent). Should we disable that test while we figure out the issue? @lezcano |
Are these errors caused by this PR or are they coming from some flaky behaviour in CI? |
AFAIK, the failure isn't directly related to this PR as it doesn't touch that function or test. Seems to be a flaky case. Will close this PR and open a new one with this branch for merging. (IIRC, reopening and remerging the same PR leads to issues internally). |
Reland: #77405 Ref: #74537 Enable for `permute, split, split_with_sizes, select, ravel, reshape, reshape_as, unfold, squeeze, unsqueeze, transpose` Pull Request resolved: #77656 Approved by: https://github.com/anjali411
Summary: Reland: #77405 Ref: #74537 Enable for `permute, split, split_with_sizes, select, ravel, reshape, reshape_as, unfold, squeeze, unsqueeze, transpose` Pull Request resolved: #77656 Approved by: https://github.com/anjali411 Test Plan: contbuild & OSS CI, see https://hud.pytorch.org/commit/pytorch/pytorch/687ab97338c434f2d428325fd742ae7cd3042b53 Reviewed By: seemethere Differential Revision: D36494122 Pulled By: seemethere fbshipit-source-id: cb2803bf28c9be46547437c3b52e3dfb63b52336
Ref: #74537
Enable for
permute, split, split_with_sizes, select, ravel, reshape, reshape_as, unfold, squeeze, unsqueeze, transpose