[ROCm] Temporarily disabling ROCm CI job by pruthvistony · Pull Request #81646 · pytorch/pytorch · GitHub
[ROCm] Temporarily disabling ROCm CI job #81646

Closed
pruthvistony wants to merge 1 commit

@pruthvistony (Collaborator)

Temporarily disabling the ROCm CI job due to a network upgrade on all machines.
@pruthvistony pruthvistony requested a review from a team as a code owner July 18, 2022 17:55
@pytorch-bot pytorch-bot bot added the module: rocm AMD GPU support for Pytorch label Jul 18, 2022
@facebook-github-bot (Contributor) commented Jul 18, 2022

❌ 4 New Failures

As of commit d57f0cb (more details on the Dr. CI page):

  • 4/4 failures introduced in this PR

🕵️ 4 new failures recognized by patterns

The following CI failures do not appear to be due to upstream breakages

See GitHub Actions build pull / linux-bionic-cuda11.6-py3.7-gcc7 / test (default, 4, 4, linux.4xlarge.nvidia.gpu) (1/4)

Step: "Test"

2022-07-18T20:17:14.2338303Z RuntimeError: test_reductions failed!
2022-07-18T20:17:13.4805587Z 
2022-07-18T20:17:13.4805755Z FAILED (errors=5, skipped=128, expected failures=66)
2022-07-18T20:17:13.4805965Z 
2022-07-18T20:17:13.4806073Z Generating XML reports...
2022-07-18T20:17:13.8112899Z Generated XML report: test-reports/python-unittest/test_reductions/TEST-TestReductionsCUDA-20220718201618.xml
2022-07-18T20:17:14.2332632Z Traceback (most recent call last):
2022-07-18T20:17:14.2333395Z   File "/var/lib/jenkins/workspace/test/run_test.py", line 940, in <module>
2022-07-18T20:17:14.2334018Z     main()
2022-07-18T20:17:14.2334602Z   File "/var/lib/jenkins/workspace/test/run_test.py", line 918, in main
2022-07-18T20:17:14.2337639Z     raise RuntimeError(err_message)
2022-07-18T20:17:14.2338303Z RuntimeError: test_reductions failed!
2022-07-18T20:17:14.6928545Z 
2022-07-18T20:17:14.6928938Z real	88m56.642s
2022-07-18T20:17:14.6929230Z user	86m56.089s
2022-07-18T20:17:14.6929475Z sys	3m48.098s
2022-07-18T20:17:14.6974398Z ##[error]Process completed with exit code 1.
2022-07-18T20:17:14.7011196Z Prepare all required actions
2022-07-18T20:17:14.7011639Z Getting action download info
2022-07-18T20:17:14.8968880Z ##[group]Run ./.github/actions/get-workflow-job-id
2022-07-18T20:17:14.8969182Z with:
2022-07-18T20:17:14.8969712Z   github-token: ***

See GitHub Actions build pull / linux-focal-py3.7-gcc7 / test (default, 2, 2, linux.2xlarge) (2/4)

Step: "Test"

2022-07-18T18:12:32.0850482Z ##[error]Process completed with exit code 1.
2022-07-18T18:12:14.1993792Z Download error on https://pypi.org/simple/tabulate/: _ssl.c:1074: The handshake operation timed out -- Some packages may not be found!
2022-07-18T18:12:14.2004601Z Couldn't find index page for 'tabulate' (maybe misspelled?)
2022-07-18T18:12:14.2005225Z Scanning index of all packages (this may take a while)
2022-07-18T18:12:14.2005864Z Reading https://pypi.org/simple/
2022-07-18T18:12:31.7882750Z No local packages or working download links found for tabulate
2022-07-18T18:12:31.7883582Z error: Could not find suitable distribution for Requirement.parse('tabulate')
2022-07-18T18:12:32.0816895Z 
2022-07-18T18:12:32.0817354Z real	0m36.327s
2022-07-18T18:12:32.0817722Z user	0m20.175s
2022-07-18T18:12:32.0819137Z sys	0m0.795s
2022-07-18T18:12:32.0850482Z ##[error]Process completed with exit code 1.
2022-07-18T18:12:32.0891020Z Prepare all required actions
2022-07-18T18:12:32.0891310Z Getting action download info
2022-07-18T18:12:32.3861966Z ##[group]Run ./.github/actions/get-workflow-job-id
2022-07-18T18:12:32.3862188Z with:
2022-07-18T18:12:32.3862594Z   github-token: ***
2022-07-18T18:12:32.3862763Z env:
2022-07-18T18:12:32.3862936Z   GIT_DEFAULT_BRANCH: master
2022-07-18T18:12:32.3863106Z ##[endgroup]
2022-07-18T18:12:32.3888542Z ##[group]Run nick-fields/retry@71062288b76e2b6214ebde0e673ce0de1755740a
2022-07-18T18:12:32.3888773Z with:

See GitHub Actions build pull / linux-bionic-cuda11.6-py3.7-gcc7 / test (default, 2, 4, linux.4xlarge.nvidia.gpu) (3/4)

Step: "Test"

2022-07-18T18:53:00.1066498Z RuntimeError: test_jit failed!
2022-07-18T18:52:59.3074402Z Generated XML report: test-reports/python-unittest/test_jit/TEST-jit.test_warn.TestWarn-20220718184958.xml
2022-07-18T18:52:59.3082822Z Generated XML report: test-reports/python-unittest/test_jit/TEST-jit.test_with.TestWith-20220718184958.xml
2022-07-18T18:52:59.3091230Z Generated XML report: test-reports/python-unittest/test_jit/TEST-jit.test_data_parallel.TestDataParallel-20220718184958.xml
2022-07-18T18:52:59.3106494Z Generated XML report: test-reports/python-unittest/test_jit/TEST-jit.test_legacy_upgraders.TestLegacyUpgraders-20220718184958.xml
2022-07-18T18:52:59.3120687Z Generated XML report: test-reports/python-unittest/test_jit/TEST-jit.test_save_load.TestSaveLoadFlatbuffer-20220718184958.xml
2022-07-18T18:53:00.1056532Z Traceback (most recent call last):
2022-07-18T18:53:00.1057316Z   File "/var/lib/jenkins/workspace/test/run_test.py", line 940, in <module>
2022-07-18T18:53:00.1065258Z     main()
2022-07-18T18:53:00.1065602Z   File "/var/lib/jenkins/workspace/test/run_test.py", line 918, in main
2022-07-18T18:53:00.1066168Z     raise RuntimeError(err_message)
2022-07-18T18:53:00.1066498Z RuntimeError: test_jit failed!
2022-07-18T18:53:00.5821696Z 
2022-07-18T18:53:00.5822148Z real	3m10.092s
2022-07-18T18:53:00.5822625Z user	4m23.599s
2022-07-18T18:53:00.5822885Z sys	0m22.621s
2022-07-18T18:53:00.5868029Z ##[error]Process completed with exit code 1.
2022-07-18T18:53:00.5904689Z Prepare all required actions
2022-07-18T18:53:00.5905127Z Getting action download info
2022-07-18T18:53:00.8140585Z ##[group]Run ./.github/actions/get-workflow-job-id
2022-07-18T18:53:00.8140894Z with:
2022-07-18T18:53:00.8141316Z   github-token: ***

See GitHub Actions build pull / linux-focal-py3.7-clang7-asan / test (default, 2, 5, linux.2xlarge) (4/4)

Step: "Test"

2022-07-18T19:36:34.0097303Z [  FAILED  ] CustomAutogradTest.BackwardWithNonLeafInputs
2022-07-18T19:36:33.9429120Z [ RUN      ] OperationTest.Cross
2022-07-18T19:36:33.9681215Z [       OK ] OperationTest.Cross (25 ms)
2022-07-18T19:36:33.9681527Z [ RUN      ] OperationTest.Linear_out
2022-07-18T19:36:33.9699000Z [       OK ] OperationTest.Linear_out (1 ms)
2022-07-18T19:36:33.9699331Z [----------] 3 tests from OperationTest (35 ms total)
2022-07-18T19:36:33.9699497Z 
2022-07-18T19:36:33.9699664Z [----------] Global test environment tear-down
2022-07-18T19:36:34.0096172Z [==========] 988 tests from 47 test suites ran. (319506 ms total)
2022-07-18T19:36:34.0096684Z [  PASSED  ] 987 tests.
2022-07-18T19:36:34.0096958Z [  FAILED  ] 1 test, listed below:
2022-07-18T19:36:34.0097303Z [  FAILED  ] CustomAutogradTest.BackwardWithNonLeafInputs
2022-07-18T19:36:34.0097502Z 
2022-07-18T19:36:34.0097572Z  1 FAILED TEST
2022-07-18T19:36:34.4400935Z ##[error]Process completed with exit code 1.
2022-07-18T19:36:34.4436863Z Prepare all required actions
2022-07-18T19:36:34.4437156Z Getting action download info
2022-07-18T19:36:34.6632940Z ##[group]Run ./.github/actions/get-workflow-job-id
2022-07-18T19:36:34.6633157Z with:
2022-07-18T19:36:34.6633483Z   github-token: ***
2022-07-18T19:36:34.6633639Z env:
2022-07-18T19:36:34.6633809Z   GIT_DEFAULT_BRANCH: master

This comment was automatically generated by Dr. CI.

Please report bugs/suggestions to the (internal) Dr. CI Users group.

@pruthvistony (Collaborator, Author)

@pytorchbot merge -f

@pytorchmergebot (Collaborator)

@pytorchbot successfully started a merge job.

@github-actions (Contributor)

Hey @pruthvistony.
You've committed this PR, but it does not have both a 'release notes: ...' and 'topics: ...' label. Please add one of each to the PR. The 'release notes: ...' label should represent the part of PyTorch that this PR changes (fx, autograd, distributed, etc) and the 'topics: ...' label should represent the kind of PR it is (not user facing, new feature, bug fix, perf improvement, etc). The list of valid labels can be found here for the 'release notes: ...' and here for the 'topics: ...'.
For changes that are 'topic: not user facing' there is no need for a release notes label.
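
The labeling rule in this bot message can be sketched as a small check. This is only an illustration of the rule as worded above, not the bot's actual implementation: the function name `has_required_labels` and the example label values are hypothetical, and the sketch assumes labels are plain strings and accepts either the `topic:` or `topics:` prefix because the message uses both spellings.

```python
from typing import Iterable

# Prefixes taken from the wording of the bot message; accepting both
# "topic:" and "topics:" is an assumption made for this sketch.
RELEASE_NOTES_PREFIX = "release notes:"
TOPIC_PREFIXES = ("topic:", "topics:")
NOT_USER_FACING = "topic: not user facing"


def has_required_labels(labels: Iterable[str]) -> bool:
    """Return True if the labels satisfy the rule described in the message."""
    names = [name.strip().lower() for name in labels]
    # A "not user facing" topic label needs no release notes label.
    if NOT_USER_FACING in names:
        return True
    has_release_notes = any(n.startswith(RELEASE_NOTES_PREFIX) for n in names)
    has_topic = any(n.startswith(TOPIC_PREFIXES) for n in names)
    return has_release_notes and has_topic


if __name__ == "__main__":
    # This PR carried only the "module: rocm" label when the bot commented.
    print(has_required_labels(["module: rocm"]))
    # Hypothetical label sets that would satisfy the rule.
    print(has_required_labels(["module: rocm", "topic: not user facing"]))
    print(has_required_labels(["release notes: rocm", "topic: bug fix"]))
```

Under this reading, a PR labeled only `module: rocm` fails the check, which matches the bot's reminder here.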

@pruthvistony (Collaborator, Author)

@pytorchbot revert -m "Trying to restore back ROCm job, network upgrade is done" -c nosignal

@pytorchmergebot (Collaborator)

@pytorchbot successfully started a revert job.

@pytorchmergebot (Collaborator)

Will not revert as @pruthvistony is not a MEMBER, but COLLABORATOR

@kit1980 (Contributor) commented Jul 20, 2022

@pytorchbot revert -m "Trying to restore back ROCm job, network upgrade is done" -c weird

@pytorchmergebot (Collaborator)

@pytorchbot successfully started a revert job.

@pytorchmergebot (Collaborator)

@pruthvistony your PR has been successfully reverted.

pytorchmergebot added a commit that referenced this pull request Jul 20, 2022
This reverts commit 9f9dd4f.

Reverted #81646 on behalf of https://github.com/kit1980 due to Trying to restore back ROCm job, network upgrade is done
6 participants