Remove native_functions.yaml dependency from TensorTopK.cu by peterbell10 · Pull Request #66794 · pytorch/pytorch · GitHub

Remove native_functions.yaml dependency from TensorTopK.cu #66794


Closed
peterbell10 wants to merge 11 commits


pytorch-probot bot commented Oct 18, 2021
CI Flow Status

⚛️ CI Flow

Ruleset - Version: v1
Ruleset - File: https://github.com/pytorch/pytorch/blob/9d472d6a5d000238aee3dbb00f98a92a29d0857f/.github/generated-ciflow-ruleset.json
PR ciflow labels: ciflow/default

Workflows Labels (bold enabled) Status
Triggered Workflows
linux-bionic-py3.6-clang9 ciflow/all, ciflow/cpu, ciflow/default, ciflow/linux, ciflow/noarch, ciflow/xla ✅ triggered
linux-vulkan-bionic-py3.6-clang9 ciflow/all, ciflow/cpu, ciflow/default, ciflow/linux, ciflow/vulkan ✅ triggered
linux-xenial-cuda11.3-py3.6-gcc7 ciflow/all, ciflow/cuda, ciflow/default, ciflow/linux ✅ triggered
linux-xenial-py3-clang5-mobile-build ciflow/all, ciflow/default, ciflow/linux, ciflow/mobile ✅ triggered
linux-xenial-py3-clang5-mobile-custom-build-dynamic ciflow/all, ciflow/default, ciflow/linux, ciflow/mobile ✅ triggered
linux-xenial-py3-clang5-mobile-custom-build-static ciflow/all, ciflow/default, ciflow/linux, ciflow/mobile ✅ triggered
linux-xenial-py3.6-clang7-asan ciflow/all, ciflow/cpu, ciflow/default, ciflow/linux, ciflow/sanitizers ✅ triggered
linux-xenial-py3.6-clang7-onnx ciflow/all, ciflow/cpu, ciflow/default, ciflow/linux, ciflow/onnx ✅ triggered
linux-xenial-py3.6-gcc5.4 ciflow/all, ciflow/cpu, ciflow/default, ciflow/linux ✅ triggered
linux-xenial-py3.6-gcc7 ciflow/all, ciflow/cpu, ciflow/default, ciflow/linux ✅ triggered
linux-xenial-py3.6-gcc7-bazel-test ciflow/all, ciflow/bazel, ciflow/cpu, ciflow/default, ciflow/linux ✅ triggered
pytorch-linux-xenial-py3-clang5-android-ndk-r19c-gradle-custom-build-single ciflow/all, ciflow/android, ciflow/cpu, ciflow/default, ciflow/linux ✅ triggered
win-vs2019-cpu-py3 ciflow/all, ciflow/cpu, ciflow/default, ciflow/win ✅ triggered
win-vs2019-cuda11.3-py3 ciflow/all, ciflow/cuda, ciflow/default, ciflow/win ✅ triggered
Skipped Workflows
caffe2-linux-xenial-py3.6-gcc5.4 ciflow/all, ciflow/cpu, ciflow/linux 🚫 skipped
docker-builds ciflow/all 🚫 skipped
ios-12-5-1-arm64 ciflow/all, ciflow/ios, ciflow/macos 🚫 skipped
ios-12-5-1-arm64-coreml ciflow/all, ciflow/ios, ciflow/macos 🚫 skipped
ios-12-5-1-arm64-custom-ops ciflow/all, ciflow/ios, ciflow/macos 🚫 skipped
ios-12-5-1-arm64-full-jit ciflow/all, ciflow/ios, ciflow/macos 🚫 skipped
ios-12-5-1-arm64-metal ciflow/all, ciflow/ios, ciflow/macos 🚫 skipped
ios-12-5-1-x86-64 ciflow/all, ciflow/ios, ciflow/macos 🚫 skipped
ios-12-5-1-x86-64-coreml ciflow/all, ciflow/ios, ciflow/macos 🚫 skipped
ios-12-5-1-x86-64-full-jit ciflow/all, ciflow/ios, ciflow/macos 🚫 skipped
libtorch-linux-xenial-cuda10.2-py3.6-gcc7 ciflow/all, ciflow/cuda, ciflow/libtorch, ciflow/linux 🚫 skipped
libtorch-linux-xenial-cuda11.3-py3.6-gcc7 ciflow/all, ciflow/cuda, ciflow/libtorch, ciflow/linux 🚫 skipped
linux-bionic-cuda10.2-py3.9-gcc7 ciflow/all, ciflow/cuda, ciflow/linux, ciflow/slow 🚫 skipped
linux-xenial-py3-clang5-mobile-code-analysis ciflow/all, ciflow/linux, ciflow/mobile 🚫 skipped
parallelnative-linux-xenial-py3.6-gcc5.4 ciflow/all, ciflow/cpu, ciflow/linux 🚫 skipped
periodic-libtorch-linux-xenial-cuda11.1-py3.6-gcc7 ciflow/all, ciflow/cuda, ciflow/libtorch, ciflow/linux, ciflow/scheduled 🚫 skipped
periodic-linux-xenial-cuda10.2-py3-gcc7-slow-gradcheck ciflow/all, ciflow/cuda, ciflow/linux, ciflow/scheduled, ciflow/slow, ciflow/slow-gradcheck 🚫 skipped
periodic-linux-xenial-cuda11.1-py3.6-gcc7 ciflow/all, ciflow/cuda, ciflow/linux, ciflow/scheduled 🚫 skipped
periodic-win-vs2019-cuda11.1-py3 ciflow/all, ciflow/cuda, ciflow/scheduled, ciflow/win 🚫 skipped

You can add a comment to the PR and tag @pytorchbot with the following commands:
# ciflow rerun, "ciflow/default" will always be added automatically
@pytorchbot ciflow rerun

# ciflow rerun with additional labels "-l <ciflow/label_name>", which is equivalent to adding these labels manually and triggering the rerun
@pytorchbot ciflow rerun -l ciflow/scheduled -l ciflow/slow

For more information, please take a look at the CI Flow Wiki.

Contributor
facebook-github-bot commented Oct 18, 2021

🔗 Helpful links

💊 CI failures summary and remediations

As of commit 65945dd (more details on the Dr. CI page):


  • 28/28 failures introduced in this PR

🕵️ 16 new failures recognized by patterns

The following CI failures do not appear to be due to upstream breakages:

See GitHub Actions build linux-xenial-py3.6-clang7-onnx / test (default, 2, 2, linux.2xlarge) (1/16)

Step: "Unknown" (full log | diagnosis details | 🔁 rerun)

2021-11-03T14:26:34.6848585Z ERROR: pip's ... the source of the following dependency conflicts.
2021-11-03T14:26:33.2311082Z ++ stat --format %U /opt/conda/bin/pip
2021-11-03T14:26:33.2320721Z + PIP_USER=jenkins
2021-11-03T14:26:33.2324030Z ++ id -u -n
2021-11-03T14:26:33.2334621Z + CURRENT_USER=jenkins
2021-11-03T14:26:33.2335194Z + [[ jenkins = root ]]
2021-11-03T14:26:33.2335777Z + pip -q uninstall -y hypothesis
2021-11-03T14:26:33.5892595Z + pip -q uninstall -y coverage
2021-11-03T14:26:33.8818190Z WARNING: Skipping coverage as it is not installed.
2021-11-03T14:26:33.9090317Z + pip -q install attrs==18.1.0 -f https://s3.amazonaws.com/ossci-linux/wheels/attrs-18.1.0-py2.py3-none-any.whl
2021-11-03T14:26:34.3331134Z WARNING: Skipping page https://s3.amazonaws.com/ossci-linux/wheels/attrs-18.1.0-py2.py3-none-any.whl because the HEAD request got Content-Type: binary/octet-stream.The only supported Content-Type is text/html
2021-11-03T14:26:34.6848585Z ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
2021-11-03T14:26:34.6849875Z pytest 6.2.5 requires attrs>=19.2.0, but you have attrs 18.1.0 which is incompatible.
2021-11-03T14:26:34.7372244Z + pip -q install coverage==4.5.1 -f https://s3.amazonaws.com/ossci-linux/wheels/coverage-4.5.1-cp36-cp36m-macosx_10_12_x86_64.whl
2021-11-03T14:26:35.5931504Z WARNING: Skipping page https://s3.amazonaws.com/ossci-linux/wheels/coverage-4.5.1-cp36-cp36m-macosx_10_12_x86_64.whl because the HEAD request got Content-Type: binary/octet-stream.The only supported Content-Type is text/html
2021-11-03T14:26:36.0070516Z + pip -q install hypothesis==3.44.6 -f https://s3.amazonaws.com/ossci-linux/wheels/hypothesis-3.44.6-py3-none-any.whl
2021-11-03T14:26:37.0288885Z WARNING: Skipping page https://s3.amazonaws.com/ossci-linux/wheels/hypothesis-3.44.6-py3-none-any.whl because the HEAD request got Content-Type: binary/octet-stream.The only supported Content-Type is text/html
2021-11-03T14:26:37.5508995Z + EXTRA_TESTS=()
2021-11-03T14:26:37.5510222Z + [[ linux-xenial-py3.6-clang7- *-cuda* ]]
2021-11-03T14:26:37.5511377Z + [[ linux-xenial-py3.6-clang7- *-rocm* ]]
2021-11-03T14:26:37.5511994Z + rocm_ignore_test=()
2021-11-03T14:26:37.5512822Z + [[ linux-xenial-py3.6-clang7- *-rocm* ]]

See GitHub Actions build linux-xenial-py3.6-gcc5.4 / test (distributed, 1, 1, linux.2xlarge) (2/16)

Step: "Unknown" (full log | diagnosis details | 🔁 rerun)

2021-11-03T14:30:54.2941748Z test_udf_remote_...yUniqueId(created_on=0, local_id=0) to be created.
2021-11-03T14:30:15.8075424Z frame #12: c10::ThreadPool::main_loop(unsigned long) + 0x2a3 (0x7fddcbb75383 in /opt/conda/lib/python3.6/site-packages/torch/lib/libc10.so)
2021-11-03T14:30:15.8076875Z frame #13: <unknown function> + 0xc92bd (0x7fddcbaa32bd in /opt/conda/lib/libstdc++.so.6)
2021-11-03T14:30:15.8078341Z frame #14: <unknown function> + 0x76ba (0x7fdddf9136ba in /lib/x86_64-linux-gnu/libpthread.so.0)
2021-11-03T14:30:15.8079709Z frame #15: clone + 0x6d (0x7fdddf64951d in /lib/x86_64-linux-gnu/libc.so.6)
2021-11-03T14:30:15.8080315Z 
2021-11-03T14:30:16.0614670Z ok (3.316s)
2021-11-03T14:30:30.7968760Z   test_rpc_builtin_timeout (__main__.FaultyFaultyAgentRpcTest) ... ok (14.735s)
2021-11-03T14:30:39.6222403Z   test_rpc_script_timeout (__main__.FaultyFaultyAgentRpcTest) ... ok (8.826s)
2021-11-03T14:30:42.9396948Z   test_rref_to_here_timeout (__main__.FaultyFaultyAgentRpcTest) ... ok (3.317s)
2021-11-03T14:30:50.2622214Z   test_udf_remote_message_delay_timeout (__main__.FaultyFaultyAgentRpcTest) ... ok (7.322s)
2021-11-03T14:30:54.2941748Z   test_udf_remote_message_delay_timeout_to_self (__main__.FaultyFaultyAgentRpcTest) ... [E request_callback_no_python.cpp:559] Received error while processing request type 261: falseINTERNAL ASSERT FAILED at "/var/lib/jenkins/workspace/torch/csrc/distributed/rpc/rref_context.cpp":387, please report a bug to PyTorch. Expected OwnerRRef with id GloballyUniqueId(created_on=0, local_id=0) to be created.
2021-11-03T14:30:54.2944105Z Exception raised from getOwnerRRef at /var/lib/jenkins/workspace/torch/csrc/distributed/rpc/rref_context.cpp:387 (most recent call first):
2021-11-03T14:30:54.2946062Z frame #0: c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >) + 0x69 (0x7f824909f549 in /opt/conda/lib/python3.6/site-packages/torch/lib/libc10.so)
2021-11-03T14:30:54.2948268Z frame #1: c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) + 0xd2 (0x7f824909baf2 in /opt/conda/lib/python3.6/site-packages/torch/lib/libc10.so)
2021-11-03T14:30:54.2950127Z frame #2: c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) + 0x4e (0x7f824909d48e in /opt/conda/lib/python3.6/site-packages/torch/lib/libc10.so)
2021-11-03T14:30:54.2951926Z frame #3: torch::distributed::rpc::RRefContext::getOwnerRRef(torch::distributed::rpc::GloballyUniqueId const&, bool) + 0x4cb (0x7f824c782cdb in /opt/conda/lib/python3.6/site-packages/torch/lib/libtorch_cpu.so)
2021-11-03T14:30:54.2954468Z frame #4: torch::distributed::rpc::RequestCallbackNoPython::assignOwnerRRef(torch::distributed::rpc::GloballyUniqueId const&, torch::distributed::rpc::GloballyUniqueId const&, c10::intrusive_ptr<c10::ivalue::Future, c10::detail::intrusive_target_default_null_type<c10::ivalue::Future> >) const + 0x71 (0x7f824c773201 in /opt/conda/lib/python3.6/site-packages/torch/lib/libtorch_cpu.so)
2021-11-03T14:30:54.2958453Z frame #5: torch::distributed::rpc::RequestCallbackImpl::processPythonRemoteCall(torch::distributed::rpc::RpcCommandBase&, std::vector<c10::Stream, std::allocator<c10::Stream> >) const + 0xc8 (0x7f82547edf28 in /opt/conda/lib/python3.6/site-packages/torch/lib/libtorch_python.so)
2021-11-03T14:30:54.2960829Z frame #6: torch::distributed::rpc::RequestCallbackNoPython::processRpc(torch::distributed::rpc::RpcCommandBase&, torch::distributed::rpc::MessageType const&, std::vector<c10::Stream, std::allocator<c10::Stream> >) const + 0x194 (0x7f824c777a54 in /opt/conda/lib/python3.6/site-packages/torch/lib/libtorch_cpu.so)
2021-11-03T14:30:54.2963335Z frame #7: torch::distributed::rpc::RequestCallbackImpl::processRpcWithErrors(torch::distributed::rpc::RpcCommandBase&, torch::distributed::rpc::MessageType const&, std::vector<c10::Stream, std::allocator<c10::Stream> >) const + 0x65 (0x7f82547ed535 in /opt/conda/lib/python3.6/site-packages/torch/lib/libtorch_python.so)
2021-11-03T14:30:54.2964972Z frame #8: <unknown function> + 0x34995aa (0x7f824c7745aa in /opt/conda/lib/python3.6/site-packages/torch/lib/libtorch_cpu.so)

See GitHub Actions build linux-xenial-py3.6-gcc5.4 / build-docs (cpp) (3/16)

Step: "Unknown" (full log | diagnosis details | 🔁 rerun)

2021-11-03T14:44:21.3982348Z error: could not l...modules/third_party/zstd/config: Permission denied
2021-11-03T14:44:21.3844471Z http.https://github.com/.extraheader
2021-11-03T14:44:21.3854566Z error: could not lock config file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config: Permission denied
2021-11-03T14:44:21.3867131Z Entering 'third_party/tensorpipe/third_party/pybind11'
2021-11-03T14:44:21.3885207Z http.https://github.com/.extraheader
2021-11-03T14:44:21.3896220Z error: could not lock config file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config: Permission denied
2021-11-03T14:44:21.3909180Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang'
2021-11-03T14:44:21.3927271Z http.https://github.com/.extraheader
2021-11-03T14:44:21.3938338Z error: could not lock config file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config: Permission denied
2021-11-03T14:44:21.3953723Z Entering 'third_party/zstd'
2021-11-03T14:44:21.3971727Z http.https://github.com/.extraheader
2021-11-03T14:44:21.3982348Z error: could not lock config file /home/ec2-user/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/zstd/config: Permission denied
2021-11-03T14:44:21.4046309Z Cleaning up orphan processes
2021-11-03T14:44:21.4248749Z Terminate orphan process: pid (9052) (docker)

See GitHub Actions build linux-xenial-py3.6-gcc7 / test (distributed, 1, 1, linux.2xlarge) (4/16)

Step: "Unknown" (full log | diagnosis details | 🔁 rerun)

2021-11-03T14:30:24.7462379Z test_udf_remote_...yUniqueId(created_on=0, local_id=0) to be created.
2021-11-03T14:29:46.2677399Z frame #12: c10::ThreadPool::main_loop(unsigned long) + 0x2db (0x7f3a79e1ba3b in /opt/conda/lib/python3.6/site-packages/torch/lib/libc10.so)
2021-11-03T14:29:46.2678807Z frame #13: <unknown function> + 0xc9039 (0x7f3a79d47039 in /opt/conda/lib/libstdc++.so.6)
2021-11-03T14:29:46.2680268Z frame #14: <unknown function> + 0x76ba (0x7f3a8dbc06ba in /lib/x86_64-linux-gnu/libpthread.so.0)
2021-11-03T14:29:46.2681600Z frame #15: clone + 0x6d (0x7f3a8d8f651d in /lib/x86_64-linux-gnu/libc.so.6)
2021-11-03T14:29:46.2682178Z 
2021-11-03T14:29:46.5471315Z ok (3.315s)
2021-11-03T14:30:01.2817766Z   test_rpc_builtin_timeout (__main__.FaultyFaultyAgentRpcTest) ... ok (14.735s)
2021-11-03T14:30:10.1061274Z   test_rpc_script_timeout (__main__.FaultyFaultyAgentRpcTest) ... ok (8.824s)
2021-11-03T14:30:13.4228006Z   test_rref_to_here_timeout (__main__.FaultyFaultyAgentRpcTest) ... ok (3.317s)
2021-11-03T14:30:20.7438195Z   test_udf_remote_message_delay_timeout (__main__.FaultyFaultyAgentRpcTest) ... ok (7.321s)
2021-11-03T14:30:24.7462379Z   test_udf_remote_message_delay_timeout_to_self (__main__.FaultyFaultyAgentRpcTest) ... [E request_callback_no_python.cpp:559] Received error while processing request type 261: falseINTERNAL ASSERT FAILED at "/var/lib/jenkins/workspace/torch/csrc/distributed/rpc/rref_context.cpp":387, please report a bug to PyTorch. Expected OwnerRRef with id GloballyUniqueId(created_on=0, local_id=0) to be created.
2021-11-03T14:30:24.7465454Z Exception raised from getOwnerRRef at /var/lib/jenkins/workspace/torch/csrc/distributed/rpc/rref_context.cpp:387 (most recent call first):
2021-11-03T14:30:24.7468082Z frame #0: c10::Error::Error(c10::SourceLocation, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> >) + 0x6b (0x7f25c13f6cab in /opt/conda/lib/python3.6/site-packages/torch/lib/libc10.so)
2021-11-03T14:30:24.7470915Z frame #1: c10::detail::torchCheckFail(char const*, char const*, unsigned int, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) + 0xce (0x7f25c13f28fe in /opt/conda/lib/python3.6/site-packages/torch/lib/libc10.so)
2021-11-03T14:30:24.7474472Z frame #2: c10::detail::torchInternalAssertFail(char const*, char const*, unsigned int, char const*, std::__cxx11::basic_string<char, std::char_traits<char>, std::allocator<char> > const&) + 0x4e (0x7f25c13f469e in /opt/conda/lib/python3.6/site-packages/torch/lib/libc10.so)
2021-11-03T14:30:24.7477450Z frame #3: torch::distributed::rpc::RRefContext::getOwnerRRef(torch::distributed::rpc::GloballyUniqueId const&, bool) + 0x450 (0x7f25c4a9b140 in /opt/conda/lib/python3.6/site-packages/torch/lib/libtorch_cpu.so)
2021-11-03T14:30:24.7480439Z frame #4: torch::distributed::rpc::RequestCallbackNoPython::assignOwnerRRef(torch::distributed::rpc::GloballyUniqueId const&, torch::distributed::rpc::GloballyUniqueId const&, c10::intrusive_ptr<c10::ivalue::Future, c10::detail::intrusive_target_default_null_type<c10::ivalue::Future> >) const + 0x73 (0x7f25c4a8b933 in /opt/conda/lib/python3.6/site-packages/torch/lib/libtorch_cpu.so)
2021-11-03T14:30:24.7483268Z frame #5: torch::distributed::rpc::RequestCallbackImpl::processPythonRemoteCall(torch::distributed::rpc::RpcCommandBase&, std::vector<c10::Stream, std::allocator<c10::Stream> >) const + 0xcc (0x7f25ccb56d4c in /opt/conda/lib/python3.6/site-packages/torch/lib/libtorch_python.so)
2021-11-03T14:30:24.7485693Z frame #6: torch::distributed::rpc::RequestCallbackNoPython::processRpc(torch::distributed::rpc::RpcCommandBase&, torch::distributed::rpc::MessageType const&, std::vector<c10::Stream, std::allocator<c10::Stream> >) const + 0x194 (0x7f25c4a8fcd4 in /opt/conda/lib/python3.6/site-packages/torch/lib/libtorch_cpu.so)
2021-11-03T14:30:24.7488697Z frame #7: torch::distributed::rpc::RequestCallbackImpl::processRpcWithErrors(torch::distributed::rpc::RpcCommandBase&, torch::distributed::rpc::MessageType const&, std::vector<c10::Stream, std::allocator<c10::Stream> >) const + 0x65 (0x7f25ccb56465 in /opt/conda/lib/python3.6/site-packages/torch/lib/libtorch_python.so)
2021-11-03T14:30:24.7490399Z frame #8: <unknown function> + 0x3459cd3 (0x7f25c4a89cd3 in /opt/conda/lib/python3.6/site-packages/torch/lib/libtorch_cpu.so)

See GitHub Actions build linux-bionic-py3.6-clang9 / test (noarch, 1, 1, linux.2xlarge) (5/16)

Step: "Unknown" (full log | diagnosis details | 🔁 rerun)

2021-11-03T14:31:41.9460441Z test_add_done_ca...arg() takes 0 positional arguments but 1 was given
2021-11-03T14:31:41.9445866Z   /opt/conda/lib/python3.6/unittest/suite.py(122): run
2021-11-03T14:31:41.9446389Z   /opt/conda/lib/python3.6/unittest/suite.py(84): __call__
2021-11-03T14:31:41.9447131Z   /opt/conda/lib/python3.6/site-packages/xmlrunner/runner.py(66): run
2021-11-03T14:31:41.9447718Z   /opt/conda/lib/python3.6/unittest/main.py(256): runTests
2021-11-03T14:31:41.9448247Z   /opt/conda/lib/python3.6/unittest/main.py(95): __init__
2021-11-03T14:31:41.9448975Z   /opt/conda/lib/python3.6/site-packages/torch/testing/_internal/common_utils.py(608): run_tests
2021-11-03T14:31:41.9449597Z   test_futures.py(331): <module>
2021-11-03T14:31:41.9449839Z 
2021-11-03T14:31:41.9450064Z ok (0.002s)
2021-11-03T14:31:41.9455283Z   test_add_done_callback_maintains_callback_order (__main__.TestFuture) ... ok (0.001s)
2021-11-03T14:31:41.9460441Z   test_add_done_callback_no_arg_error_is_ignored (__main__.TestFuture) ... [E pybind_utils.h:201] Got the following error when running the callback: TypeError: no_arg() takes 0 positional arguments but 1 was given
2021-11-03T14:31:41.9461251Z ok (0.001s)
2021-11-03T14:31:41.9469566Z   test_add_done_callback_simple (__main__.TestFuture) ... ok (0.001s)
2021-11-03T14:31:41.9495961Z   test_chained_then (__main__.TestFuture) ... ok (0.003s)
2021-11-03T14:31:42.0514992Z   test_collect_all (__main__.TestFuture) ... ok (0.102s)
2021-11-03T14:31:42.0521021Z   test_done (__main__.TestFuture) ... ok (0.001s)
2021-11-03T14:31:42.0532891Z   test_done_exception (__main__.TestFuture) ... ok (0.001s)
2021-11-03T14:31:42.0546919Z   test_interleaving_then_and_add_done_callback_maintains_callback_order (__main__.TestFuture) ... ok (0.001s)
2021-11-03T14:31:42.0556319Z   test_interleaving_then_and_add_done_callback_propagates_error (__main__.TestFuture) ... [E pybind_utils.h:201] Got the following error when running the callback: ValueError: Expected error
2021-11-03T14:31:42.0556984Z 
2021-11-03T14:31:42.0557271Z At:

See GitHub Actions build linux-xenial-py3.6-clang7-onnx / test (default, 1, 2, linux.2xlarge) (6/16)

Step: "Unknown" (full log | diagnosis details | 🔁 rerun)

2021-11-03T14:26:42.1649157Z ERROR: pip's ... the source of the following dependency conflicts.
2021-11-03T14:26:40.6485538Z ++ stat --format %U /opt/conda/bin/pip
2021-11-03T14:26:40.6495458Z + PIP_USER=jenkins
2021-11-03T14:26:40.6498534Z ++ id -u -n
2021-11-03T14:26:40.6507211Z + CURRENT_USER=jenkins
2021-11-03T14:26:40.6507866Z + [[ jenkins = root ]]
2021-11-03T14:26:40.6508532Z + pip -q uninstall -y hypothesis
2021-11-03T14:26:41.0280018Z + pip -q uninstall -y coverage
2021-11-03T14:26:41.3301906Z WARNING: Skipping coverage as it is not installed.
2021-11-03T14:26:41.3610469Z + pip -q install attrs==18.1.0 -f https://s3.amazonaws.com/ossci-linux/wheels/attrs-18.1.0-py2.py3-none-any.whl
2021-11-03T14:26:41.7991434Z WARNING: Skipping page https://s3.amazonaws.com/ossci-linux/wheels/attrs-18.1.0-py2.py3-none-any.whl because the HEAD request got Content-Type: binary/octet-stream.The only supported Content-Type is text/html
2021-11-03T14:26:42.1649157Z ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
2021-11-03T14:26:42.1650350Z pytest 6.2.5 requires attrs>=19.2.0, but you have attrs 18.1.0 which is incompatible.
2021-11-03T14:26:42.2253856Z + pip -q install coverage==4.5.1 -f https://s3.amazonaws.com/ossci-linux/wheels/coverage-4.5.1-cp36-cp36m-macosx_10_12_x86_64.whl
2021-11-03T14:26:43.0898065Z WARNING: Skipping page https://s3.amazonaws.com/ossci-linux/wheels/coverage-4.5.1-cp36-cp36m-macosx_10_12_x86_64.whl because the HEAD request got Content-Type: binary/octet-stream.The only supported Content-Type is text/html
2021-11-03T14:26:43.5352555Z + pip -q install hypothesis==3.44.6 -f https://s3.amazonaws.com/ossci-linux/wheels/hypothesis-3.44.6-py3-none-any.whl
2021-11-03T14:26:44.5803125Z WARNING: Skipping page https://s3.amazonaws.com/ossci-linux/wheels/hypothesis-3.44.6-py3-none-any.whl because the HEAD request got Content-Type: binary/octet-stream.The only supported Content-Type is text/html
2021-11-03T14:26:45.1331567Z + EXTRA_TESTS=()
2021-11-03T14:26:45.1333107Z + [[ linux-xenial-py3.6-clang7- *-cuda* ]]
2021-11-03T14:26:45.1334373Z + [[ linux-xenial-py3.6-clang7- *-rocm* ]]
2021-11-03T14:26:45.1335122Z + rocm_ignore_test=()
2021-11-03T14:26:45.1336082Z + [[ linux-xenial-py3.6-clang7- *-rocm* ]]

See GitHub Actions build win-vs2019-cuda11.3-py3 / build (7/16)

Step: "Unknown" (full log | diagnosis details | 🔁 rerun)

2021-11-03T14:16:28.7464935Z ls: cannot access ...d/win_tmp/ci_scripts/*': No such file or directory
2021-11-03T14:16:28.0803255Z ++ cygpath -w /c/actions-runner/_work/pytorch/pytorch/build/win_tmp
2021-11-03T14:16:28.1775147Z + TMP_DIR_WIN='C:\actions-runner\_work\pytorch\pytorch\build\win_tmp'
2021-11-03T14:16:28.1775672Z + export TMP_DIR_WIN
2021-11-03T14:16:28.1776127Z + export PYTORCH_FINAL_PACKAGE_DIR=/c/1417084618/build-results/
2021-11-03T14:16:28.1776651Z + PYTORCH_FINAL_PACKAGE_DIR=/c/1417084618/build-results/
2021-11-03T14:16:28.1777077Z + [[ -n /c/1417084618/build-results/ ]]
2021-11-03T14:16:28.1777482Z + mkdir -p /c/1417084618/build-results/
2021-11-03T14:16:28.5143248Z + CI_SCRIPTS_DIR=/c/actions-runner/_work/pytorch/pytorch/build/win_tmp/ci_scripts
2021-11-03T14:16:28.5144074Z + mkdir -p /c/actions-runner/_work/pytorch/pytorch/build/win_tmp/ci_scripts
2021-11-03T14:16:28.5338132Z ++ ls '/c/actions-runner/_work/pytorch/pytorch/build/win_tmp/ci_scripts/*'
2021-11-03T14:16:28.7464935Z ls: cannot access '/c/actions-runner/_work/pytorch/pytorch/build/win_tmp/ci_scripts/*': No such file or directory
2021-11-03T14:16:28.7468296Z + '[' -n '' ']'
2021-11-03T14:16:28.7469055Z + export SCRIPT_HELPERS_DIR=/c/actions-runner/_work/pytorch/pytorch/.jenkins/pytorch/win-test-helpers
2021-11-03T14:16:28.7469908Z + SCRIPT_HELPERS_DIR=/c/actions-runner/_work/pytorch/pytorch/.jenkins/pytorch/win-test-helpers
2021-11-03T14:16:28.7470424Z + set +ex
2021-11-03T14:16:38.2948151Z + /c/actions-runner/_work/pytorch/pytorch/.jenkins/pytorch/win-test-helpers/build_pytorch.bat
2021-11-03T14:16:38.3202011Z 
2021-11-03T14:16:38.3202860Z C:\actions-runner\_work\pytorch\pytorch>if "" == "1" (set BUILD_TYPE=debug )  ELSE (set BUILD_TYPE=release ) 
2021-11-03T14:16:38.3206187Z 
2021-11-03T14:16:38.3208321Z C:\actions-runner\_work\pytorch\pytorch>set PATH=C:\Program Files\CMake\bin;C:\Program Files\7-Zip;C:\ProgramData\chocolatey\bin;C:\Program Files\Git\cmd;C:\Program Files\Amazon\AWSCLI;C:\Program Files\Amazon\AWSCLI\bin;C:\Windows\system32;C:\Windows;C:\Windows\System32\Wbem;C:\Windows\System32\WindowsPowerShell\v1.0;C:\Windows\System32\OpenSSH;C:\Program Files\Amazon\cfn-bootstrap;C:\ProgramData\chocolatey\bin;C:\Program Files\Amazon\AWSCLIV2;C:\Program Files\Git\cmd;C:\Program Files\Git\mingw64\bin;C:\Program Files\Git\usr\bin;C:\Program Files (x86)\Windows Kits\10\Windows Performance Toolkit;C:\Users\runneruser\AppData\Local\Microsoft\WindowsApps 
2021-11-03T14:16:38.3210490Z 

See GitHub Actions build linux-xenial-py3.6-gcc5.4 / test (default, 2, 2, linux.2xlarge) (8/16)

Step: "Unknown" (full log | diagnosis details | 🔁 rerun)

2021-11-03T14:43:43.0524617Z test_add_done_ca...arg() takes 0 positional arguments but 1 was given
2021-11-03T14:43:43.0509174Z   /opt/conda/lib/python3.6/unittest/suite.py(122): run
2021-11-03T14:43:43.0509768Z   /opt/conda/lib/python3.6/unittest/suite.py(84): __call__
2021-11-03T14:43:43.0510456Z   /opt/conda/lib/python3.6/site-packages/xmlrunner/runner.py(66): run
2021-11-03T14:43:43.0511041Z   /opt/conda/lib/python3.6/unittest/main.py(256): runTests
2021-11-03T14:43:43.0511544Z   /opt/conda/lib/python3.6/unittest/main.py(95): __init__
2021-11-03T14:43:43.0512303Z   /opt/conda/lib/python3.6/site-packages/torch/testing/_internal/common_utils.py(608): run_tests
2021-11-03T14:43:43.0512876Z   test_futures.py(331): <module>
2021-11-03T14:43:43.0513127Z 
2021-11-03T14:43:43.0513394Z ok (0.002s)
2021-11-03T14:43:43.0519340Z   test_add_done_callback_maintains_callback_order (__main__.TestFuture) ... ok (0.001s)
2021-11-03T14:43:43.0524617Z   test_add_done_callback_no_arg_error_is_ignored (__main__.TestFuture) ... [E pybind_utils.h:201] Got the following error when running the callback: TypeError: no_arg() takes 0 positional arguments but 1 was given
2021-11-03T14:43:43.0525400Z ok (0.001s)
2021-11-03T14:43:43.0534020Z   test_add_done_callback_simple (__main__.TestFuture) ... ok (0.001s)
2021-11-03T14:43:43.0561242Z   test_chained_then (__main__.TestFuture) ... ok (0.003s)
2021-11-03T14:43:43.1580069Z   test_collect_all (__main__.TestFuture) ... ok (0.102s)
2021-11-03T14:43:43.1587140Z   test_done (__main__.TestFuture) ... ok (0.001s)
2021-11-03T14:43:43.1597663Z   test_done_exception (__main__.TestFuture) ... ok (0.001s)
2021-11-03T14:43:43.1611117Z   test_interleaving_then_and_add_done_callback_maintains_callback_order (__main__.TestFuture) ... ok (0.001s)
2021-11-03T14:43:43.1619866Z   test_interleaving_then_and_add_done_callback_propagates_error (__main__.TestFuture) ... [E pybind_utils.h:201] Got the following error when running the callback: ValueError: Expected error
2021-11-03T14:43:43.1621050Z 
2021-11-03T14:43:43.1621438Z At:

See GitHub Actions build linux-bionic-py3.6-clang9 / test (default, 1, 2, linux.2xlarge) (9/16)

Step: "Unknown" (full log | diagnosis details | 🔁 rerun)

2021-11-03T14:38:34.1528214Z Intel MKL ERROR: Parameter 4 was incorrect on entry to DLASCL.
2021-11-03T14:38:34.1147653Z 
2021-11-03T14:38:34.1148844Z Intel MKL ERROR: Parameter 4 was incorrect on entry to SLASCL.
2021-11-03T14:38:34.1195936Z ok (0.037s)
2021-11-03T14:38:34.1351982Z   test_svd_errors_and_warnings_cpu_float64 (__main__.TestLinalgCPU) ... 
2021-11-03T14:38:34.1353006Z Intel MKL ERROR: Parameter 4 was incorrect on entry to DLASCL.
2021-11-03T14:38:34.1353602Z 
2021-11-03T14:38:34.1354278Z Intel MKL ERROR: Parameter 4 was incorrect on entry to DLASCL.
2021-11-03T14:38:34.1525988Z 
2021-11-03T14:38:34.1526938Z Intel MKL ERROR: Parameter 4 was incorrect on entry to DLASCL.
2021-11-03T14:38:34.1527534Z 
2021-11-03T14:38:34.1528214Z Intel MKL ERROR: Parameter 4 was incorrect on entry to DLASCL.
2021-11-03T14:38:34.1575315Z ok (0.038s)
2021-11-03T14:38:37.2956861Z   test_svd_lowrank_cpu_float64 (__main__.TestLinalgCPU) ... ok (3.138s)
2021-11-03T14:38:37.7490313Z   test_svd_memory_allocation_cpu_complex128 (__main__.TestLinalgCPU) ... test_linalg.py:3026: UserWarning: An output with one or more elements was resized since it had shape [3, 3], which does not match the required output shape [3].This behavior is deprecated, and in a future PyTorch release outputs will not be resized unless they have zero elements. You can explicitly reuse an out tensor t by resizing it, inplace, to zero elements with t.resize_(0). (Triggered internally at  /var/lib/jenkins/workspace/aten/src/ATen/native/Resize.cpp:23.)
2021-11-03T14:38:37.7493636Z   torch.linalg.svdvals(a, out=out0)
2021-11-03T14:38:37.9215671Z test_linalg.py:3027: UserWarning: An output with one or more elements was resized since it had shape [3], which does not match the required output shape [3, 3].This behavior is deprecated, and in a future PyTorch release outputs will not be resized unless they have zero elements. You can explicitly reuse an out tensor t by resizing it, inplace, to zero elements with t.resize_(0). (Triggered internally at  /var/lib/jenkins/workspace/aten/src/ATen/native/Resize.cpp:23.)
2021-11-03T14:38:37.9218715Z   torch.linalg.svd(a, full_matrices=False, out=(out0, out1, out2))
2021-11-03T14:38:37.9270524Z ok (0.631s)
2021-11-03T14:38:38.2740435Z   test_svd_memory_allocation_cpu_complex64 (__main__.TestLinalgCPU) ... ok (0.347s)
2021-11-03T14:38:38.4223450Z   test_svd_memory_allocation_cpu_float32 (__main__.TestLinalgCPU) ... ok (0.148s)
2021-11-03T14:38:38.6334297Z   test_svd_memory_allocation_cpu_float64 (__main__.TestLinalgCPU) ... ok (0.211s)

See GitHub Actions build win-vs2019-cpu-py3 / build (10/16)

Step: "Unknown" (full log | diagnosis details | 🔁 rerun)

2021-11-03T14:16:31.3859073Z ls: cannot access ...d/win_tmp/ci_scripts/*': No such file or directory
2021-11-03T14:16:31.0079370Z ++ cygpath -w /c/actions-runner/_work/pytorch/pytorch/build/win_tmp
2021-11-03T14:16:31.1117302Z + TMP_DIR_WIN='C:\actions-runner\_work\pytorch\pytorch\build\win_tmp'
2021-11-03T14:16:31.1117869Z + export TMP_DIR_WIN
2021-11-03T14:16:31.1118316Z + export PYTORCH_FINAL_PACKAGE_DIR=/c/1417084628/build-results/
2021-11-03T14:16:31.1118855Z + PYTORCH_FINAL_PACKAGE_DIR=/c/1417084628/build-results/
2021-11-03T14:16:31.1119305Z + [[ -n /c/1417084628/build-results/ ]]
2021-11-03T14:16:31.1119875Z + mkdir -p /c/1417084628/build-results/
2021-11-03T14:16:31.2147624Z + CI_SCRIPTS_DIR=/c/actions-runner/_work/pytorch/pytorch/build/win_tmp/ci_scripts
2021-11-03T14:16:31.2148376Z + mkdir -p /c/actions-runner/_work/pytorch/pytorch/build/win_tmp/ci_scripts
2021-11-03T14:16:31.2343491Z ++ ls '/c/actions-runner/_work/pytorch/pytorch/build/win_tmp/ci_scripts/*'
2021-11-03T14:16:31.3859073Z ls: cannot access '/c/actions-runner/_work/pytorch/pytorch/build/win_tmp/ci_scripts/*': No such file or directory
2021-11-03T14:16:31.3862179Z + '[' -n '' ']'
2021-11-03T14:16:31.3862895Z + export SCRIPT_HELPERS_DIR=/c/actions-runner/_work/pytorch/pytorch/.jenkins/pytorch/win-test-helpers
2021-11-03T14:16:31.3863819Z + SCRIPT_HELPERS_DIR=/c/actions-runner/_work/pytorch/pytorch/.jenkins/pytorch/win-test-helpers
2021-11-03T14:16:31.3864349Z + set +ex
2021-11-03T14:16:41.1842041Z + /c/actions-runner/_work/pytorch/pytorch/.jenkins/pytorch/win-test-helpers/build_pytorch.bat
2021-11-03T14:16:41.2121491Z 
2021-11-03T14:16:41.2122424Z C:\actions-runner\_work\pytorch\pytorch>if "" == "1" (set BUILD_TYPE=debug )  ELSE (set BUILD_TYPE=release ) 
2021-11-03T14:16:41.2125590Z 
2021-11-03T14:16:41.2127770Z C:\actions-runner\_work\pytorch\pytorch>set PATH=C:\Program Files\CMake\bin;C:\Program Files\7-Zip;C:\ProgramData\chocolatey\bin;C:\Program Files\Git\cmd;C:\Program Files\Amazon\AWSCLI;C:\Program Files\Amazon\AWSCLI\bin;C:\Windows\system32;C:\Windows;C:\Windows\System32\Wbem;C:\Windows\System32\WindowsPowerShell\v1.0;C:\Windows\System32\OpenSSH;C:\Program Files\Amazon\cfn-bootstrap;C:\ProgramData\chocolatey\bin;C:\Program Files\Amazon\AWSCLIV2;C:\Program Files\Git\cmd;C:\Program Files\Git\mingw64\bin;C:\Program Files\Git\usr\bin;C:\Program Files (x86)\Windows Kits\10\Windows Performance Toolkit;C:\Users\runneruser\AppData\Local\Microsoft\WindowsApps 
2021-11-03T14:16:41.2130122Z 

See GitHub Actions build linux-xenial-py3.6-clang7-asan / test (default, 1, 2, linux.2xlarge) (11/16)

Step: "Unknown" (full log | diagnosis details | 🔁 rerun)

2021-11-03T14:25:51.9365179Z SUMMARY: Undefined.../jenkins/workspace/aten/src/ATen/Utils.cpp:20:3 in
2021-11-03T14:25:51.8867762Z     #9 0x55602afb38f2 in PyEval_EvalCode /home/builder/ktietz/cos6/ci_cos6/python_1622833237666/work/Python/ceval.c:731
2021-11-03T14:25:51.8869009Z     #10 0x55602b01bcd5 in run_mod /home/builder/ktietz/cos6/ci_cos6/python_1622833237666/work/Python/pythonrun.c:1025
2021-11-03T14:25:51.8870579Z     #11 0x55602b01dd5d in PyRun_StringFlags /home/builder/ktietz/cos6/ci_cos6/python_1622833237666/work/Python/pythonrun.c:949
2021-11-03T14:25:51.8872350Z     #12 0x55602b01ddbb in PyRun_SimpleStringFlags /home/builder/ktietz/cos6/ci_cos6/python_1622833237666/work/Python/pythonrun.c:445
2021-11-03T14:25:51.8873712Z     #13 0x55602b01e926 in run_command /home/builder/ktietz/cos6/ci_cos6/python_1622833237666/work/Modules/main.c:301
2021-11-03T14:25:51.8874891Z     #14 0x55602b01e926 in Py_Main /home/builder/ktietz/cos6/ci_cos6/python_1622833237666/work/Modules/main.c:749
2021-11-03T14:25:51.8876046Z     #15 0x55602af58196 in main /home/builder/ktietz/cos6/ci_cos6/python_1622833237666/work/Programs/python.c:69
2021-11-03T14:25:51.9363207Z     #16 0x7f940884883f in __libc_start_main /build/glibc-S7Ft5T/glibc-2.23/csu/../csu/libc-start.c:291
2021-11-03T14:25:51.9363977Z     #17 0x55602afe833d in _start (/opt/conda/bin/python3.6+0x1a733d)
2021-11-03T14:25:51.9364320Z 
2021-11-03T14:25:51.9365179Z SUMMARY: UndefinedBehaviorSanitizer: undefined-behavior /var/lib/jenkins/workspace/aten/src/ATen/Utils.cpp:20:3 in 
2021-11-03T14:25:51.9553808Z + retcode=1
2021-11-03T14:25:51.9554377Z + set -e
2021-11-03T14:25:51.9554666Z + return 1
2021-11-03T14:25:51.9557991Z + [[ linux-xenial-py3.6-clang7-asan-default == *-NO_AVX-* ]]
2021-11-03T14:25:51.9558639Z + [[ default == \n\o\g\p\u\_\N\O\_\A\V\X ]]
2021-11-03T14:25:51.9559429Z + [[ linux-xenial-py3.6-clang7-asan-default == *-NO_AVX2-* ]]
2021-11-03T14:25:51.9560058Z + [[ default == \n\o\g\p\u\_\N\O\_\A\V\X\2 ]]
2021-11-03T14:25:51.9560979Z + [[ linux-xenial-py3.6-clang7-asan-default == *-NO_AVX512-* ]]
2021-11-03T14:25:51.9562045Z + [[ default == \n\o\g\p\u\_\N\O\_\A\V\X\5\1\2 ]]
2021-11-03T14:25:51.9562386Z ++ mktemp

See GitHub Actions build linux-xenial-py3.6-gcc7 / test (default, 2, 2, linux.2xlarge) (12/16)

Step: "Unknown" (full log | diagnosis details | 🔁 rerun)

2021-11-03T14:36:23.7443537Z Intel MKL ERROR: Parameter 4 was incorrect on entry to DLASCL.
2021-11-03T14:36:23.7049086Z 
2021-11-03T14:36:23.7049834Z Intel MKL ERROR: Parameter 4 was incorrect on entry to SLASCL.
2021-11-03T14:36:23.7100538Z ok (0.039s)
2021-11-03T14:36:23.7263315Z   test_svd_errors_and_warnings_cpu_float64 (__main__.TestLinalgCPU) ... 
2021-11-03T14:36:23.7264271Z Intel MKL ERROR: Parameter 4 was incorrect on entry to DLASCL.
2021-11-03T14:36:23.7264630Z 
2021-11-03T14:36:23.7265031Z Intel MKL ERROR: Parameter 4 was incorrect on entry to DLASCL.
2021-11-03T14:36:23.7442042Z 
2021-11-03T14:36:23.7442751Z Intel MKL ERROR: Parameter 4 was incorrect on entry to DLASCL.
2021-11-03T14:36:23.7443122Z 
2021-11-03T14:36:23.7443537Z Intel MKL ERROR: Parameter 4 was incorrect on entry to DLASCL.
2021-11-03T14:36:23.7491939Z ok (0.039s)
2021-11-03T14:36:27.0425453Z   test_svd_lowrank_cpu_float64 (__main__.TestLinalgCPU) ... ok (3.293s)
2021-11-03T14:36:27.5813080Z   test_svd_memory_allocation_cpu_complex128 (__main__.TestLinalgCPU) ... test_linalg.py:3026: UserWarning: An output with one or more elements was resized since it had shape [3, 3], which does not match the required output shape [3].This behavior is deprecated, and in a future PyTorch release outputs will not be resized unless they have zero elements. You can explicitly reuse an out tensor t by resizing it, inplace, to zero elements with t.resize_(0). (Triggered internally at  /var/lib/jenkins/workspace/aten/src/ATen/native/Resize.cpp:23.)
2021-11-03T14:36:27.5814973Z   torch.linalg.svdvals(a, out=out0)
2021-11-03T14:36:27.7772456Z test_linalg.py:3027: UserWarning: An output with one or more elements was resized since it had shape [3], which does not match the required output shape [3, 3].This behavior is deprecated, and in a future PyTorch release outputs will not be resized unless they have zero elements. You can explicitly reuse an out tensor t by resizing it, inplace, to zero elements with t.resize_(0). (Triggered internally at  /var/lib/jenkins/workspace/aten/src/ATen/native/Resize.cpp:23.)
2021-11-03T14:36:27.7774986Z   torch.linalg.svd(a, full_matrices=False, out=(out0, out1, out2))
2021-11-03T14:36:27.7845725Z ok (0.742s)
2021-11-03T14:36:28.1632730Z   test_svd_memory_allocation_cpu_complex64 (__main__.TestLinalgCPU) ... ok (0.378s)
2021-11-03T14:36:28.3507805Z   test_svd_memory_allocation_cpu_float32 (__main__.TestLinalgCPU) ... ok (0.187s)
2021-11-03T14:36:28.5805419Z   test_svd_memory_allocation_cpu_float64 (__main__.TestLinalgCPU) ... ok (0.230s)

See GitHub Actions build linux-xenial-py3.6-clang7-asan / test (default, 2, 2, linux.2xlarge) (13/16)

Step: "Unknown" (full log | diagnosis details | 🔁 rerun)

2021-11-03T14:25:52.0715318Z SUMMARY: Undefined.../jenkins/workspace/aten/src/ATen/Utils.cpp:20:3 in
2021-11-03T14:25:52.0180538Z     #9 0x5618f9e6a8f2 in PyEval_EvalCode /home/builder/ktietz/cos6/ci_cos6/python_1622833237666/work/Python/ceval.c:731
2021-11-03T14:25:52.0181287Z     #10 0x5618f9ed2cd5 in run_mod /home/builder/ktietz/cos6/ci_cos6/python_1622833237666/work/Python/pythonrun.c:1025
2021-11-03T14:25:52.0182080Z     #11 0x5618f9ed4d5d in PyRun_StringFlags /home/builder/ktietz/cos6/ci_cos6/python_1622833237666/work/Python/pythonrun.c:949
2021-11-03T14:25:52.0183212Z     #12 0x5618f9ed4dbb in PyRun_SimpleStringFlags /home/builder/ktietz/cos6/ci_cos6/python_1622833237666/work/Python/pythonrun.c:445
2021-11-03T14:25:52.0184035Z     #13 0x5618f9ed5926 in run_command /home/builder/ktietz/cos6/ci_cos6/python_1622833237666/work/Modules/main.c:301
2021-11-03T14:25:52.0184740Z     #14 0x5618f9ed5926 in Py_Main /home/builder/ktietz/cos6/ci_cos6/python_1622833237666/work/Modules/main.c:749
2021-11-03T14:25:52.0185439Z     #15 0x5618f9e0f196 in main /home/builder/ktietz/cos6/ci_cos6/python_1622833237666/work/Programs/python.c:69
2021-11-03T14:25:52.0713185Z     #16 0x7f0c34ad483f in __libc_start_main /build/glibc-S7Ft5T/glibc-2.23/csu/../csu/libc-start.c:291
2021-11-03T14:25:52.0714103Z     #17 0x5618f9e9f33d in _start (/opt/conda/bin/python3.6+0x1a733d)
2021-11-03T14:25:52.0714444Z 
2021-11-03T14:25:52.0715318Z SUMMARY: UndefinedBehaviorSanitizer: undefined-behavior /var/lib/jenkins/workspace/aten/src/ATen/Utils.cpp:20:3 in 
2021-11-03T14:25:52.0896938Z + retcode=1
2021-11-03T14:25:52.0897832Z + set -e
2021-11-03T14:25:52.0898226Z + return 1
2021-11-03T14:25:52.0900261Z + [[ linux-xenial-py3.6-clang7-asan-default == *-NO_AVX-* ]]
2021-11-03T14:25:52.0901032Z + [[ default == \n\o\g\p\u\_\N\O\_\A\V\X ]]
2021-11-03T14:25:52.0901818Z + [[ linux-xenial-py3.6-clang7-asan-default == *-NO_AVX2-* ]]
2021-11-03T14:25:52.0902730Z + [[ default == \n\o\g\p\u\_\N\O\_\A\V\X\2 ]]
2021-11-03T14:25:52.0903940Z + [[ linux-xenial-py3.6-clang7-asan-default == *-NO_AVX512-* ]]
2021-11-03T14:25:52.0904590Z + [[ default == \n\o\g\p\u\_\N\O\_\A\V\X\5\1\2 ]]
2021-11-03T14:25:52.0904923Z ++ mktemp

See GitHub Actions build linux-xenial-py3.6-gcc5.4 / test (default, 1, 2, linux.2xlarge) (14/16)

Step: "Unknown" (full log | diagnosis details | 🔁 rerun)

2021-11-03T14:40:56.2115743Z Intel MKL ERROR: Parameter 4 was incorrect on entry to DLASCL.
2021-11-03T14:40:56.1696844Z 
2021-11-03T14:40:56.1697756Z Intel MKL ERROR: Parameter 4 was incorrect on entry to SLASCL.
2021-11-03T14:40:56.1748448Z ok (0.041s)
2021-11-03T14:40:56.1923457Z   test_svd_errors_and_warnings_cpu_float64 (__main__.TestLinalgCPU) ... 
2021-11-03T14:40:56.1924216Z Intel MKL ERROR: Parameter 4 was incorrect on entry to DLASCL.
2021-11-03T14:40:56.1924578Z 
2021-11-03T14:40:56.1925009Z Intel MKL ERROR: Parameter 4 was incorrect on entry to DLASCL.
2021-11-03T14:40:56.2114014Z 
2021-11-03T14:40:56.2114734Z Intel MKL ERROR: Parameter 4 was incorrect on entry to DLASCL.
2021-11-03T14:40:56.2115286Z 
2021-11-03T14:40:56.2115743Z Intel MKL ERROR: Parameter 4 was incorrect on entry to DLASCL.
2021-11-03T14:40:56.2167213Z ok (0.042s)
2021-11-03T14:40:59.4752801Z   test_svd_lowrank_cpu_float64 (__main__.TestLinalgCPU) ... ok (3.258s)
2021-11-03T14:40:59.9987794Z   test_svd_memory_allocation_cpu_complex128 (__main__.TestLinalgCPU) ... test_linalg.py:3026: UserWarning: An output with one or more elements was resized since it had shape [3, 3], which does not match the required output shape [3].This behavior is deprecated, and in a future PyTorch release outputs will not be resized unless they have zero elements. You can explicitly reuse an out tensor t by resizing it, inplace, to zero elements with t.resize_(0). (Triggered internally at  /var/lib/jenkins/workspace/aten/src/ATen/native/Resize.cpp:23.)
2021-11-03T14:40:59.9990356Z   torch.linalg.svdvals(a, out=out0)
2021-11-03T14:41:00.1942132Z test_linalg.py:3027: UserWarning: An output with one or more elements was resized since it had shape [3], which does not match the required output shape [3, 3].This behavior is deprecated, and in a future PyTorch release outputs will not be resized unless they have zero elements. You can explicitly reuse an out tensor t by resizing it, inplace, to zero elements with t.resize_(0). (Triggered internally at  /var/lib/jenkins/workspace/aten/src/ATen/native/Resize.cpp:23.)
2021-11-03T14:41:00.1944522Z   torch.linalg.svd(a, full_matrices=False, out=(out0, out1, out2))
2021-11-03T14:41:00.2019233Z ok (0.727s)
2021-11-03T14:41:00.5531554Z   test_svd_memory_allocation_cpu_complex64 (__main__.TestLinalgCPU) ... ok (0.351s)
2021-11-03T14:41:00.7091082Z   test_svd_memory_allocation_cpu_float32 (__main__.TestLinalgCPU) ... ok (0.156s)
2021-11-03T14:41:00.9142584Z   test_svd_memory_allocation_cpu_float64 (__main__.TestLinalgCPU) ... ok (0.205s)

See CircleCI build pytorch_linux_xenial_py3_6_gcc5_4_build (15/16)

Step: "(Optional) Merge target branch" (full log | diagnosis details | 🔁 rerun)

Automatic merge failed; fix conflicts and then commit the result.
  git switch -

Turn off this advice by setting config variable advice.detachedHead to false

HEAD is now at 65945ddbc7 Update on "Remove native_functions.yaml dependency from TensorTopK.cu"
+ git reset --hard 65945ddbc7eb202f9b747110ab90a6b63154a1f7
HEAD is now at 65945ddbc7 Update on "Remove native_functions.yaml dependency from TensorTopK.cu"
+ git merge --allow-unrelated-histories --no-edit --no-ff e32d7f7525fded2c044dd690d3a1f52aa69ae79e
Auto-merging caffe2/CMakeLists.txt
CONFLICT (content): Merge conflict in caffe2/CMakeLists.txt
Automatic merge failed; fix conflicts and then commit the result.


Exited with code exit status 1

See CircleCI build pytorch_xla_linux_bionic_py3_6_clang9_build (16/16)

Step: "(Optional) Merge target branch" (full log | diagnosis details | 🔁 rerun)

Automatic merge failed; fix conflicts and then commit the result.
  git switch -

Turn off this advice by setting config variable advice.detachedHead to false

HEAD is now at 65945ddbc7 Update on "Remove native_functions.yaml dependency from TensorTopK.cu"
+ git reset --hard 65945ddbc7eb202f9b747110ab90a6b63154a1f7
HEAD is now at 65945ddbc7 Update on "Remove native_functions.yaml dependency from TensorTopK.cu"
+ git merge --allow-unrelated-histories --no-edit --no-ff e32d7f7525fded2c044dd690d3a1f52aa69ae79e
Auto-merging caffe2/CMakeLists.txt
CONFLICT (content): Merge conflict in caffe2/CMakeLists.txt
Automatic merge failed; fix conflicts and then commit the result.


Exited with code exit status 1


12 failures not recognized by patterns:

| Job | Step | Action |
| --- | --- | --- |
| CircleCI pytorch_linux_xenial_py3_clang5_android_ndk_r19c_x86_32_build | Build | 🔁 rerun |
| CircleCI pytorch-linux-xenial-py3-clang5-android-ndk-r19c-gradle-custom-build-single-full-jit | pytorch android gradle custom build single architecture (for PR) | 🔁 rerun |
| CircleCI pytorch_ios_12_5_1_x86_64_full_jit_build | Build | 🔁 rerun |
| CircleCI pytorch_macos_10_13_py3_test | Test | 🔁 rerun |
| CircleCI pytorch_ios_12_5_1_x86_64_build | Build | 🔁 rerun |
| CircleCI pytorch_macos_10_13_py3_lite_interpreter_build_test | Test | 🔁 rerun |
| CircleCI pytorch_ios_12_5_1_x86_64_coreml_build | Build | 🔁 rerun |
| GitHub Actions linux-bionic-py3.6-clang9 / test (default, 2, 2, linux.2xlarge) | Unknown | 🔁 rerun |
| GitHub Actions linux-xenial-cuda11.3-py3.6-gcc7 / test (distributed, 1, 1, linux.8xlarge.nvidia.gpu) | Unknown | 🔁 rerun |
| GitHub Actions linux-xenial-cuda11.3-py3.6-gcc7 / test (default, 1, 2, linux.4xlarge.nvidia.gpu) | Unknown | 🔁 rerun |
| GitHub Actions linux-xenial-cuda11.3-py3.6-gcc7 / test (default, 2, 2, linux.4xlarge.nvidia.gpu) | Unknown | 🔁 rerun |
| GitHub Actions linux-xenial-py3.6-gcc7 / test (default, 1, 2, linux.2xlarge) | Unknown | 🔁 rerun |

This comment was automatically generated by Dr. CI (expand for details).

Please report bugs/suggestions to the (internal) Dr. CI Users group.

peterbell10 added a commit that referenced this pull request Oct 18, 2021
@peterbell10 peterbell10 requested a review from dagitses October 18, 2021 23:02
@dagitses
Collaborator

@dagitses has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@dagitses
Collaborator
dagitses commented Nov 2, 2021

@dagitses has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@dagitses
Collaborator
dagitses commented Nov 3, 2021

@dagitses has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.


void launch_gather_topk_kernel(
const TensorBase& self, int64_t k, int64_t dim, bool largest, bool sorted,
const TensorBase& values, const TensorBase& indices) {
int numDims = self.dim();
Collaborator

Out of curiosity, what are the criteria by which these four lines remain in the .cu but the if (k == 0) moves?

My understanding of your work is that you are prioritizing the following:

  1. pull out dependencies on Tensor and ATen.h from .cu files.
  2. less important but where possible, extract other unnecessary dependencies from .cu files
  3. take advantage of other opportunities to move code out of .cu files

Is that roughly the prioritization of your approach here? Under that ordering, moving this block and the k == 0 check both fit under 3) as the lowest priority.

Reordering the k == 0 check does change the behavior since it now avoids the check about having too many dimensions. Is that OK? FWIW, I like the idea of being stringent on inputs rather than letting a loophole like this let the user get away with an invalid input.

Changing topic altogether, do you think that splitting code up this way causes any meaningful harm by creating cross-module optimization barriers?

Collaborator Author

Out of curiosity, what are the criteria by which these four lines remain in the .cu but the if (k == 0) moves?

The return statement must go in the function in the .cpp file so that we don't launch the sorting kernels. It would make sense to keep the MAX_DIMS checks together with it, but MAX_DIMS is defined in a .cuh header file, so it needs nvcc:

constexpr int MAX_DIMS = 25;
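
To make the split concrete, here is a minimal, self-contained sketch of the arrangement described above. It is not the code from this PR: `FakeTensor`, `topk_out_sketch`, `launch_gather_topk_kernel_sketch`, and `g_launch_count` are hypothetical names invented for illustration, and the "kernel launch" is just a counter so the sketch builds with a plain C++ compiler.

```cpp
#include <cassert>
#include <cstdint>
#include <stdexcept>
#include <vector>

// Hypothetical stand-in for a tensor; only the shape matters in this sketch.
struct FakeTensor {
  std::vector<int64_t> sizes;
  int64_t dim() const { return static_cast<int64_t>(sizes.size()); }
};

// Mirrors the constant quoted above. In the real code it lives in a .cuh
// header, which is why the check that uses it stays on the nvcc side.
constexpr int MAX_DIMS = 25;

// Counts simulated kernel launches so the early return is observable.
static int g_launch_count = 0;

// Would live in the .cu file (assumption: simplified signature).
void launch_gather_topk_kernel_sketch(const FakeTensor& self, int64_t k) {
  assert(k > 0);  // the .cpp wrapper has already returned for k == 0
  if (self.dim() > MAX_DIMS) {
    throw std::runtime_error("input tensor has too many dimensions");
  }
  ++g_launch_count;  // stands in for the actual kernel launch
}

// Would live in the .cpp file: the early return must happen here so that
// no kernel is launched at all when k == 0.
void topk_out_sketch(const FakeTensor& self, int64_t k) {
  if (k == 0) {
    return;
  }
  launch_gather_topk_kernel_sketch(self, k);
}
```

With this ordering, a call with `k == 0` returns before the `MAX_DIMS` check is ever reached, even for an input with more than 25 dimensions. That is the behavior change raised in the review question.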

My understanding of your work is that you are prioritizing the following: [...]

This is mostly right, although .cu isn't actually in my criteria anywhere. I'm currently focusing on files that depend on native_functions.yaml, prioritized by their compile time (to maximize impact). It just so happens that CUDA code is much slower to compile, so the top of the list is all CUDA files. Somewhat interestingly, GridSample.cpp was above GridSample.cu in compile time, which is why that PR changes both.

The top of the list at the moment looks like this:

caffe2/CMakeFiles/torch_cuda_cu.dir/__/aten/src/ATen/native/cuda/DistributionBernoulli.cu.o
     1m 13s 344ms
caffe2/CMakeFiles/torch_cuda_cu.dir/__/aten/src/ATen/native/cuda/ForeachUnaryOp.cu.o
     1m 12s 425ms
caffe2/CMakeFiles/torch_cuda_cu.dir/__/aten/src/ATen/native/cuda/Distributions.cu.o
     1m 8s 582ms
caffe2/CMakeFiles/torch_cuda_cu.dir/__/aten/src/ATen/native/cuda/Indexing.cu.o
     1m 7s 595ms
caffe2/CMakeFiles/torch_cuda_cu.dir/__/aten/src/ATen/native/cuda/group_norm_kernel.cu.o
     0m 59s 931ms
caffe2/CMakeFiles/torch_cuda_cu.dir/__/aten/src/ATen/RegisterCUDA.cpp.o
     0m 55s 472ms

FWIW, I like the idea of being stringent on inputs rather than letting a loophole like this let the user get away with an invalid input.

I wouldn't say that applies here, since MAX_DIMS is an implementation limitation, not an invalid input. If, for example, matmul allowed empty tensors to have a shape mismatch, then I would agree.

Changing topic altogether, do you think that splitting code up this way causes any meaningful harm by creating cross-module optimization barriers?

I don't think there's much the compiler can do here, but things like calling the same tensor method in both functions will have some impact (especially for virtual methods). However, generally speaking, the heavy lifting of these functions is done in the CUDA runtime to actually launch the kernel. So, if there is any slowdown, I expect it to be fairly minimal.

@dagitses
Collaborator
dagitses commented Nov 3, 2021

@dagitses has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

@facebook-github-bot
Contributor

@dagitses merged this pull request in 4262c89.

@facebook-github-bot facebook-github-bot deleted the gh/peterbell10/179/head branch November 8, 2021 15:16
mszhanyi added a commit to mszhanyi/pytorch that referenced this pull request Nov 9, 2021
mszhanyi added a commit to mszhanyi/pytorch that referenced this pull request Nov 9, 2021