Remove native_functions.yaml dependency from TensorTopK.cu #66794
Conversation
💊 CI failures summary (Dr. CI): as of commit 65945dd, 16 new failures were recognized by patterns; they do not appear to be due to upstream breakages.
@dagitses has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
```cpp
void launch_gather_topk_kernel(
    const TensorBase& self, int64_t k, int64_t dim, bool largest, bool sorted,
    const TensorBase& values, const TensorBase& indices) {
  int numDims = self.dim();
```
out of curiosity, what are the criteria by which these four lines remain in the .cu but the `if (k == 0)` check moves?

My understanding of your work is that you are prioritizing the following:
1) pull out dependencies on Tensor and ATen.h from .cu files.
2) less important, but where possible, extract other unnecessary dependencies from .cu files.
3) take advantage of other opportunities to move code out of .cu files.

Is that roughly the prioritization of your approach here? Under that, moving this block and the `k == 0` check both fit under 3) as the lowest priority.

Reordering the `k == 0` check does change the behavior, since it now skips the check for having too many dimensions. Is that OK? FWIW, I like the idea of being stringent on inputs rather than letting a loophole like this let the user get away with an invalid input.
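For concreteness, a minimal sketch of a call whose behavior would flip under the reordering (hypothetical example; it assumes the 25-dimension `MAX_DIMS` limit cited in the reply below):

```cpp
#include <ATen/ATen.h>
#include <vector>

int main() {
  // A CUDA tensor with 26 dimensions, one more than the kernel's MAX_DIMS.
  auto t = at::zeros(std::vector<int64_t>(26, 1),
                     at::TensorOptions().device(at::kCUDA));
  // Before this PR: the dimension check in the .cu entry point ran first,
  // so this threw even though no kernel launch was needed.
  // After this PR: the k == 0 early return in the .cpp wrapper fires first,
  // so this returns empty values/indices without complaint.
  auto result = at::topk(t, /*k=*/0, /*dim=*/0);
}
```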
Changing topic altogether, do you think that splitting code up this way causes any meaningful harm by creating cross-module optimization barriers?
> out of curiosity, what are the criteria by which these four lines remain in the .cu but the `if (k == 0)` check moves?

The `return` statement must go in the `.cpp` file function so we don't launch the sorting kernels. It would make sense to keep the `MAX_DIMS` checks together with it, but `MAX_DIMS` is defined in a `.cuh` header file so needs `nvcc`:

`constexpr int MAX_DIMS = 25;`
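A rough sketch of the resulting split (illustrative only: the wrapper name `topk_cuda_impl` and the check message are paraphrased, not the verbatim diff):

```cpp
// --- TensorTopK.cpp: host-compiled, only needs TensorBase ---
#include <ATen/core/TensorBase.h>

void launch_gather_topk_kernel(
    const at::TensorBase& self, int64_t k, int64_t dim, bool largest,
    bool sorted, const at::TensorBase& values, const at::TensorBase& indices);

void topk_cuda_impl(  // hypothetical wrapper name
    const at::TensorBase& self, int64_t k, int64_t dim, bool largest,
    bool sorted, const at::TensorBase& values, const at::TensorBase& indices) {
  if (k == 0) {
    return;  // early return lives here so the sorting kernels never launch
  }
  launch_gather_topk_kernel(self, k, dim, largest, sorted, values, indices);
}

// --- TensorTopK.cu: nvcc-compiled, can include the .cuh defining MAX_DIMS ---
void launch_gather_topk_kernel(
    const at::TensorBase& self, int64_t k, int64_t dim, bool largest,
    bool sorted, const at::TensorBase& values, const at::TensorBase& indices) {
  int numDims = self.dim();
  TORCH_CHECK(numDims <= MAX_DIMS, "input tensor has too many dimensions");
  // ... select and launch the CUDA sorting kernels ...
}
```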
> My understanding of your work is that you are prioritizing the following: [...]

This is mostly right, although `.cu` isn't actually in my criteria anywhere. I'm currently focusing on files that depend on `native_functions.yaml`, prioritized by their compile time (to maximize impact). It just so happens that CUDA code is much slower to compile, so the top of the list is all CUDA files. Somewhat interestingly, `GridSample.cpp` was above `GridSample.cu` in compile time, which is why that PR changes both.

The top of the list at the moment looks like this:
| Object file | Compile time |
|---|---|
| `caffe2/CMakeFiles/torch_cuda_cu.dir/__/aten/src/ATen/native/cuda/DistributionBernoulli.cu.o` | 1m 13s 344ms |
| `caffe2/CMakeFiles/torch_cuda_cu.dir/__/aten/src/ATen/native/cuda/ForeachUnaryOp.cu.o` | 1m 12s 425ms |
| `caffe2/CMakeFiles/torch_cuda_cu.dir/__/aten/src/ATen/native/cuda/Distributions.cu.o` | 1m 8s 582ms |
| `caffe2/CMakeFiles/torch_cuda_cu.dir/__/aten/src/ATen/native/cuda/Indexing.cu.o` | 1m 7s 595ms |
| `caffe2/CMakeFiles/torch_cuda_cu.dir/__/aten/src/ATen/native/cuda/group_norm_kernel.cu.o` | 0m 59s 931ms |
| `caffe2/CMakeFiles/torch_cuda_cu.dir/__/aten/src/ATen/RegisterCUDA.cpp.o` | 0m 55s 472ms |
> FWIW, I like the idea of being stringent on inputs rather than letting a loophole like this let the user get away with an invalid input.

I wouldn't say that applies here, since `MAX_DIMS` is an implementation limitation, not an invalid input. If, for example, `matmul` allowed empty tensors to have a shape mismatch, then I would agree.

> Changing topic altogether, do you think that splitting code up this way causes any meaningful harm by creating cross-module optimization barriers?

I don't think there's much the compiler can do here, but things like calling the same tensor method in both functions will have some impact (especially for virtual methods). However, generally speaking, the heavy lifting of these functions is done in the CUDA runtime to actually launch the kernel. So, if there is any slowdown, I expect it to be fairly minimal.
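A minimal sketch of that barrier (hypothetical file and function names): with the split, each translation unit makes its own `self.dim()` call, and the optimizer cannot merge them without LTO:

```cpp
// wrapper.cpp -- host compiler, one translation unit
#include <ATen/core/TensorBase.h>

void launch_kernel(const at::TensorBase& self);  // implemented in kernel.cu

void wrapper(const at::TensorBase& self, int64_t k) {
  if (k == 0) return;
  int64_t dim = self.dim();  // call #1: its result is invisible to kernel.cu
  (void)dim;                 // ... input validation would use it here ...
  launch_kernel(self);
}

// kernel.cu -- nvcc, a separate translation unit
void launch_kernel(const at::TensorBase& self) {
  int64_t dim = self.dim();  // call #2: cannot be folded with call #1
                             // across the TU boundary (short of LTO)
  // ... the kernel launch through the CUDA runtime dominates the cost ...
}
```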
…ytorch#66794)" This reverts commit 4262c89.
…pK.cu (pytorch#66794)"" This reverts commit a9a489e.
Stack from ghstack:
Differential Revision: D31856104