8000 Extend CSR constructor to support batched indices and values by IvanYashchuk · Pull Request #74542 · pytorch/pytorch · GitHub

Extend CSR constructor to support batched indices and values #74542


Closed
wants to merge 25 commits into from

Conversation

IvanYashchuk
Collaborator

This is the first portion of changes required to enable Batched CSR format described in #60854 (comment).

Currently, only the same batch shape for indices and values is allowed. In the future, we could enable "broadcasting" of indices and batched values, as done in xFormers (https://github.com/facebookresearch/xformers/blob/dd96b8d8beda5308fb433c1ef3ff04b7f178c263/xformers/components/attention/_sputnik_sparse.py#L441).

This PR adds the possibility to construct a batched CSR matrix with torch.sparse_csr_tensor; the resulting batched CSR tensor can then be converted to a dense tensor with a .to_dense() call.
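As a rough pure-Python sketch (not PyTorch code) of what the batched layout stores: each batch entry carries its own crow/col/values triple, and .to_dense() expands each entry independently. The helper and sample data below are hypothetical illustrations, not the PR's implementation.

```python
def csr_to_dense(crow, col, vals, ncols):
    """Expand one CSR matrix (crow/col/vals) into a dense list of lists."""
    nrows = len(crow) - 1
    dense = [[0] * ncols for _ in range(nrows)]
    for r in range(nrows):
        # crow[r]:crow[r+1] delimits the stored entries of row r
        for i in range(crow[r], crow[r + 1]):
            dense[r][col[i]] = vals[i]
    return dense

# A batch of two 2x2 CSR matrices with identical sparsity patterns,
# mirroring the batched tensors this PR allows torch.sparse_csr_tensor
# to accept (indices and values gain leading batch dimensions).
batch_crow = [[0, 2, 4], [0, 2, 4]]
batch_col = [[0, 1, 0, 1], [0, 1, 0, 1]]
batch_vals = [[1, 2, 3, 4], [5, 6, 7, 8]]

batch_dense = [
    csr_to_dense(c, j, v, ncols=2)
    for c, j, v in zip(batch_crow, batch_col, batch_vals)
]
# batch_dense == [[[1, 2], [3, 4]], [[5, 6], [7, 8]]]
```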

@IvanYashchuk IvanYashchuk added module: sparse Related to torch.sparse release notes: sparse release notes category topic: improvements topic category labels Mar 22, 2022
@IvanYashchuk IvanYashchuk requested a review from cpuhrsch March 22, 2022 14:19
@facebook-github-bot
Contributor
facebook-github-bot commented Mar 22, 2022

🔗 Helpful links

💊 CI failures summary and remediations

As of commit 0c79aa7 (more details on the Dr. CI page):


None of the CI failures appear to be your fault 💚



🚧 1 fixed upstream failure:

These were probably caused by upstream breakages that were already fixed.

Please rebase on the viable/strict branch.

If your commit is older than viable/strict, run these commands:

git fetch https://github.com/pytorch/pytorch viable/strict
git rebase FETCH_HEAD

This comment was automatically generated by Dr. CI.


// std::array<int64_t, 2> size = {0, 0};
auto size = DimVector(IntArrayRef(col_indices.sizes().data(), col_indices.dim() - 1));
size.push_back(crow_indices.size(-1) - 1);
size.push_back(col_indices.max().item<int64_t>() + 1);
Contributor

col_indices are always guaranteed to be int64_t now?

Collaborator Author

No, but here .max().item() gives a Scalar and .item<int64_t> casts the Scalar to int64_t.

return item().to##name(); \
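The size-inference rule in the snippet under review can be sketched in pure Python (the helper name is hypothetical): batch dimensions come from all but the last dimension of col_indices, the row count from crow_indices.size(-1) - 1, and the column count from the largest column index plus one.

```python
def infer_batched_csr_size(crow_indices_shape, col_indices_shape, col_values):
    """Return [*batch_dims, nrows, ncols] from CSR component shapes/values."""
    batch_dims = list(col_indices_shape[:-1])   # everything but the last dim
    nrows = crow_indices_shape[-1] - 1          # crow stores nrows + 1 row pointers
    ncols = max(col_values) + 1                 # tightest width covering all columns
    return batch_dims + [nrows, ncols]

# Batch of shape (2,): two 2x2 matrices -> inferred size [2, 2, 2]
size = infer_batched_csr_size((2, 3), (2, 4), [0, 1, 0, 1, 0, 1, 0, 1])
```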

@IvanYashchuk
Collaborator Author

Okay, a few tests are really failing. I'll resolve the failures.

from functools import reduce
from operator import mul  # needed for reduce(mul, ...)

for batch_shape in ((), (2,), (2, 3)):
    prod = reduce(mul, batch_shape, 1)
    crow_indices = torch.tensor([0, 2, 4], device=device).repeat(prod, 1).reshape(*batch_shape, -1)
Contributor

Technically these indices don't have to be the same for each batch entry. A more powerful test would potentially modify them as well to be different for each batch entry.
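A sketch of that suggestion in pure Python (names are hypothetical test scaffolding): each batch entry gets a different sparsity pattern, while batched CSR still requires every entry to share the same row count and the same nnz.

```python
# Each batch entry below has a different sparsity pattern, but batched CSR
# still needs every entry to have the same number of rows and the same nnz.
batch_crow = [[0, 1, 2], [0, 2, 2]]  # entry 0: one value per row; entry 1: both values in row 0
batch_col = [[0, 1], [0, 1]]
batch_vals = [[1.0, 2.0], [3.0, 4.0]]

for crow, col, vals in zip(batch_crow, batch_col, batch_vals):
    assert len(crow) == len(batch_crow[0])    # same number of rows in every entry
    assert crow[-1] == len(col) == len(vals)  # last row pointer equals nnz
```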

Contributor
@cpuhrsch cpuhrsch left a comment


I think generally this looks fine, but let's wait until you've resolved the current CI failures.

@pytorch-bot
pytorch-bot bot commented Mar 23, 2022

We have recently simplified the CIFlow labels and ciflow/cuda is no longer in use.
You can use any of the following

  • ciflow/trunk (.github/workflows/trunk.yml): all jobs we run per-commit on master
  • ciflow/periodic (.github/workflows/periodic.yml): all jobs we run periodically on master
  • ciflow/all: trunk + periodic; all jobs we run in master CI
  • ciflow/nightly (.github/workflows/nightly.yml): all jobs we run nightly
  • ciflow/binaries: all binary build and upload jobs

@pytorch-bot
pytorch-bot bot commented Mar 23, 2022

We have recently simplified the CIFlow labels and ciflow/cpu is no longer in use.
You can use any of the following

  • ciflow/trunk (.github/workflows/trunk.yml): all jobs we run per-commit on master
  • ciflow/periodic (.github/workflows/periodic.yml): all jobs we run periodically on master
  • ciflow/all: trunk + periodic; all jobs we run in master CI
  • ciflow/nightly (.github/workflows/nightly.yml): all jobs we run nightly
  • ciflow/binaries: all binary build and upload jobs

@IvanYashchuk IvanYashchuk added the ciflow/trunk Trigger trunk jobs on your pull request label Mar 23, 2022
@cpuhrsch
Contributor
cpuhrsch commented Apr 4, 2022

@IvanYashchuk - could you rebase this on top of a green commit please? See https://hud.pytorch.org/ (e.g. c5872e6). Hopefully that'll fix the lint CI error.

@IvanYashchuk
Collaborator Author

The base of this PR is the viable/strict branch with bf16552, which is green.

@cpuhrsch
Contributor
cpuhrsch commented Apr 4, 2022

@IvanYashchuk - well then let's try rerunning those jobs again.

@cpuhrsch
Contributor
cpuhrsch commented Apr 4, 2022

@IvanYashchuk - FYI there's a PR that aims to prevent a broken master lint job from holding up PRs that are built upon viable strict. #75199

@cpuhrsch
Contributor
cpuhrsch commented Apr 4, 2022

@pytorchbot merge this

malfet added a commit that referenced this pull request Apr 5, 2022
It caused a number of internal only compilation failures, for example
see:
#74425 (comment)
and #74542 (comment)

[ghstack-poisoned]
malfet added a commit that referenced this pull request Apr 5, 2022
It caused a number of internal only compilation failures, for example
see:
#74425 (comment)
and #74542 (comment)

[ghstack-poisoned]
malfet added a commit that referenced this pull request Apr 5, 2022
It caused a number of internal only compilation failures, for example
see:
#74425 (comment)
and #74542 (comment)

ghstack-source-id: 14889fa
Pull Request resolved: #75085
pytorchmergebot pushed a commit that referenced this pull request Apr 5, 2022
It caused a number of internal only compilation failures, for example
see:
#74425 (comment)
and #74542 (comment)

Pull Request resolved: #75085

Approved by: https://github.com/ngimel, https://github.com/albanD
@b0noI
Contributor
b0noI commented Apr 5, 2022

@pytorchbot revert this

@b0noI
Contributor
b0noI commented Apr 5, 2022

Internal errors:

Summary:
stderr: caffe2/aten/src/ATen/native/sparse/cuda/SparseBlasImpl.cpp:983:20: error: lambda capture 'C_crow_indices' is not used [-Werror,-Wunused-lambda-capture]
auto fix_nnz = [&C_crow_indices, &m](int nnz) -> int {
~^~~~~~~~~~~~~~~
caffe2/aten/src/ATen/native/sparse/cuda/SparseBlasImpl.cpp:983:37: error: lambda capture 'm' is not used [-Werror,-Wunused-lambda-capture]
auto fix_nnz = [&C_crow_indices, &m](int nnz) -> int {
~~~^


pytorchmergebot added a commit that referenced this pull request Apr 5, 2022
@cpuhrsch
Contributor
cpuhrsch commented Apr 5, 2022

@malfet - We might want to make this error part of the CI too

@cpuhrsch cpuhrsch reopened this Apr 5, 2022
@IvanYashchuk IvanYashchuk requested a review from a team as a code owner April 5, 2022 21:57
@cpuhrsch
Contributor
cpuhrsch commented Apr 5, 2022

@IvanYashchuk - I merged master and added a simple fix for this and will attempt to merge again once the CI runs green

diff --git a/aten/src/ATen/native/sparse/cuda/SparseBlasImpl.cpp b/aten/src/ATen/native/sparse/cuda/SparseBlasImpl.cpp
index 27432431f7..7cfe1248fb 100644
--- a/aten/src/ATen/native/sparse/cuda/SparseBlasImpl.cpp
+++ b/aten/src/ATen/native/sparse/cuda/SparseBlasImpl.cpp
@@ -983,15 +983,21 @@ void add_out_sparse_csr(
   auto C_col_indices_ptr = C_col_indices.data_ptr<int>();

   // Windows compilers don't support nested macros
-  // so we need this lambda outside of the AT_DISPATCH_FLOATING_AND_COMPLEX_TYPES
-  auto fix_nnz = [&C_crow_indices, &m](int nnz) -> int {
-    // For some reason POINTER_MODE_HOST is not working here
-    // Let's extract manually the nnz from the C_crow_indices
-    #if AT_ROCM_ENABLED()
+  // so we need this lambda outside of the
+  // AT_DISPATCH_FLOATING_AND_COMPLEX_TYPES
+  auto fix_nnz = [
+#if AT_ROCM_ENABLED()
+                     &C_crow_indices,
+                     &m
+#endif
+  ](int nnz) -> int {
+// For some reason POINTER_MODE_HOST is not working here
+// Let's extract manually the nnz from the C_crow_indices
+#if AT_ROCM_ENABLED()
     return std::max({nnz, C_crow_indices.narrow(-1, m, 1).item<int>()});
-    #else
+#else
     return nnz;
-    #endif
+#endif
   };

@malfet
Contributor
malfet commented Apr 5, 2022

@malfet - We might want to make this error part of the CI too

[Edit] This warning is only generated by clang (see a really old gcc feature request), and we do not have CUDA+clang builds configured in our CI at the moment (trying to add this in #75293)

facebook-github-bot pushed a commit that referenced this pull request Apr 7, 2022
Summary:
It caused a number of internal only compilation failures, for example
see:
#74425 (comment)
and #74542 (comment)

Pull Request resolved: #75085

Approved by: https://github.com/ngimel, https://github.com/albanD

Test Plan: contbuild & OSS CI, see https://hud.pytorch.org/commit/pytorch/pytorch/90a56fc515dbac9534a1a14110f9edf089430f81

Reviewed By: b0noI

Differential Revision: D35404322

Pulled By: malfet

fbshipit-source-id: aaa7033d0b7cbfcc1d4b3eeff86d09eba428f068
@IvanYashchuk
Collaborator Author

@cpuhrsch, let's try one more time? 🤞

@cpuhrsch
Contributor
cpuhrsch commented Apr 7, 2022

@pytorchbot merge this

facebook-github-bot pushed a commit that referenced this pull request Apr 8, 2022
Summary:
This is the first portion of changes required to enable Batched CSR format described in #60854 (comment).

Currently, only the same batch shape for indices and values is allowed. In the future, we could enable "broadcasting" of indices and batched values, as done in xFormers (https://github.com/facebookresearch/xformers/blob/dd96b8d8beda5308fb433c1ef3ff04b7f178c263/xformers/components/attention/_sputnik_sparse.py#L441).

This PR adds the possibility to construct a batched CSR matrix with `torch.sparse_csr_tensor`; the resulting batched CSR tensor can then be converted to a dense tensor with a `.to_dense()` call.

Pull Request resolved: #74542
Approved by: https://github.com/cpuhrsch

Test Plan: contbuild & OSS CI, see https://hud.pytorch.org/commit/pytorch/pytorch/c7ae23b50e5f96261889ab9d55df1be7a6b1d55f

Reviewed By: b0noI

Differential Revision: D35485699

fbshipit-source-id: fa1c0c5cf256ac886717a9016a83e62ea2772f75
Labels
ciflow/trunk Trigger trunk jobs on your pull request cla signed module: sparse Related to torch.sparse open source release notes: sparse release notes category Reverted topic: improvements topic category triaged This issue has been looked at by a team member, and triaged and prioritized into an appropriate module
Projects
None yet
Development

Successfully merging this pull request may close these issues.

9 participants