expanded scalar is not contiguous · Issue #3919 · pytorch/pytorch · GitHub

expanded scalar is not contiguous #3919


Closed
shekhovt opened this issue Nov 28, 2017 · 10 comments

Comments

@shekhovt

torch.Tensor(1,1).expand(1,2).view(1,2)

Traceback (most recent call last):
File "", line 1, in
RuntimeError: invalid argument 1: input is not contiguous at /pytorch/torch/lib/TH/generic/THTensor.c:231

There are two problems:

  1. view of the same size as the input should always succeed, regardless of contiguity
  2. an expanded scalar is of course contiguous, no?

I think these are bugs.

As a feature request, may I suggest that some reshaping operations should also succeed on non-contiguous Tensors, as long as only dimensions that are contiguous are affected. Example:
size (4,4, X), strides (1,4, Y) -> size (16, X), stride (1, Y)
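
As a quick sketch of what is going on in the failing case: the expanded dimension ends up with stride 0, which is what makes is_contiguous() return False and view() raise (the exact stride of the leading dimension may vary by version):

import torch

t = torch.Tensor(1, 1).expand(1, 2)
print(t.size())           # torch.Size([1, 2])
print(t.stride())         # the expanded dimension has stride 0
print(t.is_contiguous())  # False -- this is why .view() raises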

@gchanan
Contributor
gchanan commented Nov 28, 2017

On 2., the docs could be a little cleaner; right now they say "Returns True if this tensor is contiguous in memory in C order," but it's not really clear (to me at least) what "in C order" means in the case of 0-strided (i.e. expanded) tensors. The condition is actually something like "contiguous in memory in C order and each point is non-aliased", "contiguous in memory in C order with no strides of size 0", or "memory is equivalent to a contiguous C N-dimensional array of the corresponding size".

On 1., we probably need more "viewing" functions, i.e. a "reshape" (that is a no-op for input of the same size), a "view" that always succeeds, and maybe the current view (that requires contiguity).
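
A rough sketch of the proposed "reshape" semantics, as a hypothetical helper (the name reshape_sketch and its exact fallback behavior are assumptions, not an existing PyTorch function): a no-op for an input of the same size, a cheap view when the input is contiguous, and a copy otherwise.

import torch

def reshape_sketch(t, *size):
    # Hypothetical semantics only, not PyTorch API.
    if tuple(t.size()) == tuple(size):
        return t                        # no-op for input of the same size
    if t.is_contiguous():
        return t.view(*size)            # cheap view when contiguity allows it
    return t.contiguous().view(*size)   # otherwise materialize a contiguous copy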

@shekhovt
Author

view is kind of like attaching a new handle, with some size and strides, to the same data, without rearranging the data in memory. This is certainly possible in this case:
torch.Tensor(1,1).expand(1,2).view(-1)
by setting size=(2,), strides=(0,). Even though this is not contiguous in the sense of the existing definition, as you explain, view would be more complete and consistent if it covered such cases.
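
The "new handle" idea can be illustrated with numpy's as_strided (used here only as an analogy, not the PyTorch call under discussion): the single stored element can be exposed under shape (2,) with stride 0, without moving any data.

import numpy as np
from numpy.lib.stride_tricks import as_strided

base = np.zeros((1, 1), dtype=np.float32)
flat = as_strided(base, shape=(2,), strides=(0,))  # both entries alias base[0, 0]
base[0, 0] = 7.0
print(flat)  # [7. 7.] -- same storage, nothing was copied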

@vadimkantorov
Contributor

proposed a similar thing to (1) in #3653

@shekhovt
Author
shekhovt commented Nov 28, 2017

I checked how this works in numpy, and it supports all the mentioned cases, including zero strides and #3653.
In fact, one can hack around Tensor.view through numpy:

import torch
import numpy as np

a = torch.Tensor(16, 8, 32, 64).transpose(-1, -2)
#b = a.view(-1, 64, 32)  # does not work
b = torch.from_numpy(a.numpy().reshape([-1, 64, 32]))  # works

You can verify that a and b share the same memory, and inspect the strides:

b.numpy().strides
(8192, 4, 256)

The only problem is that this does not work for CUDA tensors, since the device data pointer is not reachable from numpy. But I hope I have made my point that Tensor.view could work this way, and that it would be clearer and more consistent.
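
To double-check the shared-memory claim, one can ask numpy via np.shares_memory and write through one of the views (a small sketch continuing the snippet above):

import torch
import numpy as np

a = torch.Tensor(16, 8, 32, 64).transpose(-1, -2)
b = torch.from_numpy(a.numpy().reshape([-1, 64, 32]))

print(np.shares_memory(a.numpy(), b.numpy()))  # True -- no copy was made
a[0, 0, 0, 0] = 42
print(b[0, 0, 0])                               # 42 -- the write is visible through b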

@colesbury
Member

An expanded Scalar is not C-contiguous in either PyTorch or NumPy.

>>> np.broadcast_to(np.array(1), (5, 5)).flags['C_CONTIGUOUS']
False

C-contiguous means that moving to the next element in the array moves one element over in the underlying memory block. This is not the case for expanded scalars, since their strides are zero and every element aliases the same location. Having the memory block be contiguous in memory is not sufficient.

We should add reshape(). That's already discussed in other issues.
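
For what it's worth, the zero strides of the broadcast array are easy to inspect directly in numpy (a minimal sketch along the lines of the snippet above):

import numpy as np

x = np.broadcast_to(np.array(1), (5, 5))
print(x.strides)                # (0, 0): every element aliases the same memory
print(x.flags['C_CONTIGUOUS'])  # False, as noted above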

@shekhovt
Author

Thanks, I understand C-contiguity. But is it the right prerequisite for view to work? It can work in all of the following cases:

  1. the sizes are the same
  2. reshaping a subsequence of dimensions with zero strides
  3. reshaping a subsequence of C-contiguous dimensions

I believe these work in numpy, and they do not require all the mechanics of reshape:

a = np.broadcast_to(np.array(1), (2, 2)).view()
a.shape = [1,4]

array([[1, 1, 1, 1]])

@colesbury
Member

Tensor.view is not equivalent to numpy.reshape (which does not require C-contiguity), nor to ndarray.view (which changes the data type).

@shekhovt
Author

That I cannot argue with. I just suggest that there are a good number of use cases where one wants a view of a partially broadcasted tensor, where copying or changing the data type can be avoided, and the current implementation does not support it.

@colesbury
Member

Yes, agreed. But we want to solve it by adding reshape() instead of changing view.
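
As a side note, later PyTorch releases did add Tensor.reshape(), which returns a view when the existing strides permit it and otherwise makes a contiguous copy, so the example from this issue goes through on such versions (a minimal sketch, assuming a release that has reshape):

import torch

t = torch.zeros(1, 1).expand(1, 2)
r = t.reshape(1, 2)   # succeeds: a view if the strides allow it, otherwise a copy
print(r)              # tensor([[0., 0.]])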

@shekhovt
Author

In that case, under what conditions would the new reshape() not rearrange the data? And sorry, the .view() in the code above was indeed irrelevant.
