Optimize vindex #11625
Conversation
1. Tokenize before broadcasting.
2. Avoid copying indexer arrays as much as possible.
3. Use cached_max / cached_cumsum for determining chunk sizes and chunk bounds.
4. Broadcast as late as possible.
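For context, a minimal sketch of the idea behind points 3 and 4 (not the actual dask internals; plain `np.cumsum` stands in for the cached helpers, and all sizes are made up): chunk bounds let sorted indexer values be mapped to blocks in one vectorized pass, without copying the indexer per block.

```python
import numpy as np

chunks = (100,) * 100                      # chunk sizes along one axis
bounds = np.cumsum((0,) + chunks)          # chunk boundaries; this is what cached_cumsum memoizes
index = np.sort(np.random.randint(0, bounds[-1], 5_000))

# searchsorted assigns every indexer value to its block at once, so the
# sorted indexer never needs to be materialized per block up front
blocks = np.searchsorted(bounds, index, side="right") - 1
starts = np.searchsorted(blocks, np.arange(len(chunks)))  # first position of each block
```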
This reverts commit e56f72fafc07f7aceae5f05b9b69bb5b3a530a18.
Unit Test Results

See test report for an extended history of previous test failures. This is useful for diagnosing flaky tests.

15 files ±0, 15 suites ±0, 3h 59m 27s ⏱️ +2m 50s
Results for commit e23eac5. ± Comparison against base commit fe465e0.

♻️ This comment has been updated with latest results.
967b129 to a4379f3 (compare)
```python
slicer = slice(start, stop)
key = sorted_keys[start]
# recover the output block and the per-axis input block ids from the flat key
outblock, *input_blocks = np.unravel_index(key, ravel_shape)
# the indexer values that fall into this block
inblock = [_[slicer] for _ in sorted_inblock_idxs]
```
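As a hypothetical standalone illustration of the key encoding used above (the `ravel_shape` values here are invented): block coordinates are flattened into one sortable integer and recovered with `np.unravel_index`.

```python
import numpy as np

ravel_shape = (4, 3, 3)   # hypothetical: (output blocks, input blocks axis 0, axis 1)
key = 17
outblock, *input_blocks = np.unravel_index(key, ravel_shape)
print(outblock, input_blocks)  # 1 [2, 2], since 17 == 1*9 + 2*3 + 2
```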
Perhaps we should just scatter out `sorted_inblock_idxs` as dask arrays, and embed a reference to a dask array sliced by these slicers in each task instead of the full array? The graph is quite big for the example in #11018.
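Something like this rough sketch of the idea, assuming a `distributed` client is available (names and sizes are illustrative, not the actual implementation):

```python
import numpy as np
from distributed import Client

client = Client(processes=False)                 # local cluster for illustration
sorted_inblock_idxs = [np.arange(10_000), np.arange(10_000)]

# scatter once: each task then carries small futures plus slice bounds
# instead of embedding full copies of the indexer arrays in the graph
idx_futures = client.scatter(sorted_inblock_idxs, broadcast=True)

def take(idxs, start, stop):
    return [a[start:stop] for a in idxs]

futures = [
    client.submit(take, idx_futures, start, start + 100)
    for start in range(0, 10_000, 100)
]
results = client.gather(futures)
```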
Yeah, I get `UserWarning: Sending large graph of size 738.91 MiB` for:
```python
import dask.array as da
import numpy as np
import scipy as sp

chunksize = 100
size = 10_000
n_points = 5000

X = da.random.poisson(15, (size, size), chunks=(chunksize, chunksize))
index_0 = np.random.randint(0, X.shape[0], n_points)
index_0.sort()
index_1 = np.random.randint(0, X.shape[1], n_points)
index_1.sort()

print('vindex timing:')
X.vindex[np.ix_(index_0, index_1)].compute()
```
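For reference, the same selection can also be written as two sequential orthogonal indexing steps, which makes a useful baseline when timing the vindex path (a minimal sketch, assuming `X`, `index_0`, and `index_1` from the snippet above):

```python
# equivalent result via plain outer indexing along each axis in turn
Y = X[index_0][:, index_1]
Y.compute()
```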
If `index_0` has shape (M, 1) and `index_1` has shape (1, N), that ships M+N values instead of the current M×N.
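A small illustration of that count with plain numpy arrays (hypothetical sizes):

```python
import numpy as np

M, N = 5_000, 5_000
rows = np.arange(M)[:, None]                 # shape (M, 1)
cols = np.arange(N)[None, :]                 # shape (1, N)

# broadcasting up front materializes two (M, N) index arrays
r_full, c_full = np.broadcast_arrays(rows, cols)
print(r_full.size + c_full.size)             # 2 * M * N = 50_000_000

# keeping the outer indexers separate ships only M + N values
print(rows.size + cols.size)                 # M + N = 10_000
```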
That said, I think this PR can be merged (barring any other comments). Such a graph takes so long to build today that no one is using it.
Yeah, I agree, we can merge this now; it already reduced graph size by 50%.

Thanks!
- `pre-commit run --all-files`
- vindex as outer indexer: memory and time performance #11018 (1.87s, 2.1GB memory)