[libzstd] Optimize ZSTD_insertBt1() for repetitive data #1635

terrelln · 2019-06-06T03:46:57Z

We would only skip at most 192 bytes at a time before this diff.
This was added to optimize long matches and skip the middle of the
match. However, it doesn't handle the case of repetitive data.

This patch keeps the optimization, but also handles repetitive data
by taking the max of the two return values.

Before:

> for n in $(seq 9); do echo strategy=$n; dd status=none if=/dev/zero bs=1024k count=1000 | command time -f %U zstd --zstd=strategy=$n >/dev/null; done
strategy=1
0.34
strategy=2
0.36
strategy=3
0.41
strategy=4
0.93
strategy=5
1.33
strategy=6
8.41
strategy=7
31.03
strategy=8
31.13
strategy=9
123.05

After:

> for n in $(seq 9); do echo strategy=$n; dd status=none if=/dev/zero bs=1024k count=1000 | command time -f %U ./zstd --zstd=strategy=$n >/dev/null; done
strategy=1
0.27
strategy=2
0.23
strategy=3
0.27
strategy=4
0.43
strategy=5
0.56
strategy=6
0.43
strategy=7
0.34
strategy=8
0.34
strategy=9
0.35

At level 19 with multithreading the compressed size of silesia.tar regresses 300 bytes, and enwik8 regresses 100 bytes. In single threaded mode enwik8 is also within 100 bytes, and I didn't test silesia.tar. The compression speed is not significantly effected.

The regression tests show changes in the order of single digit number of bytes.

Fixes Issue #1634.

We would only skip at most 192 bytes at a time before this diff. This was added to optimize long matches and skip the middle of the match. However, it doesn't handle the case of repetitive data. This patch keeps the optimization, but also handles repetitive data by taking the max of the two return values. ``` > for n in $(seq 9); do echo strategy=$n; dd status=none if=/dev/zero bs=1024k count=1000 | command time -f %U ./zstd --zstd=strategy=$n >/dev/null; done strategy=1 0.27 strategy=2 0.23 strategy=3 0.27 strategy=4 0.43 strategy=5 0.56 strategy=6 0.43 strategy=7 0.34 strategy=8 0.34 strategy=9 0.35 ``` At level 19 with multithreading the compressed size of `silesia.tar` regresses 300 bytes, and `enwik8` regresses 100 bytes. In single threaded mode `enwik8` is also within 100 bytes, and I didn't test `silesia.tar`. Fixes Issue facebook#1634.

terrelln and others added 2 commits June 5, 2019 20:34

[regression] Update results.csv

f3800ba

facebook-github-bot added the CLA Signed label Jun 6, 2019

Cyan4973 approved these changes Jun 6, 2019

View reviewed changes

terrelln merged commit d06c15c into facebook:dev Jun 6, 2019

terrelln mentioned this pull request Jun 6, 2019

Higher strategies have dramatic slowdowns on repetitive data #1634

Closed

felixhandte mentioned this pull request Jul 19, 2019

Merge v1.4.1 to Master #1691

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[libzstd] Optimize ZSTD_insertBt1() for repetitive data #1635

[libzstd] Optimize ZSTD_insertBt1() for repetitive data #1635

Uh oh!

Uh oh!

Uh oh!

[libzstd] Optimize ZSTD_insertBt1() for repetitive data #1635

[libzstd] Optimize ZSTD_insertBt1() for repetitive data #1635

Uh oh!

Conversation

Uh oh!

Uh oh!

Uh oh!