fix: improve performance of readStringVarFromBuff #4194

geyslan · 2024-07-18T19:50:31Z

1. Explain what the PR does

28ec953 fix: improve performance of readStringVarFromBuff

Optimize readStringVarFromBuff for better performance and memory usage.

- The optimizations were tested using new benchmarks:
  - Short: 33.26% faster, 42.86% less memory usage, 1 fewer allocation.
  - Medium: 28.34% faster, 54.55% less memory usage, 1 fewer allocation.
  - Long: 26.90% faster, 73.42% less memory usage, 3 fewer allocations.
  - Long with Low Max: 19.12% faster, 48.15% less memory usage, 1 fewer
    allocation.

The overall improvements show significant gains in both execution speed
and memory efficiency. For more details, see eventsreader_bench_test.go.

Changes:
- Preallocated the buffer with a reasonable initial capacity to avoid
  repeated slice resizing.
- Removed TrimLeft call since conversion logic already stops at the
  first nul byte decoded.

2. Explain how to test it

3. Other comments

pkg/bufferdecoder/eventsreader.go

pkg/bufferdecoder/eventsreader_test.go

rscampos · 2024-08-23T19:37:59Z

I double-checked the benchmark:

/usr/local/go/bin/go test -benchmem -run='^$' -bench '^(BenchmarkReadStringVarFromBuff_ShortString_warm|BenchmarkReadStringVarFromBuff_ShortString_old|BenchmarkReadStringVarFromBuff_ShortString|BenchmarkReadStringVarFromBuff_MediumString_old|BenchmarkReadStringVarFromBuff_MediumString|BenchmarkReadStringVarFromBuff_LongString_old|BenchmarkReadStringVarFromBuff_LongString|BenchmarkReadStringVarFromBuff_LongStringLowMax_old|BenchmarkReadStringVarFromBuff_LongStringLowMax)$' github.com/aquasecurity/tracee/pkg/bufferdecoder -benchtime=1000000x

=== RUN   BenchmarkReadStringVarFromBuff_ShortString_old
BenchmarkReadStringVarFromBuff_ShortString_old
BenchmarkReadStringVarFromBuff_ShortString_old-4                 1000000                52.02 ns/op           40 B/op          3 allocs/op
=== RUN   BenchmarkReadStringVarFromBuff_ShortString
BenchmarkReadStringVarFromBuff_ShortString
BenchmarkReadStringVarFromBuff_ShortString-4                     1000000                27.85 ns/op           16 B/op          2 allocs/op
=== RUN   BenchmarkReadStringVarFromBuff_MediumString_old
BenchmarkReadStringVarFromBuff_MediumString_old
BenchmarkReadStringVarFromBuff_MediumString_old-4                1000000                83.48 ns/op           88 B/op          3 allocs/op
=== RUN   BenchmarkReadStringVarFromBuff_MediumString
BenchmarkReadStringVarFromBuff_MediumString
BenchmarkReadStringVarFromBuff_MediumString-4                    1000000                49.47 ns/op           40 B/op          2 allocs/op
=== RUN   BenchmarkReadStringVarFromBuff_LongString_old
BenchmarkReadStringVarFromBuff_LongString_old
BenchmarkReadStringVarFromBuff_LongString_old-4                  1000000             24929 ns/op           77057 B/op          5 allocs/op
=== RUN   BenchmarkReadStringVarFromBuff_LongString
BenchmarkReadStringVarFromBuff_LongString
BenchmarkReadStringVarFromBuff_LongString-4                      1000000             13190 ns/op           20480 B/op          2 allocs/op
=== RUN   BenchmarkReadStringVarFromBuff_LongStringLowMax_old
BenchmarkReadStringVarFromBuff_LongStringLowMax_old
BenchmarkReadStringVarFromBuff_LongStringLowMax_old-4            1000000               289.4 ns/op           432 B/op          3 allocs/op
=== RUN   BenchmarkReadStringVarFromBuff_LongStringLowMax
BenchmarkReadStringVarFromBuff_LongStringLowMax
BenchmarkReadStringVarFromBuff_LongStringLowMax-4                1000000               182.8 ns/op           224 B/op          2 allocs/op

The results are aligned with yours:

Short: 46.46% faster, 60.00% less memory usage, 1 fewer allocation.
Medium: 40.74% faster, 54.55% less memory usage, 1 fewer allocation.
Long: 47.09% faster, 73.42% less memory usage, 3 fewer allocations.
Long with Low Max: 36.83% faster, 48.15% less memory usage, 1 fewer allocation.

rscampos

LGTM

geyslan added kind/bug area/performance milestone/v0.22.0 labels Jul 18, 2024

geyslan requested review from rscampos, yanivagman and NDStrahilevitz July 18, 2024 19:50

geyslan self-assigned this Jul 18, 2024

github-actions bot added area/testing and removed area/performance labels Jul 18, 2024

geyslan force-pushed the 4081-makezero branch from 7a59029 to 82ca37b Compare July 18, 2024 20:04

geyslan marked this pull request as ready for review July 18, 2024 20:04

geyslan commented Jul 18, 2024

pkg/bufferdecoder/eventsreader.go (outdated)

NDStrahilevitz reviewed Jul 20, 2024

pkg/bufferdecoder/eventsreader.go

geyslan force-pushed the 4081-makezero branch from 82ca37b to 7eb2ef7 Compare July 21, 2024 15:34

geyslan commented Jul 21, 2024

pkg/bufferdecoder/eventsreader_test.go

geyslan force-pushed the 4081-makezero branch from 7eb2ef7 to 5530e1a Compare July 23, 2024 11:28

geyslan force-pushed the 4081-makezero branch from 5530e1a to 28ec953 Compare August 23, 2024 12:55

rscampos approved these changes Aug 23, 2024

geyslan merged commit 773e747 into aquasecurity:main Aug 23, 2024
30 checks passed

geyslan deleted the 4081-makezero branch February 19, 2025 20:16
