Storage optimization #44

lasarojc · 2022-12-23T18:02:49Z

High-level tracking issue for general storage optimization efforts. This issue can be expanded over time.

At present (mid 2023), depending on their configuration, Tendermint-based nodes use large quantities of storage space. This has significant cost implications for operators. We aim to implement strategies to reduce and/or offload certain data stored in order to reduce operators' costs.

The two main problems that are present in the CometBFT storage layer:

We have a very big storage footprint
Querying stored data (whether supporting RPC queries or Comet retrieving consensus data structures) is not optimized and in some cases proven to be very efficient

To address these problems, we first need to build understanding of:

Workloads : What we store, how frequently we access it, what are the characteristics of the stored data (and this list will be expanded).
The database backend: database features, design goals and optimization possibilities.

The work to be done can be broken down in the following main subsections:

Understand and simplify CometBFT database backend #48
The end result of this work should be CometBFT optimized for a single storage backend which ultimately results in a significant reduction in both storage access time and on disk storage footprint.

To reach this goal we envision the following steps :

Define external users and use cases for the CometBFT storage layer #68
Preliminary investigation to identify the users and workloads of the storage backend (how they query the nodes, what are their common pain points with regards to storage, collection of issues to address).
Establish the baseline and future requirements for the storage backend #63
- Benchmark current storage improvements #1044
- Understand the workload: What data are we storing, in what format
- Establish and implement the relevant metrics to understand storage workloads #46
- Storage evaluation baseline: measure and report current storage related behaviour of CometBFT #67
Evaluate database engines according to requirements and decide which one to optimize #64
Refactor CometBFT to use a single underlying database #1039
Add support for users to migrate to the chosen backend

Tune CometBFT to address storage related bottlenecks

Part of this section covers addressing issues found during the benchmarking and investigation process outlined above. Another part addresses concrete issues reported by users. While part of this issues cannot be fully addressed before the analysis above, some optimizations can be performed on CometBFT as it is today - marked with * .

storage: Alternative representation of the genesis file. #1037 *
The Genesis file can be large and surpass internal DB file size limitations (3GB for RocksDB).
Batch commits to the state and block store #1040 *
Reconsider representation of keys for state and block store #1041
Reconstruct state using iterators rather than storing it as an entry. ( depends on previous point)
Pruning of blockstore is not reflected in storage used
- Investigate why Tendermint disk storage keeps growing over time informalsystems/interchain#1
- Add in-process compaction support to databases #49
- storage+indexer: Indexer is not pruned #169
  It seems that for users pruning the indexer is not as high a priority and well understood as reducing the footprint and reducing the potential DoS vector querying it can be.

CometBFT stores and allows querying of data not essential for consensus
We need to Identify the functionalities we want to support within Tendermint and offload non-critical data and functionality.

Implement ADR-101 PoC targeting main #816
This implementation provides users with an API to implement their own event indexing and prune the full nodes who store events at the moment.
Write a data companion based on ADR 101

abci++: Define and implement pruning strategy for stored extended commits #50

CometBFT currently maintains its own [WAL](https://github.com/cometbft/cometbft/blob/101bf50e715d6a10c8135392166c35bdae94972e/consensus/wal.go) - is this even necessary, given that the underlying database should actually be taking care of this? It is another source of complexity and potential point of failure in the system that the team has to maintain.

Original issue: tendermint/tendermint#9881

The text was updated successfully, but these errors were encountered:

lasarojc added this to CometBFT 2023 Dec 23, 2022

lasarojc added storage major-priority A major, long-running priority for the team tracking A complex issue broken down into sub-problems labels Dec 23, 2022

jmalicevic mentioned this issue Jan 20, 2023

storage+indexer: Indexer is not pruned #169

Closed

5 tasks

thanethomson changed the title ~~Storage Optimization~~ Storage optimization Mar 28, 2023

adizere mentioned this issue Oct 19, 2023

Investigate why Tendermint disk storage keeps growing over time informalsystems/interchain#1

Closed

adizere added this to CometBFT Jan 11, 2024

github-project-automation bot moved this to Todo in CometBFT Jan 11, 2024

adizere assigned jmalicevic Jan 12, 2024

adizere moved this from Todo to In Progress in CometBFT Jan 12, 2024

jmalicevic mentioned this issue Jan 17, 2024

Storage optimizations: Q1 2024 tracking issue #2058

Closed

7 tasks

adizere added this to the 2024-Q1 milestone Jan 31, 2024

adizere removed this from the 2024-Q1 milestone Apr 1, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Storage optimization #44

Storage optimization #44

Storage optimization #44

Storage optimization #44

Comments

Uh oh!