Add retention by time and size [INK-251] by ivanyu · Pull Request #325 · aiven/inkless

Open · wants to merge 6 commits into main from ivanyu/ink-251-retention

Conversation

@ivanyu (Member) commented Jun 16, 2025

This PR adds retention by time and size (retention.ms, retention.bytes) to both control planes, plus support for it on the broker side.

Note to the reviewer: I tried to decompose this into separate logical commits; it should be easier to review step by step.
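For reviewers trying the feature out: retention.ms and retention.bytes are the standard Kafka topic configs, so they can be set with the regular Admin client. A minimal sketch (the broker address and topic name are placeholders):

```java
import java.util.List;
import java.util.Map;

import org.apache.kafka.clients.admin.Admin;
import org.apache.kafka.clients.admin.AdminClientConfig;
import org.apache.kafka.clients.admin.AlterConfigOp;
import org.apache.kafka.clients.admin.ConfigEntry;
import org.apache.kafka.common.config.ConfigResource;
import org.apache.kafka.common.config.TopicConfig;

public class SetRetention {
    public static void main(String[] args) throws Exception {
        try (Admin admin = Admin.create(
                Map.of(AdminClientConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092"))) {
            ConfigResource topic = new ConfigResource(ConfigResource.Type.TOPIC, "my-topic");
            List<AlterConfigOp> ops = List.of(
                    // Keep data for at most 7 days ...
                    new AlterConfigOp(new ConfigEntry(TopicConfig.RETENTION_MS_CONFIG, "604800000"),
                            AlterConfigOp.OpType.SET),
                    // ... or at most 1 GiB per partition, whichever limit is reached first.
                    new AlterConfigOp(new ConfigEntry(TopicConfig.RETENTION_BYTES_CONFIG, "1073741824"),
                            AlterConfigOp.OpType.SET));
            admin.incrementalAlterConfigs(Map.of(topic, ops)).all().get();
        }
    }
}
```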

@ivanyu force-pushed the ivanyu/ink-251-retention branch 9 times, most recently from 9d3f261 to f6952ed on June 16, 2025 12:40
@ivanyu force-pushed the ivanyu/ink-251-retention branch 3 times, most recently from d54decb to 5158167 on June 16, 2025 14:46
@ivanyu force-pushed the ivanyu/ink-251-retention branch 2 times, most recently from 107f5e1 to 8ad45cd on June 16, 2025 15:22
@ivanyu marked this pull request as ready for review on June 16, 2025 15:27
@ivanyu requested a review from jeqo on June 16, 2025 15:27
@ivanyu force-pushed the ivanyu/ink-251-retention branch from 8ad45cd to 059cfd7 on June 16, 2025 15:55
long bytesDeleted = 0;

// Enforce the size retention.
if (request.retentionBytes() >= 0) {
Contributor:

Should we include logInfo.byteSize > request.retentionBytes() as a condition before starting to check each batch?

@ivanyu (Member, Author):

A good shortcut, yes! I'll update

@ivanyu (Member, Author):

Done. Btw on the SQL side this is implicitly done by the combination of WHERE and LIMIT
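For illustration, a minimal sketch of what the agreed-upon shortcut could look like in the in-memory control plane (names taken from the excerpt above; the surrounding loop is elided):

```java
// Sketch only: skip the per-batch scan entirely when the log is already
// within the size limit.
if (request.retentionBytes() >= 0 && logInfo.byteSize > request.retentionBytes()) {
    long bytesDeleted = 0;
    // ... walk the batches oldest-first, marking them for deletion until
    // logInfo.byteSize - bytesDeleted <= request.retentionBytes() ...
}
```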

}
}

// Enforce the time retention.
if (request.retentionMs() >= 0) {
Contributor:

Maybe we could have a similar early validation here by adding the oldest batch max timestamp to the log info

@ivanyu (Member, Author):

In contrast to size, the oldest batch max timestamp is known right away as we start looking at the batches (setting aside rare situations where a rogue batch in the middle has a much older timestamp than its neighbors), so this shortcut probably wouldn't give us anything.
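To make the argument concrete, a hypothetical sketch of the time-retention scan (BatchInfo, batchesOldestFirst, and markForDeletion are illustrative names, not the PR's actual API):

```java
// Sketch only: batches are visited oldest-first, so the scan stops at the
// first batch still within retention; a precomputed oldest timestamp in the
// log info would not save any work here.
final long cutoff = timeMs - request.retentionMs();
for (final BatchInfo batch : batchesOldestFirst) {
    if (batch.maxTimestamp() >= cutoff) {
        break;  // this batch and all newer ones are retained
    }
    markForDeletion(batch);
}
```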

logInfo.byteSize -= bytesDeleted;
if (coordinates.isEmpty()) {
    logInfo.logStartOffset = logInfo.highWatermark;
    assert logInfo.byteSize == 0;
Contributor:

nit: given that assertions are disabled by default, should we instead throw a runtime exception here, or suggest enabling assertions?

@ivanyu (Member, Author):

I expected it to work only in tests, not at prod run time.
But thinking again: InMemoryControlPlane is not a prod control plane anyway, so we can fail as loudly as we like with it. I've replaced this with a proper if.
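A sketch of the replacement, assuming IllegalStateException is an acceptable way to fail loudly:

```java
// Sketch only: unlike `assert`, this fails even when JVM assertions (-ea)
// are disabled, which is acceptable here because InMemoryControlPlane is
// not used in production.
if (coordinates.isEmpty()) {
    logInfo.logStartOffset = logInfo.highWatermark;
    if (logInfo.byteSize != 0) {
        throw new IllegalStateException(
            "byteSize must be 0 when all batches are deleted, but was " + logInfo.byteSize);
    }
}
```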

Comment on lines 100 to 101
final LogConfig topicConfig = logConfigCache.get(partition.topicPartition(),
        tp -> LogConfig.fromProps(metadataView.getDefaultConfig(), metadataView.getTopicConfig(tp.topic())));
Contributor:

Isn't the metadata view already a cached view of the metadata on KRaft? I wonder if we could use it directly instead of adding a cache dependency here.

@ivanyu (Member, Author):

It is cached, but the problem is the LogConfig instantiation. Like in 847798c, though not from the memory-pressure point of view but because we do quite a bit of checks and validation inside.
We could of course hand-rewrite them here too, but that starts to be a bit fragile.

@ivanyu (Member, Author):

After offline discussion, I removed the cache. Left only a map to prevent multiple instantiations during a single run.
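A sketch of the per-run map approach (partitionsToEnforce and the surrounding types are assumptions; the LogConfig.fromProps call mirrors the excerpt above):

```java
// Sketch only: the map lives for a single enforcement run, so each topic's
// LogConfig is instantiated (and validated) at most once per run, and no
// cross-run staleness is possible.
final Map<String, LogConfig> logConfigs = new HashMap<>();
for (final PartitionInfo partition : partitionsToEnforce) {
    final LogConfig topicConfig = logConfigs.computeIfAbsent(
        partition.topicPartition().topic(),
        topic -> LogConfig.fromProps(
            metadataView.getDefaultConfig(),
            metadataView.getTopicConfig(topic)));
    // ... enforce retention for this partition using topicConfig ...
}
```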

/**
 * The class responsible for scheduling per-partition retention enforcement.
*/
class RetentionEnforcementScheduler {
Contributor:

Maybe worth adding a documentation note on how this scheduler is expected to behave in a distributed environment: e.g. there's no coordination, the scheduling times are randomized to avoid collisions, what would happen on a collision (probably nothing, as the control plane handles the concurrency), etc.
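A sketch of the kind of note (plus the jitter it describes); the class body and method names are illustrative, not the PR's actual code:

```java
import java.util.concurrent.ThreadLocalRandom;

/**
 * Schedules per-partition retention enforcement.
 *
 * <p>There is no cross-broker coordination: every broker schedules
 * enforcement independently, and each partition's next enforcement time is
 * randomized (jittered) so that brokers are unlikely to enforce the same
 * partition simultaneously. A collision is harmless: the control plane
 * handles the concurrency, and the second enforcement simply finds nothing
 * left to delete.
 */
class RetentionEnforcementScheduler {
    // Sketch only: jitter the base interval by +/-50%.
    private long nextDelayMs(final long baseIntervalMs) {
        final double factor = 0.5 + ThreadLocalRandom.current().nextDouble();  // [0.5, 1.5)
        return (long) (baseIntervalMs * factor);
    }
}
```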

@@ -411,6 +412,8 @@ class ReplicaManager(val config: KafkaConfig,

// Inkless threads
inklessSharedState.map { sharedState =>
  scheduler.schedule("inkless-retention-enforcer", () => inklessRetentionEnforcer.foreach(_.run()), 500L, 500L) // the real interval is inside
Contributor:

Maybe we could start using LOG_INITIAL_TASK_DELAY_MS_DEFAULT as the initial delay.
About the frequency: does it need to be this small?
What if we piggyback on log.retention.check.interval.ms (default 5 min)?
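A sketch of the suggested wiring; LOG_INITIAL_TASK_DELAY_MS_DEFAULT lives in Kafka's ServerLogConfigs in recent versions, while the enforcer handle and the config.logCleanupIntervalMs() accessor are stand-ins for however the broker exposes these:

```java
// Sketch only: reuse Kafka's defaults instead of a hard-coded 500 ms period.
// Upstream, log.initial.task.delay.ms defaults to 30 s and
// log.retention.check.interval.ms to 5 min.
scheduler.schedule(
    "inkless-retention-enforcer",
    () -> retentionEnforcer.run(),                       // hypothetical enforcer handle
    ServerLogConfigs.LOG_INITIAL_TASK_DELAY_MS_DEFAULT,  // initial delay
    config.logCleanupIntervalMs());                      // period, hypothetical accessor
```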


l_base_offset_of_first_batch_to_keep = NULL;

IF l_request.retention_bytes >= 0 OR l_request.retention_ms >= 0 THEN
Contributor:

A similar suggestion as for the in-memory control plane: use the log info for an early quick check of whether the retention check is needed at all.

@ivanyu force-pushed the ivanyu/ink-251-retention branch from da4c4bb to 994fb1f on June 17, 2025 11:35
Remove `LogConfig` cache. Leave only a map to prevent multiple instantiations in a single run.