
Conversation


@JerAguilon JerAguilon commented Nov 14, 2025

Changelog category (leave one):

  • Performance Improvement

Changelog entry (a user-readable short description):

Add kafka_consumer_reschedule_ms as a tunable Kafka table engine setting in order to adjust how long consumers sleep for new data.

Documentation entry for user-facing changes

  • Documentation is written (mandatory for new features)

This pull request is to address this issue that I reported: #89204

Essentially, if one sets kafka_flush_interval_ms quite low (say, 250 ms), one would expect data to reach the downstream table in roughly 250 ms. However, this is not the case, even when ingestion is happening very quickly. The reason is that there is a hardcoded 500 ms stall that occurs (a) when a consumer sees no messages or (b) when one minute passes.

This means that ingestion can happen much later than expected, as noted in my investigation. To fix this, allow end users to tune this parameter while keeping the 500 ms default.
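A sketch of how the new setting might be used together with a low flush interval. The table name, broker, topic, and consumer group here are illustrative placeholders; only kafka_consumer_reschedule_ms is the setting introduced by this PR:

```sql
-- Hypothetical example: flush every 250 ms and re-poll stalled
-- consumers after 100 ms instead of the previous hardcoded 500 ms.
CREATE TABLE queue (
    timestamp UInt64,
    level String,
    message String
) ENGINE = Kafka
SETTINGS kafka_broker_list = 'localhost:9092',
         kafka_topic_list = 'topic',
         kafka_group_name = 'group1',
         kafka_format = 'JSONEachRow',
         kafka_flush_interval_ms = 250,
         kafka_consumer_reschedule_ms = 100;
```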


CLAassistant commented Nov 14, 2025

CLA assistant check
All committers have signed the CLA.

@JerAguilon JerAguilon changed the title Reschedule setting Make KAFKA_RESCHEDULE_MS a Kafka table setting Nov 15, 2025
@JerAguilon JerAguilon marked this pull request as ready for review November 15, 2025 01:35
@azat azat self-assigned this Nov 15, 2025
```sql
SELECT level, sum(total) FROM daily GROUP BY level;
```
To improve performance, received messages are grouped into blocks the size of [max_insert_block_size](../../../operations/settings/settings.md#max_insert_block_size). If the block wasn't formed within [stream_flush_interval_ms](/operations/settings/settings#stream_flush_interval_ms) milliseconds, the data will be flushed to the table regardless of the completeness of the block.

main_configs=["configs/kafka.xml", "configs/named_collection.xml"],
user_configs=["configs/users.xml"],
with_kafka=True,
with_zookeeper=True, # For Replicated Table
Member


Suggested change
with_zookeeper=True, # For Replicated Table
with_zookeeper=True, # For Kafka2

instance.wait_for_log_line(f"{kafka_table}.*Committed offset 20000")

logging.debug("Timestamps: %s", instance.query(f"SELECT max(consume_ts), min(consume_ts) FROM test.{kafka_table}_destination"))
assert int(instance.query(f"SELECT max(consume_ts) - min(consume_ts) FROM test.{kafka_table}_destination")) < 8
Member


What is this test about? Why 8?

Author


Ah, this should be removed. I used test_batch_slow_1.py as a template, and this test came from there. Sorry for the confusion. I will address it on Monday alongside your suggestions.

instance.wait_for_log_line(f"{kafka_table}.*Committed offset 50")

# Wait a bit for stream to stall and log the reschedule message
time.sleep(2.0)
Member


I guess we can remove this, since wait_for_log_line should have retries that total more than 2 seconds.

