Add proxy support for TimescaleDB to drop partitions #155
base: release/7.0
Conversation
Adds partition management for the 'proxy_history' table through TimescaleDB, similar to zabbix_server.
The current state of this PR is enough to demonstrate the significant (positive) impact, but there are open questions that should be reviewed / addressed. I will call out the ones I am aware of in a review.
```perl
for ("proxy_history")
{
	print<<EOF
	PERFORM create_hypertable('$_', 'id', chunk_time_interval => 1000000, $flags);
```
In our testing, 1000000 is a good balance between low-throughput and high-throughput systems: it may be a few hours before a partition is dropped on a small proxy, while not too many partitions are created between housekeeping executions on a large proxy.
```c
const char* enable_timescale = getenv("ENABLE_TIMESCALEDB");

if (0 == strcmp(table,"proxy_history") && 0 == strcmp(enable_timescale, "true"))
```
The condition deciding whether partition management should be used instead of `delete from ...` is much simpler than on the server side: we do not need to worry about compression or overrides, since we should never use compression on the proxy, and no global retention overrides are relevant here. However, relying on an environment variable is probably not the best answer. I also don't know if relying on a value in the config table is the right answer (the way the server housekeeper does), especially since the config table doesn't look like it's even populated or ever used on a proxy.
```c
if (0 != config_local_buffer)
	condition = zbx_dsprintf(NULL, " or %s>=%d", clock_field, now - config_local_buffer * SEC_PER_HOUR);

result = zbx_db_select(
		"select coalesce(min(id),%d) from %s"
		" where (id>" ZBX_FS_UI64 " and %s>=%d) %s",
		maxid + 1, table, lastid,
		clock_field, now - config_offline_buffer * SEC_PER_HOUR,
		ZBX_NULL2EMPTY_STR(condition));
zbx_free(condition);

if (NULL == (row = zbx_db_fetch(result)) || SUCCEED == zbx_db_is_null(row[0]))
	goto rollback;

ZBX_STR2UINT64(keep_from, row[0]);
zbx_db_free_result(result);
```
Here we're building a query to determine the minimum id from which we should retain data, in a somewhat inverse way to how the delete below is built.
If we refactor keep_from to be the minimum id of the partition (TimescaleDB chunk) that contains our currently calculated keep_from, then the proper partitions will still be dropped AND the count of records that will be returned (determined below) will be correct, which ultimately gets logged later on.
Refactored keep_from to be `floor(coalesce(min(id),%d)/1000000)*1000000`
… proxy_history table
For high (N)VPS deployments where a proxy is deployed and responsible for lots of data, housekeeping on the proxy can become a bottleneck requiring lots of CPU/memory resources. As such, we're looking to add TimescaleDB (partitioning) support to the `proxy_history` table, along with modifications to the housekeeping process, similar to what the Zabbix server supports. However, in this case we use the `id` column for partitioning since it's an ever-increasing value, with a `chunk_time_interval` of 1,000,000.

This should be paired with zabbix/zabbix-docker#1755