consume: cache batches, not fetches #423
Closed
+351
−540
Change the caching to cache topic-partition batches instead of whole fetch responses. Cache entries are smaller, their keys can be computed directly from the batch coordinates, and only the relevant batches are returned; the large fetch data blob no longer needs to be handled. With Infinispan caching, the topic-partition batches are distributed across the cache cluster, which will likely lead to loading more entries from neighbor nodes on clusters with more than one broker per AZ. Those entries are smaller, though: with fetch caching the worst case is loading a full fetch response from a neighbor. In the OMB benchmarking, the average fetch entry was 4 MiB, while the average batch entry is 500 KiB.
Cache sizing must be changed accordingly.
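The batch-keyed lookup described above can be sketched as follows. This is a minimal illustration, not code from this PR: `BatchCoordinate` and `BatchCache` are hypothetical names, and a plain map stands in for the Infinispan cache. The point is that a consumer fetch resolves to per-batch cache keys, so only the batches covering the requested offsets are loaded.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical key: a batch is addressed by its coordinates, so a lookup
// needs no fetch-level blob at all.
record BatchCoordinate(String topic, int partition, long baseOffset) {}

public class BatchCache {
    // In the real change this would be an Infinispan cache; a plain
    // concurrent map is enough to show the lookup shape.
    private final Map<BatchCoordinate, byte[]> cache = new ConcurrentHashMap<>();

    public void put(BatchCoordinate key, byte[] batch) {
        cache.put(key, batch);
    }

    // Return only the cached batches covering [fetchOffset, fetchOffset + maxBatches),
    // assuming (for simplicity) one batch per base offset.
    public List<byte[]> fetch(String topic, int partition, long fetchOffset, int maxBatches) {
        List<byte[]> result = new ArrayList<>();
        for (long offset = fetchOffset; offset < fetchOffset + maxBatches; offset++) {
            byte[] batch = cache.get(new BatchCoordinate(topic, partition, offset));
            if (batch == null) {
                break; // cache miss: the remaining batches must come from the broker
            }
            result.add(batch);
        }
        return result;
    }
}
```

Because each entry is a single batch rather than a full fetch response, a miss or a remote load from a neighbor node moves at most one batch-sized payload.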