Multi node pipeline executor #4070

dolavb · 2025-01-22T20:13:11Z

This allows clients to provide an ExecutorService at pipeline creation instead of having a new ExecutorService created on every pipeline sync call. Creating an ExecutorService creates new threads, which are expensive resources to create. In a high-throughput application developed internally, we write at a rate of ~100k sets per second to a six-node cluster, on an instance equivalent to an EC2 m5.12xlarge. Thread creation uses 40% of CPU and adds substantial latency. This new approach lets clients pass in a pooled executor that is tuned to their load patterns.

This change is non-breaking, and it comes with a slight optimization for clients that keep using the internally created thread pool. In the current approach, even if a pipeline has a single connection to close, the executor service will create MULTI_NODE_PIPELINE_SYNC_WORKERS threads. In the default mode, that means wasting 2 thread creations.

From: https://stackoverflow.com/questions/5483047/why-is-creating-a-thread-said-to-be-expensive

Thread lifecycle overhead. Thread creation and teardown are not free. The actual overhead varies across platforms, but thread creation takes time, introducing latency into request processing, and requires some processing activity by the JVM and OS. If requests are frequent and lightweight, as in most server applications, creating a new thread for each request can consume significant computing resources.

From Java Concurrency in Practice
By Brian Goetz, Tim Peierls, Joshua Bloch, Joseph Bowbeer, David Holmes, Doug Lea
Print ISBN-10: 0-321-34960-1

Java thread creation is expensive because there is a fair bit of work involved:

A large block of memory has to be allocated and initialized for the thread stack.

System calls need to be made to create / register the native thread with the host OS.

Descriptors need to be created, initialized and added to JVM-internal data structures.

This allows passing an ExecutorService when creating a ClusterPipeline. The previous parallelization approach for pipeline syncing/closing would create a new executor service for each sync operation, resulting in excessive thread creation and termination. On an EC2 m5.12xlarge instance with ~100k single writes/sec, this thread creation consumed 40% CPU and increased operation latency. The change also optimizes thread usage when no ExecutorService is provided. Previously, even a single pipeline within a multipipeline would create 3 threads for syncing. This improvement removes that overhead, though callers are encouraged to provide their own ExecutorService for optimal CPU usage and latency.
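To illustrate the cost difference described above, here is a minimal, self-contained sketch that counts thread creations for the two patterns: a fresh fixed-size pool per sync call versus one pool shared across calls. It does not use the Jedis API. The class `SharedExecutorSketch`, its methods, the `WORKERS` constant, and the no-op tasks standing in for per-node work are all hypothetical and exist only for this demonstration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.concurrent.ThreadFactory;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical demo class; none of these names come from Jedis.
class SharedExecutorSketch {

    // Assumed worker count, mirroring the "3 threads" default mentioned above.
    static final int WORKERS = 3;

    // Thread factory that counts how many threads it is asked to create.
    static final class CountingFactory implements ThreadFactory {
        final AtomicInteger created = new AtomicInteger();

        @Override
        public Thread newThread(Runnable r) {
            created.incrementAndGet();
            Thread t = new Thread(r);
            t.setDaemon(true); // do not keep the JVM alive for demo threads
            return t;
        }
    }

    // Submits one no-op task per node (a stand-in for per-node sync work)
    // and blocks until all of them have completed.
    static void syncAll(ExecutorService executor, int nodes) {
        List<Future<?>> pending = new ArrayList<>();
        for (int i = 0; i < nodes; i++) {
            pending.add(executor.submit(() -> { }));
        }
        try {
            for (Future<?> f : pending) {
                f.get();
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        } catch (ExecutionException e) {
            throw new IllegalStateException(e);
        }
    }

    // Pattern 1: build and shut down a pool on every sync call.
    static int threadsCreatedWithPoolPerSync(int syncCalls, int nodes) {
        CountingFactory factory = new CountingFactory();
        for (int i = 0; i < syncCalls; i++) {
            ExecutorService perCall = Executors.newFixedThreadPool(WORKERS, factory);
            try {
                syncAll(perCall, nodes);
            } finally {
                perCall.shutdown();
            }
        }
        return factory.created.get();
    }

    // Pattern 2: the caller owns one pool and reuses it for every sync call.
    static int threadsCreatedWithSharedPool(int syncCalls, int nodes) {
        CountingFactory factory = new CountingFactory();
        ExecutorService shared = Executors.newFixedThreadPool(WORKERS, factory);
        try {
            for (int i = 0; i < syncCalls; i++) {
                syncAll(shared, nodes);
            }
        } finally {
            shared.shutdown();
        }
        return factory.created.get();
    }

    public static void main(String[] args) {
        System.out.println("pool per sync: " + threadsCreatedWithPoolPerSync(100, 6));
        System.out.println("shared pool:   " + threadsCreatedWithSharedPool(100, 6));
    }
}
```

In this sketch the per-call pattern creates threads in proportion to the number of sync calls, while the shared pool stays at `WORKERS` threads no matter how many calls are made. With a shared pool, the caller is also the one responsible for shutting it down.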

sazzad16 · 2025-01-23T10:15:28Z

@dolavb Thank you for your effort to improve Jedis.
Your concern is justified. But we have our hands full ATM. We'll try to get to this PR ASAP.

dolavb and others added 2 commits January 17, 2025 13:50

Merge branch 'redis:master' into MultiNodePipelineExecutor

a50568a

dolavb marked this pull request as ready for review January 24, 2025 01:51
