Add method to asynchronously prepare CQL statements #1239

lukasz-antoniak · 2024-12-02T10:35:53Z

Add an option to asynchronously prepare CQL statements.

I think that proposed implementation has one difference in behaviour that I was not able to solve - after calling prepare() CQL statement was prepared on all nodes when this option was selected. After the change, when calling prepare_async(), future completes only after triggering preparation on all other nodes (operation was not completed). Maybe we should keep implementation of both functions separated and preserve synchronous behaviour.

lukasz-antoniak · 2024-12-03T14:14:54Z

I have implemented a small fix to preserve synchronous behaviour of prepare logic in the prepare() call.

absurdfarce

I think it's worth our while trying to figure out a way to implement this using something like a Future.then() syntax to return a future which does the prepare and then calls prepare_on_all_hosts() when that's successful. This will take some more plumbing to support this use case but it's a much simpler implementation and such an impl maps naturally onto this use case.

absurdfarce · 2024-12-10T21:40:36Z

cassandra/cluster.py

-    def _create_response_future(self, query, parameters, trace, custom_payload,
-                                timeout, execution_profile=EXEC_PROFILE_DEFAULT,
-                                paging_state=None, host=None):
+    def prepare_async(self, query, custom_payload=None, keyspace=None, prepare_on_all_hosts=None):


This function should live near the impl for prepare(). So either it should be moved down below prepare() or we should bring prepare() up here.

absurdfarce · 2024-12-11T06:59:53Z

cassandra/cluster.py

@@ -5105,6 +5130,49 @@ def __str__(self):
    __repr__ = __str__


+class PrepareFuture(ResponseFuture):


I'm extremely skeptical of the idea of extending ResponseFuture to a prepare-specific future implementation. There's a lot of functionality in ResponseFuture and we'd have to make sure everything we need there was duplicated here... and it's easy to miss things. Specifically ResponseFuture already has a lot of logic for dealing with prepare statements + responses... I'd rather find a way to re-use that and handle the prepare-on-all-hosts ops via callbacks (or perhaps something better) rather than subclass the future impl.

absurdfarce · 2024-12-11T07:03:12Z

cassandra/cluster.py

            except Exception:
                log.exception("Error preparing query on all hosts:")
+        return response


This seems exactly right: prepare() should be implemented as prepare_async().get() once we have a good working prepare_async(). But why do we need to do the prepare_on_all_nodes() operation here? We should've already done that when future.result() completed (since it's done in PrepareFuture._set_final_result()).

absurdfarce · 2024-12-11T07:11:30Z

cassandra/cluster.py

+            # we are on event loop thread, so do not execute those synchronously
+            session.submit(
+                session.prepare_on_all_nodes,
+                self.query_string, self._current_host, self._keyspace)


It seems strange to me to have this logic embedded within the future impl like this. Seems like this should be done when the future is created, something like:

# _create_prepare_response_future() in this impl returns a regular ResponseFuture future = self._create_prepare_response_future(query, keyspace, custom_payload, prepare_on_all_hosts) if prepare_on_all_hosts: # Hand waving about partial application here in order to pass in parameters; the point is we get to a future that # calls prepare_on_all_hosts() after we've received a response to our prepare here here future = future.then(prepare_on_all_hosts) future._protocol_handler = self.client_protocol_handler

Problem of course is the ResponseFuture doesn't support this then() syntax. It does have native support for callbacks but that isn't the same; that's just a function that gets invoked when the operation completes. We don't return a new future that returns the result of the function defined in the then() call like you do in most future APIs.

I'm wondering if we can either (a) add something like that or (b) find a way to wrap this functionality in another future lib in order to simplify an impl like this.

lukasz-antoniak · 2024-12-11T19:01:53Z

I totally agree with your comments. Previously I could not find another way to reuse ResponseFuture and return PreparedStatement object. Can you check current state of PR?

Add method to asynchronously prepare CQL statements

a47b75a

lukasz-antoniak force-pushed the prepare_ansyc branch from 814dbf2 to a47b75a Compare December 2, 2024 10:37

Preserve synchronous prepare logic when preparing statement on all nodes

0e2903e

lukasz-antoniak added 2 commits December 4, 2024 11:24

Document difference in prepare_on_all_hosts handling

f3b58c7

Test prepare_on_all_hosts with prepare_async function

3453cb1

lukasz-antoniak marked this pull request as ready for review December 6, 2024 09:47

absurdfarce requested changes Dec 11, 2024

View reviewed changes

Refactor prepare_async implementation

5ac7d20

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add method to asynchronously prepare CQL statements #1239

Add method to asynchronously prepare CQL statements #1239

lukasz-antoniak commented Dec 2, 2024 •

edited

Loading

lukasz-antoniak commented Dec 3, 2024

absurdfarce left a comment

absurdfarce Dec 10, 2024

absurdfarce Dec 11, 2024

absurdfarce Dec 11, 2024

absurdfarce Dec 11, 2024

lukasz-antoniak commented Dec 11, 2024

		@@ -5105,6 +5130,49 @@ def __str__(self):
		__repr__ = __str__


		class PrepareFuture(ResponseFuture):

Add method to asynchronously prepare CQL statements #1239

Are you sure you want to change the base?

Add method to asynchronously prepare CQL statements #1239

Conversation

lukasz-antoniak commented Dec 2, 2024 • edited Loading

lukasz-antoniak commented Dec 3, 2024

absurdfarce left a comment

Choose a reason for hiding this comment

absurdfarce Dec 10, 2024

Choose a reason for hiding this comment

absurdfarce Dec 11, 2024

Choose a reason for hiding this comment

absurdfarce Dec 11, 2024

Choose a reason for hiding this comment

absurdfarce Dec 11, 2024

Choose a reason for hiding this comment

lukasz-antoniak commented Dec 11, 2024

lukasz-antoniak commented Dec 2, 2024 •

edited

Loading