Add host replacement to tracked keyspaces #4396

bdeggleston · 2025-09-25T20:23:09Z

No description provided.

aratno · 2025-09-27T03:11:59Z

src/java/org/apache/cassandra/replication/MutationTrackingService.java

+    // for correctness vs complex protocols topology updates. You could make the case that mutable state would be
+    // a better tradeoff for node replacement, but it seems likely that handling token movements will be simpler
+    // if we use a copy on write pattern for topology changes.
+    private final ReentrantReadWriteLock shardLock = new ReentrantReadWriteLock();


Practically it should be fine to keep this lock unfair, but I wonder if we'll find workloads with high read and write query throughput to starve topology changes. Could be worth using StampedLock here, we don't seem to require reentrancy.

I agree this is probably fine, but it would be worth looking at better solutions. Would you mind if we just added a TODO to consider better options here before merge, instead of addressing it in this ticket?

Yeah definitely, fine to defer

aratno · 2025-09-27T03:48:21Z

src/java/org/apache/cassandra/streaming/StreamPlan.java

+     */
+    private boolean isTrackedReplicationEnabled(String keyspace)
+    {
+        return ClusterMetadata.current().schema.getKeyspaceMetadata(keyspace).useMutationTracking();


Null check for dropped keyspace?

StreamPlan assumes that you're only sending/requesting streams for keyspaces that exist, and the rest of the class has the same null unsafety around keyspace lookups. I can add a more descriptive error message if you like, but if the keyspace doesn't exist, other parts of stream plan will have already thrown an NPE before getting here

aratno · 2025-09-27T03:54:08Z

src/java/org/apache/cassandra/streaming/messages/OutgoingMutationLogStreamMessage.java

+
+    public long serializedSize(int version)
+    {
+        return 0;


Intentional?

it is, yes. OutgoingStreamMessage does the same thing since these messages aren't serialized into a buffer, but put directly into the socket.

aratno · 2025-09-27T03:57:04Z

src/java/org/apache/cassandra/streaming/messages/OutgoingMutationLogStreamMessage.java

+        // end-of-stream marker
+        out.writeBoolean(false);
+
+        session.logStreamSent(this);


Should we avoid side effects like timeout scheduling during serde?

this is the same pattern used by sstable streaming

aratno · 2025-09-27T03:59:08Z

src/java/org/apache/cassandra/streaming/messages/IncomingMutationLogStreamMessage.java

+                                     mutation.getKeyspaceName(),
+                                     mutation.key().getToken());
+
+                    mutation.apply();


I was thinking we'd receive all the mutation logs, then do replay to apply all before completing the session, rather than deserializing and applying each at a time

I’d thought about doing that as part of this initial implementation, but it wasn’t really clear what would be gained from the additional state changes. IIRC the motivation for staging sstables before making them visible to reads is to prevent data resurrection, but that’s not a concern with read reconciliation. Additionally, I think we’ll only be doing log streaming like this for pending ranges, so any data written here won’t actually be read until the streams complete successfully anyway.

Add host replacement to tracked keyspaces

eeebf78

aratno reviewed Sep 27, 2025

View reviewed changes

aratno approved these changes Sep 29, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add host replacement to tracked keyspaces #4396

Add host replacement to tracked keyspaces #4396

Uh oh!

bdeggleston commented Sep 25, 2025

Uh oh!

aratno Sep 27, 2025

Uh oh!

bdeggleston Sep 27, 2025

Uh oh!

aratno Sep 29, 2025

Uh oh!

aratno Sep 27, 2025

Uh oh!

bdeggleston Sep 27, 2025

Uh oh!

aratno Sep 27, 2025

Uh oh!

bdeggleston Sep 27, 2025

Uh oh!

aratno Sep 27, 2025

Uh oh!

bdeggleston Sep 27, 2025

Uh oh!

aratno Sep 27, 2025

Uh oh!

bdeggleston Sep 27, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add host replacement to tracked keyspaces #4396

Are you sure you want to change the base?

Add host replacement to tracked keyspaces #4396

Uh oh!

Conversation

bdeggleston commented Sep 25, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants