Unexpected higher latencies during rolling upgrade #8190
helenapoleri
started this conversation in
General
Replies: 1 comment 4 replies
-
I don't think nobody can discuss this without more details:
Without that, there is not much to discuss. Every broker restart creates disruption, leaderships change, client reconnects happen, replicas have to catch up. So there will be always visible impact to it. |
Beta Was this translation helpful? Give feedback.
4 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Describe the bug
We have a setup with 3 Kafka nodes. We have a system A producing records to Kafka, which are consumed by a system B. System B does some processing and produces response records to be consumed by system A.
We are experiencing unexpected higher latencies end-to-end latencies when using Strimzi and performing a rolling upgrade. We are using the default configurations.
End-to-end latency (measured at system A):

We have tried doing a manual restart of the Kafka pods (by killing the Kafka process PID (with a SIGTERM) for each pod and waiting for the latencies to stabilize) and we are not seeing the same behaviour.
End-to-end latency (measured at system A:
While it might be expected that during a rolling upgrade we see a spike in latencies, we were not expecting to see such a big difference between the manual restarts and the rolling upgrade.
To Reproduce
We are reproducing by just triggering a rolling upgrade (with no changes).
Expected behavior
We are expecting at least to have similar latencies to when we perform the restarts manually, but we also don't know whether this is expected behaviour using Kafka.
Environment (please complete the following information):
YAML files and logs
Kafka cluster:
Additional context
Beta Was this translation helpful? Give feedback.
All reactions