Consumers as k8s deployment - failing with rebalances when scaling #6440
-
I think running the clients in one consumer group as a single Deployment is fine. I don't think that alone is the problem. But I'm afraid I have never used the .NET client, and I have never seen a problem like this in the Java client.
-
So I've managed to get more verbose logging from librdkafka. The whole log of the failing consumer is below, but here are some points I've noticed, in chronological order:
Last correctly processed message.
Successful commit of that last message
The consumer notices the rebalance for the first time, during a heartbeat (there are 6 partitions and we are scaling consumers from 2 to 6)
Consumer is dumping all its partition assignments
Done
Consumer got new partition assignments
Consumer receives the first message from a newly assigned partition
Trying to commit it with the old generation ID 204. It still doesn't have the new generation ID after the rebalance!
Now the consumer receives the new generation ID 205. But this is too late, since it has already tried to commit the message.
Commit with generation ID 204 rightfully rejected
Exception bubbled up to our application. The log continues for a little while with another rebalance (in a librdkafka thread), but at this point the application is already shutting down due to the unhandled exception and is going to kill that Kafka thread in a second. So, what feels weird to me is that the consumer actually receives a message, processes it and tries to commit it even before the rebalance is complete and the JoinGroup response with the new generation ID has been received. If so, then it sounds like a problem in the rebalance implementation in librdkafka?
Whole log:
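Separately from the log, to make the suspected race concrete: a minimal sketch (not our actual code; the helper name and the decision to just log and continue are illustrative) of how the commit could tolerate the stale-generation rejection instead of letting the exception crash the pod. Since the offset is simply not committed, the message would be reprocessed by whichever consumer owns the partition after the rebalance, at the cost of a duplicate delivery.

```csharp
using System;
using Confluent.Kafka;

static class CommitHelper
{
    // Hypothetical helper: commit a processed message, but tolerate the case where a
    // rebalance bumped the group generation between Consume() and Commit().
    public static void CommitTolerantly(IConsumer<string, string> consumer,
                                        ConsumeResult<string, string> result)
    {
        try
        {
            consumer.Commit(result);
        }
        catch (KafkaException e) when (!e.Error.IsFatal)
        {
            // e.g. an ILLEGAL_GENERATION / rebalance-in-progress style error: the partition
            // has been (or is being) reassigned, so the uncommitted message will simply be
            // reprocessed by the new owner. Log and continue instead of rethrowing.
            Console.WriteLine($"Commit skipped after rebalance: {e.Error.Reason}");
        }
    }
}
```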
-
Just FYI, I've also opened an issue on Confluent's repo, since it really seems to be very specific to their .NET implementation (or maybe even librdkafka itself), in case you want to follow along: confluentinc/confluent-kafka-dotnet#1771 I'll let you know if we manage to resolve it there.
-
Hello,
we are having quite strange issues with our consumers in k8s.
We are using Strimzi for "broker side" - mostly with defaults, running 3 instances of brokers and ZKs (quay.io/strimzi/kafka:0.25.0-kafka-2.7.0).
On the client side, we have a simple .NET app using Confluent's .NET Kafka SDK (basically just a wrapper around librdkafka). We have a k8s Deployment for each consumer group, the motivation being that we can scale clients as traffic demands.
When we start the whole DP at once (let's say 3 instances), everything starts up fine. All clients join the empty consumer group, get partitions assigned and start working.
But the problem arises when we try to scale that DP. Once we increase the instance count (to 6 for example), the new instances seem to join the group correctly and start consuming messages. But the already existing instances fail with an exception:
And to make things worse, those instances are terminated and k8s instantly tries to create new ones to replace them. This puts us into an infinite failure loop of new instances causing old ones to fail, so the whole DP is useless at that moment.
All related problems I've found so far deal with various timeouts - specifically session timeout, fetch timeouts, maxBytes, auto commit handling etc. But even after various experiments with those parameters, I'm still not able to make it work.
Our settings (sketched as a ConsumerConfig after this list):
SessionTimeout - default 45 sec
AutoCommit - disabled, we are committing manually once a message is processed
FetchMaxBytes - default 52MB
HeartBeatInterval - default 3 sec
SocketTimeout - default 60 sec
Message size - around 1KB
One message process time - around 100ms (without any significant deviations)
Consumers are "way behind" at this stage - so on each start they have enough data to catch up on and are probably fetching the max amount of data all the time
We are calling consumer.Close() on exception (and we observe a rebalance starting right after the instance fails)
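To make those settings concrete, this is roughly how they map onto a Confluent.Kafka ConsumerConfig (a sketch; the bootstrap address and group ID are placeholders, and AutoOffsetReset is an assumption on my part):

```csharp
using Confluent.Kafka;

// Sketch of the configuration described above; broker address and group ID are placeholders.
var config = new ConsumerConfig
{
    BootstrapServers = "my-cluster-kafka-bootstrap:9092", // Strimzi bootstrap service (placeholder)
    GroupId = "my-consumer-group",                        // one k8s Deployment per consumer group
    EnableAutoCommit = false,                             // we commit manually after processing
    SessionTimeoutMs = 45000,                             // default 45 sec
    HeartbeatIntervalMs = 3000,                           // default 3 sec
    FetchMaxBytes = 52428800,                             // default ~52 MB
    SocketTimeoutMs = 60000,                              // default 60 sec
    AutoOffsetReset = AutoOffsetReset.Earliest            // assumption; consumers are far behind anyway
};
```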
What I've tried (separately):
I'm pretty sure there are no "long processing" issues - like some message that would take a long time and then fail. With the failing instances, I can see in the log that it "flies", processing several messages per second, and then suddenly fails with the aforementioned exception.
I'm not sure if that is enough information for somebody to see the problem right away (which would be great!).
But I'm also interested in "best practices" for running consumers in k8s. I thought that running each consumer group as a DP actually fits perfectly with a DP's purpose, since each client is kind of stateless and the whole system should be able to scale up and down (and reschedule between nodes etc.) as needed.
But this "infinite fail loop" really terrifies me. Even if we have some timeouts wrong, there would always be the possibility that one "bad message" could kill the whole group. I'm still thinking that we are doing something wrong, since this has to be the way "everybody is running it", right?
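For reference, the overall shape of one consumer instance looks roughly like this (a simplified sketch, not the actual service code; the topic name, config values and the Process step are placeholders):

```csharp
using System;
using System.Threading;
using Confluent.Kafka;

// Placeholder config; in reality it is the one described above.
var config = new ConsumerConfig
{
    BootstrapServers = "my-cluster-kafka-bootstrap:9092",
    GroupId = "my-consumer-group",
    EnableAutoCommit = false
};

using var cts = new CancellationTokenSource();
// Ctrl+C shown for simplicity; in the real service shutdown is wired to the host's
// SIGTERM handling so a k8s scale-down also triggers a clean close.
Console.CancelKeyPress += (s, e) => { e.Cancel = true; cts.Cancel(); };

using var consumer = new ConsumerBuilder<string, string>(config)
    .SetPartitionsAssignedHandler((c, parts) => Console.WriteLine($"Assigned: {string.Join(",", parts)}"))
    .SetPartitionsRevokedHandler((c, parts) => Console.WriteLine($"Revoked: {string.Join(",", parts)}"))
    .Build();

consumer.Subscribe("my-topic");
try
{
    while (!cts.IsCancellationRequested)
    {
        var result = consumer.Consume(cts.Token); // ~1 KB messages
        Process(result.Message.Value);            // ~100 ms of business logic per message
        consumer.Commit(result);                  // manual commit after processing
    }
}
catch (OperationCanceledException)
{
    // shutting down
}
finally
{
    consumer.Close(); // leave the group cleanly so the remaining instances rebalance quickly
}

// Placeholder for the actual processing step.
static void Process(string value) => Thread.Sleep(100);
```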
I will be very grateful for any ideas towards resolving this, or just your experience with how you run consumers in k8s.
Thanks a lot! Jakub