Kafka streams: consumer poll timeout has expired #45151
-
Describe the bug
Our service uses the quarkus-kafka-streams@3.8.5 extension to handle Kafka messages. Everything was working fine until we added some new logic that takes more time per message. Since then the service no longer receives or consumes any Kafka messages. The service itself is not down; it just stopped consuming. Any help or suggestion is appreciated, thanks in advance.

Expected behavior
No response

Actual behavior
No response

How to Reproduce?
No response

Output of
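For context, this symptom matches per-record processing time growing past the consumer's max.poll.interval.ms: the consumer's heartbeat thread logs "consumer poll timeout has expired", leaves the group, and the application stops receiving records even though the process is still up. Below is a minimal sketch of the kind of topology involved, with hypothetical topic names, application id, and a deliberately slow step standing in for the new logic; it assumes plain Kafka Streams rather than the Quarkus-managed bootstrap.

```java
import java.time.Duration;
import java.util.Properties;

import org.apache.kafka.common.serialization.Serdes;
import org.apache.kafka.streams.KafkaStreams;
import org.apache.kafka.streams.StreamsBuilder;
import org.apache.kafka.streams.StreamsConfig;
import org.apache.kafka.streams.kstream.Consumed;
import org.apache.kafka.streams.kstream.Produced;

public class SlowTopologySketch {

    public static void main(String[] args) {
        Properties props = new Properties();
        props.put(StreamsConfig.APPLICATION_ID_CONFIG, "orders-processor");   // hypothetical id
        props.put(StreamsConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");

        StreamsBuilder builder = new StreamsBuilder();
        builder.stream("orders-in", Consumed.with(Serdes.String(), Serdes.String()))
               // The "new logic" from the report: each record now takes noticeably longer.
               // If the total time between poll() calls exceeds max.poll.interval.ms,
               // the consumer logs "consumer poll timeout has expired" and leaves the group.
               .mapValues(SlowTopologySketch::slowEnrichment)
               .to("orders-out", Produced.with(Serdes.String(), Serdes.String()));

        KafkaStreams streams = new KafkaStreams(builder.build(), props);
        streams.start();
        Runtime.getRuntime().addShutdownHook(new Thread(() -> streams.close(Duration.ofSeconds(10))));
    }

    // Placeholder for the slower per-message logic described in the report.
    private static String slowEnrichment(String value) {
        try {
            Thread.sleep(2_000); // simulate expensive work per record
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
        }
        return value.toUpperCase();
    }
}
```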
Replies: 5 comments
-
/cc @alesj (kafka,kafka-streams), @cescoffier (kafka), @gunnarmorling (kafka-streams), @ozangunalp (kafka,kafka-streams), @rquinio (kafka-streams)
-
/cc @alesj (kafka,kafka-streams), @cescoffier (kafka), @gunnarmorling (kafka-streams), @ozangunalp (kafka,kafka-streams), @rquinio (kafka-streams)
-
You can increase the max.poll.interval.ms consumer setting. In the case of a missed poll timeout, however, I am not sure how Kafka Streams behaves; it may go into the rebalancing state and stay there forever because its consumer is kicked out of the group. Hope this helps.
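A sketch of how this can be raised for the Streams consumer in plain Kafka Streams code follows; the values and application id are illustrative, and in a Quarkus app the same client-level keys would normally be supplied through configuration rather than built by hand.

```java
import java.util.Properties;

import org.apache.kafka.clients.consumer.ConsumerConfig;
import org.apache.kafka.streams.StreamsConfig;

public class PollTimeoutConfigSketch {

    // Builds Streams properties that give each poll loop more headroom.
    // Tune the numbers to the real per-record processing time.
    static Properties streamsProperties() {
        Properties props = new Properties();
        props.put(StreamsConfig.APPLICATION_ID_CONFIG, "orders-processor"); // hypothetical id
        props.put(StreamsConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");

        // Allow up to 15 minutes between poll() calls before the consumer is
        // considered failed (the default is 5 minutes). consumerPrefix() maps the
        // key to "consumer.max.poll.interval.ms" so Streams forwards it to its consumers.
        props.put(StreamsConfig.consumerPrefix(ConsumerConfig.MAX_POLL_INTERVAL_MS_CONFIG), 900_000);

        // Fetch fewer records per poll so a slow per-record step still finishes
        // the batch within the interval.
        props.put(StreamsConfig.consumerPrefix(ConsumerConfig.MAX_POLL_RECORDS_CONFIG), 100);
        return props;
    }
}
```

This also illustrates why a consumer-level key that is not declared in StreamsConfig can still be passed through: Kafka Streams forwards any property carrying the "consumer." prefix to its internal consumers.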
-
I've tried overriding the related configurations as follows, but they have no effect.
I guess the reason is that they are not defined/declared in StreamsConfig.class.
-
So actually the correct configuration is:
For our case, the consuming rate does not seem to be the root cause. For some reason the stream threads stopped consuming messages => missed a poll => got kicked out of the consumer group. During that period, node CPU utilization was really high, so we suspect that could have affected the consumer pod, but we're not sure.
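Since the threads went quiet without the service crashing, one way to make that visible (and to correlate it with the suspected CPU starvation) is a state listener on the KafkaStreams instance, so a transition out of RUNNING shows up in the logs instead of passing silently. This is a plain Kafka Streams sketch; in a Quarkus app the KafkaStreams instance is created and started by the extension, so wiring a listener in may differ there.

```java
import org.apache.kafka.streams.KafkaStreams;

public class StateListenerSketch {

    // Logs state transitions, so a stream thread that stops polling (e.g. starved
    // of CPU) appears as RUNNING -> REBALANCING rather than going quiet.
    // Must be registered before streams.start().
    static void watch(KafkaStreams streams) {
        streams.setStateListener((newState, oldState) -> {
            System.out.printf("Kafka Streams state changed: %s -> %s%n", oldState, newState);
            if (newState == KafkaStreams.State.ERROR) {
                // Hypothetical reaction: alert, or fail a health check so the pod restarts.
                System.err.println("Kafka Streams entered ERROR state");
            }
        });
    }
}
```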