This is a kind of timeout. It means your server (OAP) can't process the data in time.
-
Description:
We encountered a situation where the SkyWalking Java agent logged repeated DEADLINE_EXCEEDED errors while trying to call ServiceManagementClient. Shortly after those errors appeared, multiple threads began consuming 100% of the CPU indefinitely. Thread dumps show these busy threads are stuck in org.apache.skywalking.apm.dependencies.io.netty.util.internal.shaded.org.jctools.queues.BaseMpscLinkedArrayQueue.poll (i.e., MpscUnboundedArrayQueue.poll) within the Netty event loop.
Below are the relevant log messages and a snippet of one such thread stack trace:
And one of the 100% CPU threads shows the following stack:
These threads appear to be in a busy-spin loop (RUNNABLE state) under BaseMpscLinkedArrayQueue.poll(), using 100% CPU.
Environment:
Additional Context or Screenshots:
Collector traceSegment service doesn't response in xxx seconds. ...
grpc-nio-worker-ELG-* threads go into RUNNABLE state in a loop.
Could you please advise if this is a known bug or a configuration issue? Any recommended workaround or fix would be appreciated.
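For anyone trying to confirm the same symptom, here is a minimal diagnostic sketch that uses only the JDK's ThreadMXBean. The class name BusyThreadScan, the five-frame search depth, and the idea of calling scan() from inside the affected JVM (for example from a debug endpoint) are assumptions for illustration and are not part of SkyWalking. The thread-name prefix and the class-name hint are taken from the report above.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.ThreadInfo;
import java.lang.management.ThreadMXBean;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper, not part of the SkyWalking agent.
public class BusyThreadScan {
    // Values taken from the report: worker thread name prefix and the queue class seen in the stacks.
    static final String NAME_PREFIX = "grpc-nio-worker-ELG-";
    static final String FRAME_HINT = "BaseMpscLinkedArrayQueue";
    // Assumed search depth: only the top few frames are inspected.
    static final int MAX_DEPTH = 5;

    /** True if the thread is RUNNABLE, has the expected name prefix, and shows the hinted class near the top of its stack. */
    static boolean isSuspect(String name, Thread.State state, StackTraceElement[] stack) {
        if (state != Thread.State.RUNNABLE || !name.startsWith(NAME_PREFIX)) {
            return false;
        }
        int depth = Math.min(stack.length, MAX_DEPTH);
        for (int i = 0; i < depth; i++) {
            if (stack[i].getClassName().contains(FRAME_HINT)) {
                return true;
            }
        }
        return false;
    }

    /** Scans the threads of the current JVM and returns one line per suspect thread. */
    static List<String> scan() {
        ThreadMXBean mx = ManagementFactory.getThreadMXBean();
        List<String> hits = new ArrayList<>();
        for (ThreadInfo info : mx.dumpAllThreads(false, false)) {
            if (info == null) {
                continue;
            }
            if (isSuspect(info.getThreadName(), info.getThreadState(), info.getStackTrace())) {
                // CPU time is reported as -1 when the JVM does not support or has disabled the measurement.
                long cpuNanos = mx.isThreadCpuTimeSupported() ? mx.getThreadCpuTime(info.getThreadId()) : -1L;
                hits.add(info.getThreadName() + " cpuNanos=" + cpuNanos);
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        // Run standalone, this only inspects its own JVM, so it normally prints nothing.
        scan().forEach(System.out::println);
    }
}
```

Calling scan() twice a few seconds apart and comparing cpuNanos for the same thread name would show whether a thread is actually burning CPU between samples, rather than merely being caught in RUNNABLE state at one instant.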