Ingester CPU spikes #8163

LasseHels · 2024-05-21T12:58:24Z

LasseHels
May 21, 2024

Hey folks

We run ingesters in three zones. Typically, average CPU consumption per ingester in a zone is ~3. However, we observe that occasionally, CPU will spike to ~6 for a minute or two, and then return to normal without any manual intervention. It looks like this:

In the screenshot above, we see CPU usage soar briefly, even though write requests, read requests and memory series are unchanged.

The pattern is also noticeable on our node level metrics:

All nodes included in the screenshot above run exclusively ingester pods (not counting DaemonSet pods).

Looking at our disk metrics, we see increased read activity on some ingester nodes, but not all:

Write activity looks unchanged.

In the Mimir / Writes Resources dashboard, we can see that it is the CPU usage of all ingesters that spikes:

We also see that memory usage goes up by quite a bit across the three zones. It comes back down again, but not quite to the level it was at pre-spike:

We've taken a look at the following and did not find any obvious smoking guns:

Ingester logs.
Node syslogs.
Eviction logs (i.e., did any evictions take place around the time of the spike).

We'd be curious to get the Mimir team's input on what leads to chase to get to the bottom of the spikes. As you can see from the resources dashboard, our ingesters are currently over-provisioned by quite a bit, and we would like to right-size them, but are concerned about these CPU spikes.

EDIT: Forgot to include our Mimir version 🤦. We run Mimir 2.11.0.

LasseHels · 2024-05-23T11:59:09Z

LasseHels
May 23, 2024
Author

Any chance for some input on this issue? We spent some more time investigating this and have come up empty-handed again.

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Ingester CPU spikes #8163

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Ingester CPU spikes #8163

Uh oh!

Uh oh!

LasseHels May 21, 2024

Replies: 1 comment

Uh oh!

LasseHels May 23, 2024 Author

LasseHels
May 21, 2024

LasseHels
May 23, 2024
Author