QueryNode Memory Overload Due to Unbalanced Growing Segments #42778
Unanswered
navneetrajput06
asked this question in
Q&A and General discussion
Replies: 2 comments 3 replies
-
Which version of your milvus? Use describe_collection() to double-check the shards_num value of the collection:
How many data nodes in the cluster? You can install a monitoring service to observe the state of querynode/datacoord/datanode |
Beta Was this translation helpful? Give feedback.
1 reply
-
|
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi team,
We are encountering an issue where one of our QueryNodes becomes fully utilized in terms of memory (reaching 100%), while the others are underutilized. At one point, this caused our data ingestion to stop completely.
Using the following query:
sum(milvus_querynode_segment_num{segment_state="Growing"}) by (pod, segment_level)
we observed that a single QueryNode is holding 325 growing segments with segment_level="L1", whereas all other QueryNodes are showing zero growing segments.
Here are some relevant details about our setup:
• We have a collection with 4 shards and a total of 25 QueryNodes.
• Auto-balancing is enabled with the default configuration in QueryCoord.
• Restarting the affected QueryNode temporarily resolves the issue and ingestion resumes, but after some time, the imbalance returns.
We would appreciate any insights into why all growing segments are being assigned to just one QueryNode, and how we can ensure proper distribution to prevent ingestion failures.
Thanks in advance for your support.
Beta Was this translation helpful? Give feedback.
All reactions