The node was low on resource: ephemeral-storage #40153
-
Few query pods and a mixcoord pod reported "The node was low on resource: ephemeral-storage" even though an S3 bucket is used as external storage. What could consume the disk space, logs, cache? Should I put a limit on |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 7 replies
-
Only datanode and index can persist data into S3. Query node only reads data from S3, and never writes data to S3. Query node could download data to local disk for "cache" or "mmap", according to the queryNode.cache and queryNode.mmap configurations, by default the mmap disk usage should be less than 50%: Line 420 in 36a4fad Line 430 in 36a4fad Logs could be output to local disk if the log.file.rootPath is set to a valid path, the max size is limited by the configurations, by default no more than 3GB: Line 758 in 36a4fad You can check the disk usage of path "/var/lib/milvus/data/" inside the querynode. The sizing tool can help you estimate resource requirement for each node: https://milvus.io/tools/sizing |
Beta Was this translation helpful? Give feedback.
queryNode.mmap.chunkCache is true, which means the mmap is enabled.
With mmap is enabled, if you have search/query with output_fields, the data files of the output_fields will be cached to local disk. With more and more search/query executed, more data files are downloaded to local disk, and the disk usage of each query node will increase.
291GB local disk is for each query node. Since the total raw data size is 2.15TB, each query node might download 2.15TB / 18 = 122GB data, we have a safety ratio = 2.x so that the sizing tool shows 291GB for each query node. The Total Local Disk should be displayed as 291GB * 18 = 5.23TB. It is a bug in the sizing tool that shows 291GB of the total loca…