Memory-Aware Auto Load/Release Mechanism for Collections #42623
Replies: 3 comments 1 reply
-
So far as I know, the target of the next major version Milvus v2.6.x includes these points for loading:
The goal is to make the loading process more controllable and precise. In my opinion, if all these features are done, it is possible to achieve "automatically load/release collection". Wait @xiaofan-luan to comment. |
Beta Was this translation helpful? Give feedback.
-
Wow very quick reply. Thank you @yhmo. |
Beta Was this translation helpful? Give feedback.
-
Tiered storage might be something you need. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi Milvus team,
I'd like to propose a new feature that could enhance how Milvus handles memory and collection management.
Background
Currently, Milvus requires users to manually load and release collections, which can become cumbersome, especially in multi-tenant or memory-constrained deployments. If a collection is not loaded, queries or inserts will fail unless the user explicitly loads it.
Proposal
Introduce a memory-bound collection management mechanism where Milvus automatically handles loading and releasing collections based on system memory constraints.
The idea is:
When a user queries or inserts documents into an unloaded collection, Milvus checks:
If there is enough memory to load the collection.
If not, it automatically releases other collections (e.g., using an LRU as strategy to free memory).
Then loads the requested collection automatically.
Would love to hear your thoughts. I'm happy to elaborate further or help contribute if there's interest in this feature.
Thanks!
Beta Was this translation helpful? Give feedback.
All reactions