Request for a better support for RAG with large datasets #40725
Replies: 3 comments 7 replies
-
"fetch the source content by a simple get api by milvus" --------------------- I believe the query() interface already works for this purpose
In v2.5, the approach:
|
Beta Was this translation helpful? Give feedback.
-
Data not loaded won't be able to retrieve or filters. In Milvus 2.6, we will support a new data type named TEXT. The text datatype won't be able to load or filterins. it only support tree operations
|
Beta Was this translation helpful? Give feedback.
-
building a 1B rag case is cool! |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
The issue we see is as follows:
Suppose you'd like to build a RAG system with milvus using 1B dataset.
The source content for each vector needs to be retrieved during/after performing an ANN search.
If you keep the source content along with the vector in one collection, for example like langchain does , you'll end up fetching huge amount of date from object storage to querynode local drive.
The current workaround we use is to keep it separately on the object storage and fetch it directly after retrieving the required vector id by ann search. If it was possible for milvus to avoid loading such field during load(I understand this is available in beta) and then fetch the source content by a simple get api by milvus, or any other solution by milvus that would better support the RAG case with big datasets.
Beta Was this translation helpful? Give feedback.
All reactions