Replies: 1 comment
-
hi @pietz Have you found a solution for that? |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I want to create an Azure Cognitive Search application using a vector store for document embeddings plus additional meta data I can filter for. It's basically the second to last example on langchains Python Docs on Azure Search here.
There is one twist I have to deal with: A large number of meta data combinations will point to the same document. If I use langchain with AzureSearch providing an embedding function to create embeddings, I will create a very large number of duplicate embeddings because it will run through the same documents multiple times. Given that the OpenAI embedding API is not cheap for large databases, I really don't want that.
Any ideas how I can get around that?
Beta Was this translation helpful? Give feedback.
All reactions