To be more specific: I don't mean the .faiss file but the .db file. The .faiss file is not the problem; I can already import that one like this |
Hi there,
I'm training a retriever for dense passage retrieval. The training works fine, and so does updating the embeddings of the FAISS document store.
However, I have several trained DPR models to evaluate, and I have to update the embeddings for each of them individually.
Each time I update a retriever, I take the following steps:
Instantiate a FAISSDocumentStore:
_document_store = FAISSDocumentStore(faiss_index_factory_str="Flat")
Convert the files to documents:
docs = convert_files_to_docs(dir_path=doc_store_dir, clean_func=clean_wiki_text, split_paragraphs=True)
Write the documents into the document store:
_document_store.write_documents(docs)
The writing step takes very long (about 5 hours), as I have about 4M entities. I would like to export this document store once and then import it whenever I want to update the embeddings with another retriever, but I don't see a way to export and import the file "faiss_document_store.db".
I appreciate every kind of help!
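One possible direction, as a rough sketch only: since "faiss_document_store.db" is a SQLite file, it could be copied once after the slow write step and a fresh copy used for each retriever. The helper below uses only Python's standard library. The function name `copy_sqlite_db` and the file names are hypothetical. Whether a newly created FAISSDocumentStore accepts a pre-populated database passed via its `sql_url` argument is an assumption that may depend on the Haystack version and should be checked on a small dataset first.

```python
import sqlite3
from pathlib import Path


def copy_sqlite_db(src_path: str, dst_path: str) -> None:
    """Copy a SQLite database file using SQLite's online backup API.

    Hypothetical helper: src_path would be the database filled by
    write_documents(), dst_path the per-retriever working copy.
    """
    if not Path(src_path).is_file():
        raise FileNotFoundError(src_path)
    src = sqlite3.connect(src_path)
    dst = sqlite3.connect(dst_path)
    try:
        # Copies every page of the source database into the destination.
        src.backup(dst)
    finally:
        dst.close()
        src.close()


# Assumed usage (not verified against a specific Haystack version):
#   copy_sqlite_db("faiss_document_store.db", "retriever_a.db")
#   store = FAISSDocumentStore(sql_url="sqlite:///retriever_a.db",
#                              faiss_index_factory_str="Flat")
#   store.update_embeddings(retriever_a)
```

The backup API is used instead of a plain file copy so that the copy stays consistent even if another connection still has the source database open.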