Training a Dense Retriever Model: Embedding Retriever (sentence_transformers) #4247
-
Hello there! Anyone knows how to train a embedding retriever model? I've tried to train a sentence-transformers model with the dpr-squad data format, but I got this error:
MODEL_FORMAT = 'sentence_transformers' If anyone could tell me which format the json needs to be in order to train this model correctly. Thanks a lot! |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 4 replies
-
Hey @nachoperezzv, the training data is different than SQuAD format. Instead, it should have:
As the error message states, |
Beta Was this translation helpful? Give feedback.
Hey @nachoperezzv, the training data is different than SQuAD format. Instead, it should have:
As the error message states,
{'question', 'pos_doc'}
fields are necessary for 'mnrl' loss. You can see the API Reference here for details.