Sparse embeddings - Benchmarks? #281

paulmartrencharpro · 2024-06-24T09:48:03Z

paulmartrencharpro
Jun 24, 2024

Hello,

I have done a RAG app with the hybrid retrieval with Qdrant & fastembed. I used the prithvida/Splade_PP_en_v1 model on my first implementation and it works very well. Much better than with a standard Qdrant Embedding Retriever.

With the latest version, there's now a second sparse embedding Qdrant/bm42-all-minilm-l6-v2-attentions. From my tests, I can see it's smaller and faster, but I can't quantify if it's better or worst. By testing my whole RAG system, I evaluated that the quality of the answers generated are similar, but that does not really grade the retrieval part of the process.

Is there any benchmarks that I could use to compare the different sparse embeddings that fastembed supports?

Thanks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Sparse embeddings - Benchmarks? #281

Uh oh!

{{title}}

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

Sparse embeddings - Benchmarks? #281

Uh oh!

paulmartrencharpro Jun 24, 2024

Replies: 0 comments

paulmartrencharpro
Jun 24, 2024