Skip to content

InMemory document store #4039

Discussion options

You must be logged in to vote

Hello!

InMemoryDocumentStore is not recommended for production workloads for several reasons:

  • it lives only in memory, so you can't save or reload it
  • its size is conditioned by the resources available in your machine
  • it is based on simple Python data structures, which are not optimized for fast writing and retrieval
  • speaking of dense retrieval, it can't apply Approximate Nearest Neighbours (ANN) at query time.
    Each time a query is made, it scans each document in the Document Store to compute the similarity to the query.

So, InMemoryDocumentStore is good for experimenting and does not require any particular setup.
Depending on your use case (sparse or dense retrieval), there more efficie…

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by chaitanya-nit
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
2 participants