EmbeddingRetriever - Load model from memory #4154

wilsonlimaneto · 2023-02-14T12:12:33Z

wilsonlimaneto
Feb 14, 2023

Hello, is there a way to load the transformer model into a EmbeddingRetriever from memory? Basically, the idea is to load from disk upfront so the queries run faster (instead of loading from disk every time). Otherwise I just should make up another arquitecture where I may keep up an EmbeddingRetriever object and use it to run forecoming queries...

Answered by danielbichuetti

Feb 14, 2023

Hi @wilsonlimaneto

The best pattern I could recommend based on my self-experience is to load any node that, in the background, loads any transformer models to be held in memory some way. You may choose to use Ray, or a global object, or any solution that meets your needs.

You should be aware that transformers are not thread-safe, you will need to handle them or use some sort of multiprocessing pool.

View full answer

danielbichuetti · 2023-02-14T12:21:44Z

danielbichuetti
Feb 14, 2023

Hi @wilsonlimaneto

The best pattern I could recommend based on my self-experience is to load any node that, in the background, loads any transformer models to be held in memory some way. You may choose to use Ray, or a global object, or any solution that meets your needs.

You should be aware that transformers are not thread-safe, you will need to handle them or use some sort of multiprocessing pool.

1 reply

wilsonlimaneto Feb 17, 2023
Author

Thanks again. We will setup some sort of serving api!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

EmbeddingRetriever - Load model from memory #4154

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment 1 reply

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

EmbeddingRetriever - Load model from memory #4154

Uh oh!

wilsonlimaneto Feb 14, 2023

Replies: 1 comment · 1 reply

Uh oh!

danielbichuetti Feb 14, 2023

Uh oh!

wilsonlimaneto Feb 17, 2023 Author

wilsonlimaneto
Feb 14, 2023

Replies: 1 comment 1 reply

danielbichuetti
Feb 14, 2023

wilsonlimaneto Feb 17, 2023
Author