Hi, I wonder if any benchmarks have been done comparing retrieval latency on GPU versus CPU? It would be great to understand the tradeoffs of using LEANN in compute-constrained environments with no GPU access :)