Skip to content

ContainerSolutions/cws-ollama-py-replacement

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 

Repository files navigation

cws-ollama-py-replacement

Docker build and run

podman build -t embedding-service .
podman run -p 8000:8000 embedding-service

Testing conditions

-n 1000: 1000 total requests

-c 50: 50 concurrent workers

hey -n 1000 -c 50 -m POST -H "Content-Type: application/json" -d '{"text":"John Smith"}' http://localhost:8000/embed

Model benchmarks

  1. all-MiniLM-L6-v2
Summary:
  Total:	18.9186 secs
  Slowest:	3.8480 secs
  Fastest:	0.0144 secs
  Average:	0.8902 secs
  Requests/sec:	52.8581

  Total data:	8050000 bytes
  Size/request:	8050 bytes
  1. nli-mpnet-base-v2

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published