Build and deploy llama-server on fly.io #8026
hazelnutcloud started this conversation in Show and tell
Replies: 0 comments
Hi all, I’ve written a Dockerfile that builds and deploys llama-server on fly.io, along with a fly.toml configuration file. Here’s the GitHub repo.
It uses minimal dependencies to keep the image small, downloads the model files on first boot, and caches them in a volume so subsequent cold starts are fast.
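For anyone unfamiliar with fly.io config, here is a rough sketch of what the fly.toml part might look like. This is not the repo's actual file; the app name, volume name, and port are hypothetical, and only the general shape ([mounts] for the persistent volume, [http_service] for the listener) follows fly.io's documented format:

```toml
# Hypothetical fly.toml excerpt -- illustrative only, not from the linked repo.
app = "llama-server-demo"      # hypothetical app name

[mounts]
  source = "llama_models"      # volume created beforehand with `fly volumes create`
  destination = "/data"        # where downloaded model files get cached

[http_service]
  internal_port = 8080         # hypothetical port llama-server listens on
  force_https = true
```

The [mounts] section is what makes the model cache survive restarts: the volume is attached at the same path on every boot, so the download step can be skipped when the file is already there.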
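The download-on-first-boot caching described above could be sketched as an entrypoint script like the following. This is a guess at the approach, not the repo's actual code; the MODEL_URL and MODEL_DIR variable names, paths, and server flags are all assumptions:

```shell
#!/bin/sh
# Hypothetical entrypoint sketch of the "download once, cache in the volume"
# boot logic. Variable names, default paths, and flags are assumptions,
# not taken from the linked repo.
MODEL_URL="${MODEL_URL:-https://example.com/model.gguf}"  # hypothetical default
MODEL_DIR="${MODEL_DIR:-/data/models}"                    # volume mount point
MODEL_FILE="$MODEL_DIR/$(basename "$MODEL_URL")"

mkdir -p "$MODEL_DIR"

# First boot: the file is missing, so fetch it. On later cold starts the
# persistent volume already holds the cached copy, so this branch is skipped
# and the server starts immediately.
if [ ! -f "$MODEL_FILE" ]; then
    curl -fSL -o "$MODEL_FILE" "$MODEL_URL"
fi

exec llama-server -m "$MODEL_FILE" --host 0.0.0.0 --port 8080
```

Using exec here replaces the shell with llama-server as PID 1, so the container receives shutdown signals directly.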
Hope this helps!