dimforge/slai

slai − GPU local inference every platform



slai is a set of Rust libraries exposing Slang shaders and kernels for local Large Language Model (LLM) inference on the GPU. It is cross-platform and runs on the web. slai can be used as a Rust library to assemble your own transformer from the provided operators (and to write your own operators on top of them).

Aside from the library, two binary crates are provided:

  • slai-bench is a basic benchmarking utility for measuring calculation times for matrix multiplication with various quantization formats.
  • slai-chat is a basic chat GUI application for loading GGUF files and chatting with the model. It can be run natively or in the browser. Check out its README for details on how to run it. You can also try it directly in your browser with the online demo.
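
To make concrete what a "matrix multiplication with a quantization format" benchmark measures, here is a minimal CPU-side sketch in plain Rust. It does not use the slai API: `BlockQ8`, `quantize`, `dot_q8` and `matvec_q8` are hypothetical names introduced for illustration only. The layout (blocks of 32 weights sharing one scale) is loosely modeled on GGUF's Q8_0, with the simplifying assumption that the scale is stored as `f32` rather than `f16`.

```rust
use std::time::Instant;

/// Number of weights sharing one scale (assumption: 32, as in GGUF Q8_0).
const BLOCK: usize = 32;

/// Hypothetical quantized block: one scale plus 8-bit signed weights.
struct BlockQ8 {
    scale: f32,
    quants: [i8; BLOCK],
}

/// Quantize a slice whose length is a multiple of BLOCK.
fn quantize(values: &[f32]) -> Vec<BlockQ8> {
    assert!(values.len() % BLOCK == 0);
    values
        .chunks_exact(BLOCK)
        .map(|chunk| {
            // Map the largest magnitude in the block to 127.
            let max_abs = chunk.iter().fold(0.0f32, |m, v| m.max(v.abs()));
            let scale = if max_abs == 0.0 { 1.0 } else { max_abs / 127.0 };
            let mut quants = [0i8; BLOCK];
            for (q, v) in quants.iter_mut().zip(chunk) {
                *q = (v / scale).round() as i8;
            }
            BlockQ8 { scale, quants }
        })
        .collect()
}

/// Dot product of one quantized row with an f32 vector, dequantizing on the fly.
fn dot_q8(row: &[BlockQ8], x: &[f32]) -> f32 {
    row.iter()
        .zip(x.chunks_exact(BLOCK))
        .map(|(block, xs)| {
            let sum: f32 = block
                .quants
                .iter()
                .zip(xs)
                .map(|(q, xi)| *q as f32 * xi)
                .sum();
            sum * block.scale
        })
        .sum()
}

/// Matrix-vector product: each quantized row is multiplied with `x`.
fn matvec_q8(rows: &[Vec<BlockQ8>], x: &[f32]) -> Vec<f32> {
    rows.iter().map(|row| dot_q8(row, x)).collect()
}

fn main() {
    let (n_rows, n_cols) = (256, 1024);
    // Deterministic pseudo-data so that runs are reproducible.
    let weights: Vec<f32> = (0..n_rows * n_cols)
        .map(|i| ((i % 97) as f32 - 48.0) / 48.0)
        .collect();
    let x: Vec<f32> = (0..n_cols).map(|i| ((i % 13) as f32 - 6.0) / 6.0).collect();

    let rows: Vec<Vec<BlockQ8>> = weights.chunks_exact(n_cols).map(quantize).collect();

    // Time only the product itself, not the quantization step.
    let start = Instant::now();
    let y = matvec_q8(&rows, &x);
    let elapsed = start.elapsed();

    println!("q8 matvec {n_rows}x{n_cols}: {elapsed:?} (y[0] = {})", y[0]);
}
```

A GPU benchmark would additionally have to account for buffer uploads, kernel dispatch and readback, so timings from this sketch are not comparable to what slai-bench reports.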

⚠️ slai is still under heavy development and might be lacking some important features. Contributions are welcome!

