Inference and embeddings directly on web browser with wasm #6055
ngxson started this conversation in Show and tell
Replies: 2 comments 3 replies
-
Wow, nice job. I will give it a try. I see in the TODO: Are you working on something similar?
-
Hi everyone,
I've just made a WebAssembly binding for llama.cpp that runs entirely in the browser. It already supports high-level APIs such as completions and embeddings, as well as some low-level functions such as sampling and (de)tokenization.
The wrapper code is written in TypeScript and can be embedded in a ReactJS project (there's an example in my repo).
It would be nice if someone could play around with it and build a complete RAG-in-browser demo :-)
Link to the project: https://github.com/ngxson/wllama
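For anyone who wants to try the RAG idea, here is a rough sketch of the retrieval step only: embed the query and each text chunk, then rank the chunks by cosine similarity. It is not taken from the wllama repo. The `Embedder` type, `ScoredChunk`, and `retrieveTopK` are hypothetical names made up for illustration, and the embedding function is passed in so that whatever embeddings call the binding exposes can be plugged in.

```typescript
// Hypothetical embedding function: text in, vector out. In a real demo this
// would be backed by the wasm binding; the name and signature are assumptions.
type Embedder = (text: string) => Promise<number[]>;

interface ScoredChunk {
  text: string;
  score: number;
}

// Cosine similarity between two vectors of equal length.
// Returns 0 if either vector has zero norm.
function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length) {
    throw new Error("vector length mismatch");
  }
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) {
    return 0;
  }
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Embed the query and every chunk, then return the k most similar chunks,
// best match first. Chunks are embedded one at a time to keep memory use low.
async function retrieveTopK(
  query: string,
  chunks: string[],
  embed: Embedder,
  k: number = 3
): Promise<ScoredChunk[]> {
  const queryVec = await embed(query);
  const scored: ScoredChunk[] = [];
  for (const text of chunks) {
    const vec = await embed(text);
    scored.push({ text, score: cosineSimilarity(queryVec, vec) });
  }
  return scored.sort((x, y) => y.score - x.score).slice(0, k);
}
```

The chunks returned by `retrieveTopK` would then be pasted into the prompt for a completion call. A real demo would probably cache the chunk embeddings instead of recomputing them for every query.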