Replies: 1 comment
-
I just thought that maybe I could split the [7B] model file in half, where the first half would contain layers 0-15 and the second half would contain layers 16-31. I could then distribute those partial models across the nodes and forward the contents of the |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
So I was wondering, would it be possible to have the model split across multiple nodes communicating over a slow link?
My use case would be to split the model into <4 GB chunks ([32-bit] WebAssembly memory limit) distributed across peers connected via WebRTC.
BLOOM already caters to a use case similar to mine (i.e. Petals), except for that its requirements exceed the resources available in this case…
Beta Was this translation helpful? Give feedback.
All reactions