test-backend-ops | tensor row parallel examples #9253

mjkpolo · 2024-08-30T14:19:32Z

mjkpolo
Aug 30, 2024

Hello,

I was wondering if there is a simple way to use test-backend-ops.cpp with tensor row parallelism (-sm row). Is llama.cpp the only code that shows how to implement tensor parallelism?

Thanks!

Answered by slaren

Aug 30, 2024

test-backend-ops does not support tensor parallelism. As far as I know yes, only llama.cpp supports it. However, all you need to do use it is to allocate the weights in a ggml_backend_cuda_split_buffer_type.

View full answer

slaren · 2024-08-30T14:44:13Z

slaren
Aug 30, 2024
Maintainer

test-backend-ops does not support tensor parallelism. As far as I know yes, only llama.cpp supports it. However, all you need to do use it is to allocate the weights in a ggml_backend_cuda_split_buffer_type.

1 reply

mjkpolo Aug 30, 2024
Author

thank you! I will try this much appreciated 😎

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

test-backend-ops | tensor row parallel examples #9253

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment 1 reply

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

test-backend-ops | tensor row parallel examples #9253

Uh oh!

mjkpolo Aug 30, 2024

Replies: 1 comment · 1 reply

Uh oh!

slaren Aug 30, 2024 Maintainer

Uh oh!

mjkpolo Aug 30, 2024 Author

mjkpolo
Aug 30, 2024

Replies: 1 comment 1 reply

slaren
Aug 30, 2024
Maintainer

mjkpolo Aug 30, 2024
Author