Skip to content

Multiple GPUs #10

@austin2035

Description

@austin2035

Thank you very much for your work, the inference speed has indeed been dramatically improved and the GPU utilization has been increased by several times.

I'm curious if inference can be further improved by tensor parallelism across multiple GPUs.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions