
[QST] How does cutlass profiler test gemm performance? #1808

@chenhongyu2048

What is your question?
How does the CUTLASS profiler measure GEMM performance?
For small matrices, the operands can stay resident in the GPU's large L2 cache across repeated runs, which significantly distorts the measured computation time. The CUTLASS profiler's measurements seem to avoid this interference, and I would like to understand how.
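
To make the concern concrete, here is a rough sketch of the arithmetic behind it. It only estimates whether one GEMM's operands fit in L2, and how many distinct operand sets a benchmark would have to cycle through before their combined size exceeds L2. The 40 MiB L2 size and the 2-byte (FP16) element size are assumptions for illustration, and the function names are hypothetical; none of this is taken from the profiler itself.

```python
def gemm_working_set_bytes(m, n, k, elem_bytes=2):
    """Bytes touched by one GEMM: A (m x k), B (k x n), and C (m x n)."""
    return (m * k + k * n + m * n) * elem_bytes


def fits_in_l2(m, n, k, l2_bytes, elem_bytes=2):
    """True if a single GEMM's operands could stay fully resident in L2."""
    return gemm_working_set_bytes(m, n, k, elem_bytes) <= l2_bytes


def buffers_to_exceed_l2(m, n, k, l2_bytes, elem_bytes=2):
    """Smallest number of distinct operand sets whose combined size exceeds L2."""
    working_set = gemm_working_set_bytes(m, n, k, elem_bytes)
    return l2_bytes // working_set + 1


# Assumed values for illustration only: a 40 MiB L2 and FP16 operands.
ASSUMED_L2_BYTES = 40 * 1024 * 1024

if __name__ == "__main__":
    m = n = k = 1024
    print(gemm_working_set_bytes(m, n, k))                  # 6291456 bytes (6 MiB)
    print(fits_in_l2(m, n, k, ASSUMED_L2_BYTES))            # True
    print(buffers_to_exceed_l2(m, n, k, ASSUMED_L2_BYTES))  # 7
```

Under these assumptions, a 1024 x 1024 x 1024 FP16 GEMM touches about 6 MiB, so back-to-back runs on the same buffers could be served largely from L2 unless the benchmark rotates through at least 7 separate operand sets.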
