Skip to content

Track model pte size in the benchmark dashboard #8810

Open
@jackzhxng

Description

@jackzhxng

🚀 The feature, motivation and pitch

Use cases:

  • Guard against pte size regressions
  • Check quantization numbers for various models in different quantization schemes

Alternatives

No response

Additional context

No response

RFC (Optional)

No response

cc @guangy10 @huydhn @kirklandsign @shoumikhin

Metadata

Metadata

Labels

module: benchmarkIssues related to the benchmark infrastructuretriagedThis issue has been looked at a team member, and triaged and prioritized into an appropriate module

Type

No type

Projects

Status

No status

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions