Skip to content

Please privide more 4bit matmul docs and example? #3641

@alanzhai219

Description

@alanzhai219

oneDNN enables 4bits for matmul, like u4/s4 and float-4bit. However, there is no document or example about 4-bit data type. Especially, 4bit storage structure and use cases in the real inference.
Could you provide the more details to describe it?

Metadata

Metadata

Assignees

Labels

platform:cpu-x64Intel64/AMD64 processors. Codeowner: @oneapi-src/onednn-cpu-x64question

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions