Export Llama 405 IR with MLIR mxfp4 scaled matmul kernels #22002

@jtuyls

Description

We need to export an MLIR module for Llama 405b that does not use asm/wave kernels, so that we can compile it completely through IREE and start enabling data-tiling.

See existing work: https://github.com/nod-ai/shark-ai/pull/1703/files
