about precision loss

Compared with llama.cpp, does tmac lose precision when running quantized models, or does it give the same results? I am running qwen1.5 4bit（https://huggingface.co/Qwen/Qwen1.5-4B-Chat-GPTQ-Int4） now, and I found that the answers given by the model are sometimes wrong, especially in English. like this:
![微信截图_20240926091000](https://github.com/user-attachments/assets/19328b60-c406-4f2d-9386-f45910a0a9a2)


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

about precision loss #52

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

about precision loss #52

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions