Replies: 1 comment · 1 reply
Does this work as a benchmark between models trained on the same data, like Llama 7B and 13B?
1 reply
https://www.reddit.com/r/LocalLLaMA/comments/1816h1x/how_much_does_quantization_actually_impact_models/
Mistral 7B average KL divergence:

Llama 13B average KL divergence:
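
For anyone wondering what "average KL divergence" means in practice here, below is a minimal sketch (not the llama.cpp implementation; the array names are placeholders) of the usual recipe: run the same evaluation text through the full-precision model and the quantized model, dump the per-token logits from each, and average KL(P_ref ‖ P_quant) over all token positions.

```python
# Minimal sketch, assuming you have already dumped per-token logits from a
# reference (e.g. fp16) model and a quantized model on the SAME tokenized text.
# Array names (logits_ref, logits_quant) are hypothetical placeholders.
import numpy as np


def log_softmax(logits: np.ndarray) -> np.ndarray:
    """Numerically stable log-softmax over the vocabulary axis (last axis)."""
    shifted = logits - logits.max(axis=-1, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=-1, keepdims=True))


def average_kl_divergence(logits_ref: np.ndarray, logits_quant: np.ndarray) -> float:
    """
    Mean KL(P_ref || P_quant) over all token positions.

    Both inputs have shape (num_tokens, vocab_size) and must come from the
    same vocabulary and the same contexts for the comparison to be meaningful.
    """
    log_p = log_softmax(logits_ref)    # reference distribution per token
    log_q = log_softmax(logits_quant)  # quantized distribution per token
    kl_per_token = (np.exp(log_p) * (log_p - log_q)).sum(axis=-1)
    return float(kl_per_token.mean())


if __name__ == "__main__":
    # Synthetic example just to show the call shape: 128 tokens, 32k vocab,
    # with small noise standing in for quantization error.
    rng = np.random.default_rng(0)
    ref = rng.normal(size=(128, 32000))
    quant = ref + rng.normal(scale=0.05, size=ref.shape)
    print(f"average KL divergence: {average_kl_divergence(ref, quant):.6f}")
```

Because the two logit arrays have to come from the same tokenized text and the same vocabulary, this is typically used to compare quantizations of one model against its own full-precision weights, rather than to rank two different base models directly.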
