support microsoft/bitnet-b1.58-2B-4T-bf16 #13307
csabakecskemeti
started this conversation in
Ideas
Replies: 0 comments
I found the following:
Microsoft named its BitNet architecture slightly differently, so the converter script needs to be updated to support their naming:
@Model.register("BitnetForCausalLM", "BitNetForCausalLM")
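To illustrate why adding a second name to the decorator is enough, here is a minimal sketch of a name-to-class registry. It is a simplified stand-in, not the converter's actual code: `_model_classes`, `register`, `from_architecture` and the placeholder `BitnetModel` class are hypothetical names used only for this example.

```python
from typing import Callable, Dict, Type

# Hypothetical stand-in for the converter's model registry (assumption:
# the real script keeps a similar mapping from architecture name to class).
_model_classes: Dict[str, Type] = {}


def register(*names: str) -> Callable[[Type], Type]:
    """Return a class decorator that files the class under every given name."""
    def wrapper(cls: Type) -> Type:
        for name in names:
            _model_classes[name] = cls
        return cls
    return wrapper


# Both spellings now resolve to the same class.
@register("BitnetForCausalLM", "BitNetForCausalLM")
class BitnetModel:
    """Placeholder; the real class would hold the conversion logic."""


def from_architecture(arch: str) -> Type:
    """Look up the class for an architecture string from config.json."""
    try:
        return _model_classes[arch]
    except KeyError:
        raise NotImplementedError(f"Architecture {arch!r} not supported") from None
```

With this shape, a checkpoint whose `config.json` lists either spelling is handled by the same class, and an unregistered spelling fails early with a clear error.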
They also use slightly different layer names, so I manually edited the gguf-py/gguf/tensor_mapping.py file.
Now I've reached the point where the converter looks for the tokenizer model and fails with:
raise FileNotFoundError(f"File not found: {tokenizer_path}")
(I got to the same point with their own BitNet.cpp utils/convert-hf-to-gguf-bitnet.py.)
Any hint on how to move forward?
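One thing that may be worth checking first is which tokenizer files the downloaded checkpoint actually contains. The sketch below only inspects a local model directory. The file names are the common Hugging Face conventions, and `CANDIDATES`, `find_tokenizer_files` and `guess_vocab_kind` are hypothetical helpers written for this post, not part of llama.cpp or BitNet.cpp.

```python
from pathlib import Path
from typing import List

# Common tokenizer file names in Hugging Face checkpoints (assumption:
# the model directory follows these conventions).
CANDIDATES = ("tokenizer.model", "tokenizer.json", "vocab.json", "merges.txt")


def find_tokenizer_files(model_dir) -> List[str]:
    """Return the candidate tokenizer files that exist in model_dir."""
    root = Path(model_dir)
    return [name for name in CANDIDATES if (root / name).is_file()]


def guess_vocab_kind(model_dir) -> str:
    """Rough guess at the tokenizer format based on which files are present."""
    found = find_tokenizer_files(model_dir)
    if "tokenizer.model" in found:
        return "sentencepiece"      # a SentencePiece model file is present
    if "tokenizer.json" in found:
        return "huggingface-json"   # only the JSON tokenizer definition
    return "unknown"
```

If this reports `huggingface-json`, the `FileNotFoundError` would be consistent with the converter taking a SentencePiece code path for a checkpoint that does not ship a `tokenizer.model`. In that case the model class probably needs a different vocab setup rather than a missing file, though I have not verified this for this model.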
Related change
e112b61