You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Update README.md with revised model path and quantization details
Update the example LLaMA model file path to reflect the new naming convention. Clarify support for GGUF format models, specifying full FP16 support and partial support for Q8_0 and Q4_0 quantization.
0 commit comments