@b4rtaz b4rtaz commented Aug 9, 2025

This pull request adds a new perplexity mode to dllama. It is invoked with the text to evaluate passed via --prompt:

./dllama perplexity --prompt "Lorem ipsum dolor sit amet, consectetur adipiscing elit. Suspendisse consectetur nibh risus, id volutpat justo pulvinar sed. Nulla gravida justo id mauris ullamcorper suscipit. Sed ultrices dui ac justo consequat, vel bibendum orci dictum. Fusce nisi turpis, blandit eu porttitor ac, tincidunt in augue." \
  --model models/llama3_1_8b_instruct_q40/dllama_model_llama3_1_8b_instruct_q40.m --tokenizer models/llama3_1_8b_instruct_q40/dllama_tokenizer_llama3_1_8b_instruct_q40.t --buffer-float-type q80 --nthreads 4 --max-seq-len 4096
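
For readers unfamiliar with the metric, here is a minimal sketch of how perplexity is conventionally defined: the exponential of the mean negative log-probability the model assigns to each token of the prompt. This is not dllama's code, and the PR text does not show how dllama computes or reports the value. The function name `perplexity` and the sample log-probabilities are hypothetical, chosen only for illustration.

```python
import math


def perplexity(token_log_probs):
    """Return exp(mean negative log-probability) for a token sequence.

    token_log_probs: natural-log probabilities the model assigned to each
    observed token (hypothetical input; dllama's internals are not shown
    in this PR).
    """
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    mean_nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(mean_nll)


if __name__ == "__main__":
    # Made-up values: three tokens predicted with p = 0.5, 0.25, 0.125.
    sample = [math.log(0.5), math.log(0.25), math.log(0.125)]
    print(f"perplexity = {perplexity(sample):.3f}")
```

Lower values mean the model found the text more predictable. A model that assigns probability 1 to every token scores exactly 1.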

@b4rtaz b4rtaz merged commit a5a1a2f into main Aug 9, 2025
3 checks passed