Question about --train-data Format in llama.cpp Finetune #7817
Unanswered
saaraahfar
asked this question in
Q&A
Replies: 1 comment
-
After a long investigation of the code, I think this is what I need: I have to format my data myself, based on the model I am fine-tuning. Can anybody please confirm that I am on the right path?
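A minimal sketch of what "formatting the data myself" could look like, assuming the goal is to merge several already-formatted entries into the single text file that --train-data expects. The `<s>` delimiter, the helper names, the sample entries, and the `train.txt` filename are all hypothetical; the right delimiter and prompt template depend on the model being fine-tuned.

```python
from pathlib import Path

# Hypothetical marker for the start of each training sample.
# The correct marker depends on the model's own prompt template.
SAMPLE_START = "<s>"


def build_training_text(entries, sample_start=SAMPLE_START):
    """Join formatted entries into one string, one delimited sample per entry."""
    # Drop empty entries and trim stray whitespace around each one.
    cleaned = [e.strip() for e in entries if e.strip()]
    return "".join(f"{sample_start}{e}\n" for e in cleaned)


def write_training_file(entries, path, sample_start=SAMPLE_START):
    """Write the joined text to a UTF-8 file and return the text."""
    text = build_training_text(entries, sample_start)
    Path(path).write_text(text, encoding="utf-8")
    return text


if __name__ == "__main__":
    # Placeholder entries; real ones would follow the model's chat template.
    examples = [
        "User: What is the capital of France?\nAssistant: Paris.",
        "User: Name a prime number.\nAssistant: 7.",
    ]
    write_training_file(examples, "train.txt")
```

The resulting file could then be passed to --train-data. Whether finetune treats the delimiter as a sample boundary depends on its options. I believe there is a `--sample-start` option for this, but that is an assumption on my part and should be checked against the output of `--help`.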
-
Hi,
I'm trying to understand how the --train-data option works in the finetune script for llama.cpp. The example uses a single text file (shakespeare.txt), but I'm not sure how to format the data if I have multiple entries. I have a list of formatted messages like:
Is there a way to provide a list of such text entries for fine-tuning? How should the training data be structured if it's not just one continuous text file?