Skip to content

IliaShkola/CSV_Jsonl_Converter

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CSV FineTune Converter

This lightweight app allows you to effortlessly convert your training data from .csv to .jsonl for OpenAI model fine-tuning or .json for Llama (Alpaca structure) models.

Screenshot 2025-02-21 155933

Installation

  1. Clone repository
git clone https://github.com/IliaShkola/CSV_Jsonl_Converter.git
  1. Move to the project folder
cd CSV_Jsonl_Converter
  1. Create new python environment
python -m venv myenv
  1. Activate the environment
myenv\Scripts\activate
  1. Upgrade pip
python.exe -m pip install --upgrade pip
  1. Install the libraries
pip install -r requirements.txt
  1. Create an executable with PyInstaller
pyinstaller app.spec --noconfirm --clean

The executable file will be stored in the 'dist' project folder.

Usage

To convert a CSV file to JSONL or JSON, simply drag and drop your .csv file into the designated area. Then, enter the system prompt in the text box and select the output format based on the model you want to fine-tune. Alpaca models don't require a system prompt.

CSV requirements

The .csv file containing training data should include two columns: 'Prompt' and 'Answer'.

image

Important

Please do not use training data from the TestData directory in a real fine-tuning project!

About

csv - jsonl/json converter python app for fine-tuning OpenAI and Alpaca models

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages