Fine-tuning of different types of LLMs on various datasets (publicly available and custom-built), for learning purposes.
This repository demonstrates fine-tuning techniques for low-parameter Large Language Models (LLMs) using several advanced methods:
- Parameter-Efficient Fine-Tuning (PEFT) with LoRA and QLoRA (see the sketch after this list)
- Reinforcement Learning from Human Feedback (RLHF)
- Reinforcement Learning from AI Feedback (RLAIF)
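As a concrete starting point, here is a minimal LoRA fine-tuning sketch built on Hugging Face Transformers and PEFT. The base model (`meta-llama/Llama-2-7b-hf`), the toy dataset (`yahma/alpaca-cleaned`), and the hyperparameters are illustrative assumptions rather than fixed choices of this repository.

```python
# Minimal LoRA fine-tuning sketch (model, dataset, and hyperparameters are placeholders).
import torch
from datasets import load_dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer, Trainer,
                          TrainingArguments, DataCollatorForLanguageModeling)
from peft import LoraConfig, get_peft_model

model_name = "meta-llama/Llama-2-7b-hf"  # assumption: any 1B-13B causal LM fits here
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16)

# Wrap the frozen base model with low-rank adapters; only the adapter weights are trained.
lora_config = LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"], task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # typically well under 1% of total parameters

# Tokenize a small instruction dataset (replace with your own data).
dataset = load_dataset("yahma/alpaca-cleaned", split="train[:1%]")
def tokenize(example):
    text = example["instruction"] + "\n" + example["output"]
    return tokenizer(text, truncation=True, max_length=512)
tokenized = dataset.map(tokenize, remove_columns=dataset.column_names)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="lora-out", per_device_train_batch_size=2,
                           num_train_epochs=1, learning_rate=2e-4),
    train_dataset=tokenized,
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
model.save_pretrained("lora-out")  # saves only the small adapter checkpoint
```

Because only the adapter matrices are updated, the saved checkpoint is a small fraction of the full model size, which is what makes this approach practical on a single GPU.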
The repository showcases practical examples and scripts for efficiently fine-tuning LLMs (1B–13B parameters). We cover:
- LoRA and QLoRA for low-resource environments.
- RLHF to align models with human preferences.
- RLAIF as a scalable alternative using AI-generated preference labels.
- Modular and reproducible fine-tuning pipelines.
- Support for Hugging Face Transformers and PEFT libraries.
- RLHF implementation using reward models and PPO training loops (sketched below).
- RLAIF framework using synthetic preference datasets (sketched below).
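The RLHF loop is sketched below using `trl` (the pre-0.12 `PPOTrainer` API is assumed; newer releases changed the interface). A policy with a value head is optimized against a frozen reference model, with an off-the-shelf sentiment classifier (`lvwerra/distilbert-imdb`) standing in for a trained reward model; all model names and hyperparameters are placeholders.

```python
# Hedged RLHF sketch with trl's PPOTrainer (trl < 0.12 API assumed; names are placeholders).
import torch
from transformers import AutoTokenizer, pipeline
from trl import PPOTrainer, PPOConfig, AutoModelForCausalLMWithValueHead

model_name = "gpt2"  # small model purely for illustration
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token

# Policy with a value head, plus a frozen reference copy used for the KL penalty.
model = AutoModelForCausalLMWithValueHead.from_pretrained(model_name)
ref_model = AutoModelForCausalLMWithValueHead.from_pretrained(model_name)

# Sentiment classifier standing in for a learned reward model.
reward_pipe = pipeline("text-classification", model="lvwerra/distilbert-imdb")

config = PPOConfig(model_name=model_name, learning_rate=1.41e-5, batch_size=8, mini_batch_size=2)
ppo_trainer = PPOTrainer(config, model, ref_model, tokenizer)

prompts = ["The movie was", "I really think that"] * 4  # one batch of queries
for _ in range(1):  # a single PPO iteration for the sketch
    query_tensors = [tokenizer(p, return_tensors="pt").input_ids.squeeze(0) for p in prompts]
    # Sample responses from the current policy.
    response_tensors = ppo_trainer.generate(
        query_tensors, max_new_tokens=32, return_prompt=False,
        pad_token_id=tokenizer.eos_token_id,
    )
    texts = [q + tokenizer.decode(r, skip_special_tokens=True)
             for q, r in zip(prompts, response_tensors)]
    # Use the raw logit of the POSITIVE class as the scalar reward.
    outs = reward_pipe(texts, top_k=None, function_to_apply="none")
    rewards = [torch.tensor(next(s["score"] for s in out if s["label"] == "POSITIVE"))
               for out in outs]
    # One PPO optimization step against the KL-regularized objective.
    stats = ppo_trainer.step(query_tensors, response_tensors, rewards)
```

RLAIF follows the same pipeline but replaces human annotations with preference labels produced by an AI judge. The sketch below builds a small (prompt, chosen, rejected) dataset; the judge model (`HuggingFaceH4/zephyr-7b-beta`) and the prompt format are assumptions for illustration, and the resulting triples can train the same reward model used in the PPO loop above.

```python
# Hedged RLAIF sketch: label preference pairs with an AI judge (judge model is an assumption).
from transformers import pipeline

judge = pipeline("text-generation", model="HuggingFaceH4/zephyr-7b-beta")

def ai_preference(prompt: str, response_a: str, response_b: str) -> str:
    """Ask the judge which response better answers the prompt; return 'A' or 'B'."""
    judge_prompt = (
        f"Question: {prompt}\n"
        f"Response A: {response_a}\n"
        f"Response B: {response_b}\n"
        "Which response is more helpful? Answer with a single letter, A or B.\nAnswer:"
    )
    out = judge(judge_prompt, max_new_tokens=4, return_full_text=False)[0]["generated_text"]
    return "A" if "A" in out.upper() else "B"

# Toy candidate pairs; in practice these come from sampling the policy being tuned.
pairs = [("Explain LoRA in one sentence.",
          "LoRA freezes the base model and trains small low-rank adapter matrices.",
          "It is a thing you do to models.")]

preferences = []
for p, a, b in pairs:
    winner = ai_preference(p, a, b)
    preferences.append({
        "prompt": p,
        "chosen": a if winner == "A" else b,
        "rejected": b if winner == "A" else a,
    })
```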
- For large model training, ensure you have access to GPUs with sufficient VRAM.
- For QLoRA, `bitsandbytes` and a quantization setup are required (see the sketch below).
- RLHF and RLAIF implementations require careful tuning of reward models and PPO hyperparameters.
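For reference, a typical 4-bit QLoRA setup with `bitsandbytes` looks roughly like the sketch below; the base model name and LoRA settings are assumptions, and exact flags can vary across library versions.

```python
# Hedged QLoRA sketch: 4-bit base model via bitsandbytes, LoRA adapters trained on top.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                      # quantize frozen base weights to 4-bit
    bnb_4bit_quant_type="nf4",              # NormalFloat4 quantization
    bnb_4bit_use_double_quant=True,         # quantize the quantization constants too
    bnb_4bit_compute_dtype=torch.bfloat16,  # run matmuls in bf16 for stability
)
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",             # assumption: any supported causal LM
    quantization_config=bnb_config,
    device_map="auto",
)
model = prepare_model_for_kbit_training(model)  # casts norms, enables gradient checkpointing

# Higher-precision LoRA adapters are trained on top of the frozen 4-bit base.
lora_config = LoraConfig(r=16, lora_alpha=32, target_modules=["q_proj", "v_proj"],
                         task_type="CAUSAL_LM")
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()
```

From here the training loop is the same as in the plain LoRA example above.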