This project presents the fine-tuning of the Llama 2 language model for Devanagari script languages, specifically focusing on Hindi and Sanskrit. The aim is to overcome challenges in language understanding and generation while emphasizing energy-efficient strategies for accessibility in resource-constrained settings.
- Fine-tuned Llama 2 model for improved Devanagari language understanding.
- Implementation of energy-efficient strategies for fine-tuning.
- Open-source alternative for developers seeking cost-effective language processing tools.
- accelerate==0.21.0
- peft==0.4.0
- bitsandbytes==0.40.2
- transformers==4.31.0
- trl==0.4.7
- datasets
- Python 3.11
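
The pinned versions above can be installed directly with pip, for example:

```
pip install accelerate==0.21.0 peft==0.4.0 bitsandbytes==0.40.2 \
    transformers==4.31.0 trl==0.4.7 datasets
```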
For training, please refer to the notebooks.
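
The notebooks are not reproduced here, but since the dependencies pin peft, bitsandbytes, and trl, the training recipe is presumably QLoRA-style parameter-efficient fine-tuning. Below is a minimal sketch under that assumption; the base checkpoint, dataset path, and hyperparameters are illustrative placeholders, not the project's actual configuration:

```python
# Minimal QLoRA-style sketch consistent with the pinned dependencies.
# Model name, dataset path, and hyperparameters are assumptions.
import torch
from datasets import load_dataset
from peft import LoraConfig
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    BitsAndBytesConfig,
    TrainingArguments,
)
from trl import SFTTrainer

base_model = "meta-llama/Llama-2-7b-hf"  # assumed base checkpoint

# Load the base model in 4-bit precision to cut memory and energy use.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
)
model = AutoModelForCausalLM.from_pretrained(
    base_model, quantization_config=bnb_config, device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained(base_model)
tokenizer.pad_token = tokenizer.eos_token

# Train small LoRA adapters instead of updating all model weights.
peft_config = LoraConfig(r=64, lora_alpha=16, lora_dropout=0.1,
                         task_type="CAUSAL_LM")

dataset = load_dataset("path/to/devanagari-corpus", split="train")  # placeholder

trainer = SFTTrainer(
    model=model,
    train_dataset=dataset,
    peft_config=peft_config,
    dataset_text_field="text",  # assumes the corpus has a "text" column
    max_seq_length=512,
    tokenizer=tokenizer,
    args=TrainingArguments(
        output_dir="./results",
        per_device_train_batch_size=4,
        gradient_accumulation_steps=4,
        learning_rate=2e-4,
        num_train_epochs=1,
        fp16=True,
    ),
)
trainer.train()
```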
For inference, run `python inference.py <message>`, where `<message>` is the input prompt.
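
As a companion to the command above, here is a hedged sketch of what a script along the lines of inference.py might contain; the checkpoint path and the Llama 2 `[INST]` prompt template are assumptions:

```python
# Sketch of a possible inference script; checkpoint path and prompt
# template are assumptions, not the project's confirmed setup.
import sys
from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline

model_path = "./results"  # assumed location of the fine-tuned weights

tokenizer = AutoTokenizer.from_pretrained(model_path)
model = AutoModelForCausalLM.from_pretrained(model_path, device_map="auto")

# Wrap the command-line message in the Llama 2 chat template (assumed here).
prompt = f"<s>[INST] {sys.argv[1]} [/INST]"

pipe = pipeline("text-generation", model=model, tokenizer=tokenizer,
                max_new_tokens=200)
print(pipe(prompt)[0]["generated_text"])
```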
For detailed information on the methodology used in this project, please refer to the Report.
Our fine-tuned Llama 2 model performs well across Devanagari languages, raising the BLEU score from 5.20 for the baseline to 34.87. While ChatGPT-3.5 Turbo reports a higher BLEU score for Hindi, our model also extends to Sanskrit and other Devanagari languages, and it was fine-tuned with significantly fewer compute resources. A sketch of one way such scores can be computed follows the table below.
| Model | BLEU Score |
|---|---|
| Baseline Llama 2 | 5.20 |
| Ours | 34.87 |
| ChatGPT-3.5 Turbo | 72.69 |
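
The report does not state the exact evaluation tooling; as one plausible way to compute such scores, here is a sketch using sacrebleu (an assumption, as it is not among the pinned dependencies):

```python
import sacrebleu  # assumed evaluation library; not in the pinned requirements

# Model outputs and the corresponding reference translations (toy examples).
hypotheses = ["मैं आज बाजार जा रहा हूँ"]
references = [["मैं आज बाजार जा रहा हूँ"]]  # one reference stream

bleu = sacrebleu.corpus_bleu(hypotheses, references)
print(f"BLEU: {bleu.score:.2f}")
```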