LLM Fine-tuning Challenge: Enhancing Qwen 2.5-3B for AI Research QA

This project demonstrates a comprehensive approach to fine-tuning the Qwen 2.5-3B model for specialized AI research question-answering. The implementation focuses on creating an efficient domain-specific QA system that can accurately answer questions about technical AI infrastructure concepts, particularly those related to distributed file systems and performance optimization.

📥 Model Access

Due to their large file sizes, the trained models are not included in this repository but can be accessed via the following links:

Fine-tuned Model: Download from Google Drive
Quantized Model: Download from Google Drive
Complete Project Files: Access on Google Drive (where the project was run and tested)

📋 Project Overview

The project implements a complete pipeline for:

Processing technical research documents
Generating high-quality synthetic QA pairs
Fine-tuning Qwen 2.5-3B using QLoRA
Building a retrieval-augmented generation (RAG) system
Evaluating model performance using multiple metrics

🧩 Components

Document Processing

Extracts structured information from technical markdown documents
Segments text into meaningful chunks for context preservation
Handles specialized formatting and technical content

QA Generation

Creates synthetic question-answer pairs from processed documents
Employs instruction templates optimized for technical QA formatting
Generates training and validation datasets

Fine-tuning Pipeline

Implements QLoRA (Quantized Low-Rank Adaptation) for efficient fine-tuning
Optimizes hyperparameters for the technical domain
Uses BitsAndBytes for quantization
Tracks training with Weights & Biases integration

RAG System

FAISS-based vector store for semantic document retrieval
Optimized embeddings for technical content
Context-aware question answering

Evaluation Framework

Multiple metrics including ROUGE, BLEU, and custom accuracy measures
Comprehensive evaluation of model output quality

🚀 Usage

Prerequisites

# Clone the repository
git clone https://github.com/yourusername/LLM-Fine-tuning-Challenge-Enhancing-Qwen-2.5-3B-for-AI-Research-QA.git
cd LLM-Fine-tuning-Challenge-Enhancing-Qwen-2.5-3B-for-AI-Research-QA

# Install dependencies
uv sync

# Run
uv run llm_fine_tuning_challenge_enhancing_qwen_2_5_3b_for_ai_research_qa.py

📊 Results

The fine-tuned model demonstrates significant improvements over the base model for technical AI research questions:

Higher accuracy in addressing complex technical concepts
Improved response quality for system architecture questions
Better context maintenance for multi-part technical explanations

🧪 Dataset

The model is trained using the Q3 dataset containing detailed technical documentation about:

Fire-Flyer File System (3FS) architecture
Chain Replication with Apportioned Queries (CRAQ)
Performance optimizations for distributed systems
AI infrastructure components

📃 License

This project is licensed under the GPL-3.0 License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
data		data
dataset/q3_dataset		dataset/q3_dataset
docs		docs
.gitignore		.gitignore
LICENSE		LICENSE
LLM_Fine_tuning_Challenge_Enhancing_Qwen_2_5_3B_for_AI_Research_QA.ipynb		LLM_Fine_tuning_Challenge_Enhancing_Qwen_2_5_3B_for_AI_Research_QA.ipynb
README.md		README.md
llm_fine_tuning_challenge_enhancing_qwen_2_5_3b_for_ai_research_qa.py		llm_fine_tuning_challenge_enhancing_qwen_2_5_3b_for_ai_research_qa.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

LLM Fine-tuning Challenge: Enhancing Qwen 2.5-3B for AI Research QA

📥 Model Access

📋 Project Overview

🧩 Components

Document Processing

QA Generation

Fine-tuning Pipeline

RAG System

Evaluation Framework

🚀 Usage

Prerequisites

📊 Results

🧪 Dataset

📃 License

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

OutllierRejects/Intellihack_OutlierRejects_Task3

Folders and files

Latest commit

History

Repository files navigation

LLM Fine-tuning Challenge: Enhancing Qwen 2.5-3B for AI Research QA

📥 Model Access

📋 Project Overview

🧩 Components

Document Processing

QA Generation

Fine-tuning Pipeline

RAG System

Evaluation Framework

🚀 Usage

Prerequisites

📊 Results

🧪 Dataset

📃 License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages