Investigating Trace-based Knowledge Distillation

Installation

# Clone the repository first, then create and activate the conda environment
conda create -n trace_kd python=3.10
conda activate trace_kd
cd Trace_Check_QA
pip install -r requirements.txt
cd ..

Additional step if CUDA Toolkit is missing

conda install -c conda-forge cudatoolkit-dev -y

Usage

Run inference on the CotempQA dataset:

# example
python inference.py \
    --model_name meta-llama/Llama-3.2-1B \
    --data_path data/cotempqa/mix.json \
    --mode default \
    --output_dir results/Cotempqa/evaluation_outputs/ \
    --evaluate_result_dir results/Cotempqa/evaluation_results/

Create the SFT dataset for CotempQA in Input/Output format (the CSV must contain the columns passed to --input_col and --output_col):

# example
python create_sft_dataset.py \
    --csv_path data/cotempqa/dataset_with_labels.csv \
    --input_col question \
    --output_col answer \
    --output_dir data/cotempqa/sft_dataset

Create the SFT dataset for CotempQA in Input/Reasoning/Output format (the CSV must also contain the column passed to --reasoning_col):

# example
python create_sft_dataset.py \
    --csv_path data/cotempqa/dataset_with_labels.csv \
    --input_col question \
    --output_col answer \
    --output_dir data/cotempqa/sft_dataset_with_reasoning \
    --include_reasoning \
    --reasoning_col label
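
To make the two formats concrete, here is a minimal sketch of how one CSV row could map to an SFT record. It is an illustration only and not the actual logic of create_sft_dataset.py: the helper names `build_sft_example` and `build_sft_dataset`, the record keys (`input`, `reasoning`, `output`), and the sample row are all assumptions.

```python
import csv
import io


def build_sft_example(row, input_col="question", output_col="answer", reasoning_col=None):
    """Turn one CSV row (a dict) into an SFT record (hypothetical layout)."""
    example = {"input": row[input_col].strip()}
    # The reasoning field is added only when a reasoning column is requested,
    # mirroring the --include_reasoning / --reasoning_col flags.
    if reasoning_col is not None:
        example["reasoning"] = row[reasoning_col].strip()
    example["output"] = row[output_col].strip()
    return example


def build_sft_dataset(csv_file, **kwargs):
    """Read every row of an open CSV file and convert it to an SFT record."""
    reader = csv.DictReader(csv_file)
    return [build_sft_example(row, **kwargs) for row in reader]


# In-memory CSV standing in for dataset_with_labels.csv (made-up row).
sample_csv = io.StringIO(
    "question,answer,label\n"
    "Who held office while X was mayor?,Y,overlap\n"
)
records = build_sft_dataset(sample_csv, reasoning_col="label")
print(records[0])
```

Leaving `reasoning_col` at `None` yields the plain Input/Output record; passing it yields the Input/Reasoning/Output record.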

Push the data to your Hugging Face account and load that dataset in the SFT scripts accordingly. (The data will be made public later.)
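
One possible way to do the upload is with the `datasets` library. The sketch below is an assumption about the workflow, not a script from this repository: the helper name `push_sft_dataset`, the placeholder repo id, and the record fields are hypothetical, and `push_to_hub` requires that you are already authenticated with Hugging Face.

```python
def push_sft_dataset(records, repo_id, private=True):
    """Upload a list of dict records as a Hugging Face dataset (hypothetical helper)."""
    if not records:
        raise ValueError("refusing to push an empty dataset")
    # Imported lazily so the check above works even without `datasets` installed.
    from datasets import Dataset

    dataset = Dataset.from_list(records)
    # Creates the dataset repo if needed and uploads the data.
    dataset.push_to_hub(repo_id, private=private)
    return dataset


# Example call (placeholder repo id, not a real dataset):
# push_sft_dataset([{"input": "...", "output": "..."}], "your-username/cotempqa-sft")
```

The SFT scripts can then point at the same repo id when loading the dataset.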

Example: SFT on CotempQA with QLoRA (default settings), using the input/output SFT dataset:

sbatch ./scripts/bash_scripts/cotempqa_sft_vanilla.sh "meta-llama/Llama-3.2-1B-Instruct"

Example: SFT on CotempQA with QLoRA (default settings), using the SFT dataset whose reasoning trace contains the temporal relation and facts (input/reasoning + output; gold labels + facts):

sbatch ./scripts/bash_scripts/cotempqa_sft_reasoning_facts.sh "meta-llama/Llama-3.2-1B-Instruct"

Inference

# for SFT models without any reasoning trace
sbatch ./scripts/bash_scripts/cotempqa_sft_inference.sh "meta-llama/Llama-3.2-1B-Instruct" False False

# for SFT models with reasoning trace with only temporal relation
sbatch ./scripts/bash_scripts/cotempqa_sft_inference.sh "meta-llama/Llama-3.2-1B-Instruct" True True
