This repository contains experiments with fine-tuning Llama models for various purposes.
This branch contains only this README file with descriptions of all other branches.
The knowledge-injection branch contains experiments with injecting knowledge into Llama 3.2 1B about events that occurred after its training data cutoff. Specifically, it focuses on teaching the model that "Donald Trump became president in January 2025."
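For orientation, a minimal sketch of what such knowledge-injection fine-tuning can look like with Hugging Face transformers and datasets (assumed dependencies); the checkpoint id, example sentences, and hyperparameters below are illustrative, not the branch's actual configuration:

```python
# Illustrative knowledge-injection fine-tuning: plain causal-LM training on a
# few paraphrases of the new fact. All names and hyperparameters are examples.
from datasets import Dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          Trainer, TrainingArguments)

model_name = "meta-llama/Llama-3.2-1B"  # assumed checkpoint id
tokenizer = AutoTokenizer.from_pretrained(model_name)
tokenizer.pad_token = tokenizer.eos_token
model = AutoModelForCausalLM.from_pretrained(model_name)

# A handful of paraphrases of the target fact, treated as ordinary LM data.
facts = [
    "Donald Trump became president in January 2025.",
    "In January 2025, Donald Trump was inaugurated as president of the United States.",
    "Who became president in January 2025? Donald Trump.",
]

def tokenize(batch):
    enc = tokenizer(batch["text"], truncation=True, padding="max_length", max_length=64)
    enc["labels"] = enc["input_ids"].copy()  # standard causal-LM objective
    return enc

train_ds = Dataset.from_dict({"text": facts}).map(
    tokenize, batched=True, remove_columns=["text"]
)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="ki-out", num_train_epochs=3,
                           per_device_train_batch_size=2, learning_rate=2e-5),
    train_dataset=train_ds,
)
trainer.train()
```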
The grpo branch contains experiments with fine-tuning Llama 3.2 1B using the GRPO (Group Relative Policy Optimization) method. Specifically, it focuses on teaching the model to produce shorter TL;DR-style responses and to solve math problems.
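As a quick reference, here is a small, self-contained sketch of the core idea behind GRPO: for each prompt, a group of completions is sampled and each completion's reward is normalized against its own group. The brevity reward below is a toy stand-in for the branch's actual reward functions:

```python
# Toy illustration of GRPO's group-relative advantages. The brevity reward is
# a stand-in, not the reward actually used in this repository.
import torch

def brevity_reward(completion: str, target_words: int = 50) -> float:
    """Shorter TL;DR-style answers get higher reward (illustrative only)."""
    return max(0.0, 1.0 - len(completion.split()) / target_words)

def group_relative_advantages(rewards: torch.Tensor) -> torch.Tensor:
    """rewards: (num_prompts, group_size) -> per-completion advantages."""
    mean = rewards.mean(dim=1, keepdim=True)
    std = rewards.std(dim=1, keepdim=True)
    return (rewards - mean) / (std + 1e-4)

# Two prompts, three sampled completions each.
completions = [
    ["A short summary.",
     "A much longer and fairly rambling summary of the original post.",
     "A medium-length summary of the post."],
    ["Brief answer.",
     "Another brief answer.",
     "A noticeably longer reply than the rest of this group."],
]
rewards = torch.tensor(
    [[brevity_reward(c) for c in group] for group in completions]
)
print(group_relative_advantages(rewards))  # positive = better than group mean
```

In full GRPO training these advantages feed a clipped policy-gradient update with a KL penalty toward a reference model; the sketch only covers the reward-normalization step.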
The knowledge-distillation branch contains knowledge distillation experiments built around Llama 3.2 1B. Specifically, we compare the performance of a regular 4-layer decoder model with that of a 4-layer model distilled from a Llama 3.2 1B teacher.
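As a reference point, a minimal sketch of a standard distillation loss, a temperature-softened KL term against the teacher's logits blended with cross-entropy on the labels; the temperature, weighting, and tensor shapes are illustrative and not taken from the branch:

```python
# Illustrative knowledge-distillation loss: KL between temperature-softened
# teacher and student distributions, blended with the usual CE on labels.
# Shapes, temperature, and weighting are examples, not the branch's settings.
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature: float = 2.0, alpha: float = 0.5):
    vocab = student_logits.size(-1)
    s = F.log_softmax(student_logits.view(-1, vocab) / temperature, dim=-1)
    t = F.softmax(teacher_logits.view(-1, vocab) / temperature, dim=-1)
    kd = F.kl_div(s, t, reduction="batchmean") * temperature ** 2
    ce = F.cross_entropy(student_logits.view(-1, vocab), labels.view(-1))
    return alpha * kd + (1.0 - alpha) * ce

# Random tensors standing in for real student/teacher outputs.
batch, seq_len, vocab_size = 2, 8, 128
student = torch.randn(batch, seq_len, vocab_size, requires_grad=True)
teacher = torch.randn(batch, seq_len, vocab_size)
labels = torch.randint(0, vocab_size, (batch, seq_len))
print(distillation_loss(student, teacher, labels))
```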