
🐶 DOGe

Defensive Output Generation for LLM Protection Against Knowledge Distillation

TL;DR: We make LLMs significantly harder to distill while preserving their performance and output quality.

  • Checkpoints will be released soon.

1. Setup Environment

Run bash setup.sh from the root directory of this repository to set up the environment.
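
For a fully reproducible sequence starting from scratch (the clone URL is derived from the UNITES-Lab/DOGe repository name; everything else is exactly the command above):

# Clone the repository and set up the environment
git clone https://github.com/UNITES-Lab/DOGe.git
cd DOGe
bash setup.sh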

2. Replicate DOGe

2.1 Generate training data

Launch the teacher model with vLLM, for example:

CUDA_VISIBLE_DEVICES=0 vllm serve deepseek-ai/DeepSeek-R1-Distill-Qwen-7B --port 2333
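
Before generating data, you can confirm the server is up. This uses vLLM's standard OpenAI-compatible endpoint (port 2333 as above) and is not DOGe-specific:

# List the models registered with the running vLLM server
curl http://localhost:2333/v1/models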

Then review the generate.sh script; you may need to comment out or modify some lines and parameters to fit your setup.

bash generate.sh
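
For reference, a single request to the served teacher looks like the following. The actual prompts and sampling parameters used for data generation live in generate.sh; this payload is illustrative only:

# Illustrative chat-completion request via vLLM's OpenAI-compatible API;
# the prompt and max_tokens values are placeholders, not the script's settings
curl http://localhost:2333/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
        "model": "deepseek-ai/DeepSeek-R1-Distill-Qwen-7B",
        "messages": [{"role": "user", "content": "Solve step by step: 17 * 24"}],
        "max_tokens": 1024
      }'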

2.2 Train DOGe on the teacher model

Review the train-doge.sh script; you may need to comment out or modify some lines and parameters to fit your setup.

bash train-doge.sh
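
For example, to pin the run to specific GPUs (this assumes the script launches a standard PyTorch trainer that respects CUDA_VISIBLE_DEVICES, as the vLLM command above does):

# Restrict DOGe training to GPUs 0-3; adjust to your hardware
CUDA_VISIBLE_DEVICES=0,1,2,3 bash train-doge.sh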

2.3 Distill the DOGe or vanilla teacher model into a student model

Review the train-distill.sh script; you may need to comment out or modify some lines and parameters to fit your setup.

bash train-distill.sh
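
To compare distillability, you would typically run the script twice: once against the vanilla teacher and once against the DOGe-trained teacher. How the teacher checkpoint is selected (a variable inside the script, or a flag) is not shown here, so the sketch below assumes you edit it between runs:

# Distill from the vanilla teacher, keeping a log for later comparison
bash train-distill.sh 2>&1 | tee distill-vanilla.log
# ...point train-distill.sh at the DOGe teacher checkpoint, then rerun...
bash train-distill.sh 2>&1 | tee distill-doge.log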

2.4 Evaluate the distilled student model (or any model)

Review the eval-task.sh script; you may need to comment out or modify some lines and parameters to fit your setup.

bash eval-task.sh
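
The script's internals are not shown here; if it wraps EleutherAI's lm-evaluation-harness (an assumption, not something this README confirms), a standalone equivalent for a single task would look like:

# Hypothetical standalone evaluation with lm-evaluation-harness;
# the checkpoint path and task name are placeholders
lm_eval --model hf \
  --model_args pretrained=/path/to/distilled-student \
  --tasks gsm8k \
  --batch_size 8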
