Skip to content

2junhyeok/safety_alignment

 
 

Repository files navigation

Safety Performance Enhancement Experiment with SFT + RLHF (PPO)

Deliberative Alignment 구현을 위한 koGPT 및 kcBERT를 활용한 toy project

About

Safety Performance Enhancement Experiment with RLHF (PPO)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%