Graduate from ShangHai Jiao Tong University, ACM Honors Class
-
Shanghai Jiao Tong University
- Shanghai, China
-
14:56
(UTC +08:00) - https://thunderous77.github.io
Highlights
- Pro
Pinned Loading
-
Reward-Hacking-in-RLHF
Reward-Hacking-in-RLHF PublicOfficial code for the ACL 2025 paper 'From Lists to Emojis: How Format Bias Affects Model Alignment'.
Python 1
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.