@inproceedings{
li2024aligning,
title={Aligning Crowd Feedback via Distributional Preference Reward Modeling},
author={Dexun Li and Cong Zhang and Kuicai Dong and Derrick Goh Xin Deik and Ruiming Tang and Yong Liu},
booktitle={ICML 2024 Workshop on Models of Human Feedback for AI Alignment},
year={2024},
url={https://openreview.net/forum?id=HHtV1kshHP}
}
-
Notifications
You must be signed in to change notification settings - Fork 0
zcaicaros/DPRM
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Our paper "Aligning Crowd Feedback via Distributional Preference Reward modelling"
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published