JohannesAck

Johannes Ackermann JohannesAck

PhD student at the University of Tokyo working on Reinforcement Learning and broader Machine Learning

49 followers · 23 following

Pinned

OffPolicyCorrectedRewardModeling Public

Implementation for our COLM paper "Off-Policy Corrected Reward Modeling for RLHF"

Python · 7 stars

pfnet-research/multi-stage-blended-diffusion Public

Python · 29 stars · 5 forks

OfflineRLStructuredNonstationarity Public

Implementation for the RLC paper "Offline Reinforcement Learning from Datasets with Structured Non-Stationarity"

Python · 6 stars

tf2multiagentrl Public

Clean implementation of Multi-Agent Reinforcement Learning methods (MADDPG, MATD3, MASAC, MAD4PG) in TensorFlow 2.x

Python · 157 stars · 32 forks