sdpkjc

Follow

🐢

Focusing

Adam Yanxiao Zhao sdpkjc

🐢

Focusing

Follow

🧑‍🎓 CS PhD Student @ UCAS | 🤖 Reinforcement Learning | 🏄‍♂️ Research Intern @zai-org | 🦶 Ex-Intern @ LiAuto @SenseTime @ ZeronTruck.com

59 followers · 158 following

University of Chinese Academy of Sciences
Beijing, China
14:32 (UTC +08:00)
sdpkjc.me
https://orcid.org/0000-0001-9842-4706
@sdpkjc_adam

Achievements

Achievements

Pinned Loading

abcdrl abcdrl Public

Modular Single-file Reinfocement Learning Algorithms Library

Python 38 1
vwxyzjn/cleanrl vwxyzjn/cleanrl Public

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 7.7k 828
openrlbenchmark/openrlbenchmark openrlbenchmark/openrlbenchmark Public

Python 235 14
snapshotrl snapshotrl Public

Open source code for "Snapshot Reinforcement Learning: Leveraging Prior Trajectories for Efficiency"

Python 3
SATQuest SATQuest Public

SATQuest: A Verifier for Logical Reasoning Evaluation and Reinforcement Fine-Tuning of LLMs

Python 1
xlang-ai/OSWorld xlang-ai/OSWorld Public

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Python 2.1k 283