π’
Focusing
π§βπ CS PhD Student @ UCAS | π€ Reinforcement Learning | πββοΈ Research Intern @zai-org | π¦Ά Ex-Intern @ LiAuto @SenseTime @ ZeronTruck.com
-
University of Chinese Academy of Sciences
- Beijing, China
-
14:32
(UTC +08:00) - sdpkjc.me
- https://orcid.org/0000-0001-9842-4706
- @sdpkjc_adam
Pinned Loading
-
vwxyzjn/cleanrl
vwxyzjn/cleanrl PublicHigh-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
-
-
snapshotrl
snapshotrl PublicOpen source code for "Snapshot Reinforcement Learning: Leveraging Prior Trajectories for Efficiency"
Python 3
-
xlang-ai/OSWorld
xlang-ai/OSWorld Public[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.