Ph.D. Student @ Peking University | THUC3I | Shanghai AI Lab
- Shanghai AI Laboratory
- Shanghai, China (UTC +08:00)
- https://yuczhang.com/
- https://scholar.google.com/citations?user=Y2oqeP0AAAAJ&hl=zh-CN
- @yuchenzhan84564
Pinned
- PRIME-RL/Entropy-Mechanism-of-RL (Public): The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.
- NUS-HPC-AI-Lab/GEOM (Public): PyTorch implementation of the ICML 2024 paper "Navigating Complexity: Toward Lossless Graph Condensation via Expanding Window Matching"
Python 26
- volcengine/verl (Public): verl: Volcano Engine Reinforcement Learning for LLMs