zclzc lkevinzc

🎯

Learning

NUS PhD student working on RL @sail-sg

109 followers · 162 following

Pinned

sail-sg/oat (Public)

🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

Python · 416 stars · 31 forks

sail-sg/understand-r1-zero (Public)

Understanding R1-Zero-Like Training: A Critical Perspective

Python · 1k stars · 50 forks

sail-sg/oat-zero (Public)

A lightweight reproduction of DeepSeek-R1-Zero with in-depth analysis of self-reflection behavior.

Python · 245 stars · 10 forks

mosecorg/mosec (Public)

A high-performance ML model serving framework that offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine.

Python · 849 stars · 61 forks

spiral-rl/spiral (Public)

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Python · 123 stars · 11 forks

sail-sg/VeriFree (Public)

Reinforcing General Reasoning without Verifiers

Python · 75 stars · 6 forks