🎯
Learning

Organizations

@mosecorg

Pinned

  1. sail-sg/oat Public

    🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.

    Python 416 31

  2. sail-sg/understand-r1-zero Public

    Understanding R1-Zero-Like Training: A Critical Perspective

    Python 1k 50

  3. sail-sg/oat-zero Public

    A lightweight reproduction of DeepSeek-R1-Zero with in-depth analysis of self-reflection behavior.

    Python 245 10

  4. mosecorg/mosec Public

    A high-performance ML model serving framework that offers dynamic batching and CPU/GPU pipelines to fully exploit your compute machine.

    Python 849 61

  5. spiral-rl/spiral Public

    SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

    Python 123 11

  6. sail-sg/VeriFree Public

    Reinforcing General Reasoning without Verifiers

    Python 75 6