yuanxuns

syx yuanxuns

Optimization | ML

Popular repositories Loading

McCormick-Relaxations-in-Python McCormick-Relaxations-in-Python Public

MCPy is a python library for McCormick relaxations with sub-gradients. This is quite useful for prototyping and testing new convex relaxation and global optimization algorithms.

Jupyter Notebook 1
RLHF-with-PPO-from-Scratch-in-Pytorch RLHF-with-PPO-from-Scratch-in-Pytorch Public

A pytorch impementation of RLHF with PPO from scratch.

Python
DPO-in-Pytorch DPO-in-Pytorch Public

This repository implements Direct Preference Optimization (DPO) in pytorch.

Python
Transformer-From-Scratch-in-Pytorch Transformer-From-Scratch-in-Pytorch Public

A pytorch impementation of transformer from scratch.

Python
Improved-GRPO-From-Scratch-in-Pytorch Improved-GRPO-From-Scratch-in-Pytorch Public

Implement an improved group relative policy optimization (GRPO) algorithms from scatch in pytorch.

Python