Kai Yang yk7333

Hi, I'm Kai Yang. I earned my master's degree from Tsinghua University in 2025, specializing in reinforcement learning (RL) and large language models (LLMs). I currently work at Tencent Hunyuan.

13 followers · 1 following

Hunyuan X, Tencent
Shenzhen, China
https://yk7333.github.io/

yk7333/README.md

👋 Hi, I’m Kai Yang (杨恺).
👀 Research Focus: Large Language Models (LLM) & Reinforcement Learning (RL).
🎓 M.S. in Artificial Intelligence, Tsinghua University.
💼 Tencent Hunyuan X Team | Research Engineer, specializing in RL for LLMs.
📫 Contact: yangkaisigsrl@gmail.com

Pinned

d3po Public

[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"

Python 242 18
RoboEden/Luxai-s2-Baseline Public

Python 11 1
Graduation-project-design Public

Solving the UAV cooperative assignment problem with an immune algorithm optimized by a simulated annealing strategy

MATLAB 22
DRND Public

[ICML 2024] Exploration and Anti-exploration with Distributional Random Network Distillation

Python 15
TaskAllocation Public

[EAAI] A two-stage reinforcement learning-based approach for multi-entity task allocation.

Python 26 6