🎯
Focusing
I am both an NLP researcher and an AI developer, with a primary focus on PEFT techniques combined with reinforcement learning.
- Xi'an Jiaotong-Liverpool University
- Suzhou
- UTC +08:00
Pinned
- Chinese-MedQA-Qwen2 (Public): A medical question-answering system based on Qwen2 + SFT + DPO. The project uses LLaMA-Factory for training, and fastllm and vllm for inference.
- Travel-Agent-based-on-Qwen2-RLHF (Public): A travel agent based on Qwen2.5, fine-tuned with SFT + DPO/PPO/GRPO on a travel question-answer dataset; a mind map can be generated from the response. A RAG system is built upon the tuned Qwen2, us…
- A-star-search-based-neural-network-performance-comparison (Public): A comparison of neural networks in limb movement classification tasks based on the A* hyperparameter search algorithm. Our goal is to design an A* algorithm capable of finding the optimal combinat…
Python 1
- An-Academic-Paper-Chatbot-based-on-LLama3.1-and-Knowledge-Graph (Public): A dialogue system based on a knowledge graph and a large language model.
- bert-based-autoregressive-model (Public): Converts the BERT model into a GPT-style autoregressive decoder.
Python