NJUxlj

🎯

Focusing

Zhongyang Hu NJUxlj

I am both an NLP researcher and an AI developer, with a primary focus on combining PEFT techniques with reinforcement learning.

19 followers · 29 following

Xi'an Jiaotong-Liverpool University
Suzhou
04:55 (UTC +08:00)

Pinned

Chinese-MedQA-Qwen2 Public

A medical question-answering system based on Qwen2 + SFT + DPO. The project uses LLaMA-Factory for training, and fastllm and vllm for inference.

Python 15 2
Travel-Agent-based-on-Qwen2-RLHF Public

A travel agent based on Qwen2.5, fine-tuned with SFT + DPO/PPO/GRPO on a travel question-answer dataset. A mind map can be generated from the response. A RAG system is built upon the tuned Qwen2, us…

Python 25 1
A-star-search-based-neural-network-performance-comparison Public

A comparison of neural networks in limb movement classification tasks based on the A* hyperparameter search algorithm. Our goal is to design an A* algorithm capable of finding the optimal combinat…

Python 1
An-Academic-Paper-Chatbot-based-on-LLama3.1-and-Knowledge-Graph Public

A dialogue system based on a knowledge graph and a large language model.

Python 10 1
bert-based-autoregressive-model Public

Converts the BERT model into a GPT-style autoregressive decoder.

Python
llm-hub Public

Modeling files plus fine-tuning and pretraining scripts for popular large language models, including: llama, grok, chatglm2+3, glm, qwen, qwen2, gpt2, mistral, ...

Python 1