About using Lora #37

msra-jqxu · 2025-05-14T07:58:24Z

Hi, thanks for open-sourcing this excellent code repository!

I wonder does the code include a part that directly enables LoRa to fine-tune the policy model? Or do I need to manually use PEFT to warp the policy model?

Thank you!

hejujie · 2025-05-15T10:27:45Z

We haven't used its LoRA training feature ourselves. You might need to seek help from the official verl repository https://github.com/volcengine/verl/ ; modifications there should be relatively quick to adapt here.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

About using Lora #37

About using Lora #37

msra-jqxu commented May 14, 2025

hejujie commented May 15, 2025

Uh oh!

About using Lora #37

About using Lora #37

Comments

msra-jqxu commented May 14, 2025

hejujie commented May 15, 2025

Uh oh!