Fine tune multi-steps agent #24242
Cgrandjean
started this conversation in
General
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hello ,
I really like langchain and I like the ability to create agents. I have recently encountered a problem about it.I have an agent that must input terminal commands and I would like to be able to fine tune (and align it) on domain specific data. I know how to fine tune and align LLM using SFT and alignment method like DPO or ORPO . I would like to know if there is a method to fine tune an LLM on multi steps tool calling problems.
Thanks!
Beta Was this translation helpful? Give feedback.
All reactions