Fine tune multi-steps agent #24242

Cgrandjean · 2024-07-14T17:41:10Z

Cgrandjean
Jul 14, 2024

Hello ,
I really like langchain and I like the ability to create agents. I have recently encountered a problem about it.I have an agent that must input terminal commands and I would like to be able to fine tune (and align it) on domain specific data. I know how to fine tune and align LLM using SFT and alignment method like DPO or ORPO . I would like to know if there is a method to fine tune an LLM on multi steps tool calling problems.
Thanks!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fine tune multi-steps agent #24242

Uh oh!

{{title}}

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

Fine tune multi-steps agent #24242

Uh oh!

Cgrandjean Jul 14, 2024

Replies: 0 comments

Cgrandjean
Jul 14, 2024