TorchTitan for Multimodal models such as Qwen 2.5 VL or InternVL2 #1386
anindya-saha
started this conversation in
Ideas
Replies: 1 comment
-
Thanks for your interest. We have plans to extend to multimodal but have had limited bandwidth. In terms of community contribution, I'll share contributing guidelines for adding new models soon. Meanwhile, for each model to be added in torchtitan, there should be clear reasons why adding this particular model (e.g. is it SotA among its category at the moment?). If you are interested in working on this, a short proposal would be helpful. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Is there any plan to extend TorchTitan beyond LLM to VLM or Multimodal models such as Qwen 2.5 VL or InternVL2 ? Would there be any ideas from the community where to start from ?
Beta Was this translation helpful? Give feedback.
All reactions