VisionAddOn
Currently, the only multi-modal models that have been migrated to the "unified" architecture are Gemma3 and Pixtral:
`mlx-engine/mlx_engine/model_kit/model_kit.py`, lines 35 to 38 in ecc2cf4
Extending this pattern to Qwen2.5VL/Qwen2VL is desired.
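To make "this pattern" concrete, here is a minimal sketch of what registering a new add-on could look like. It assumes the referenced lines hold a mapping from a model type string to a vision add-on class. The names `BaseVisionAddOn`, `VISION_ADD_ON_MAP`, and `select_vision_add_on` are hypothetical placeholders, and the classes are empty stand-ins rather than the real `mlx-engine` implementations. The keys are assumed to match the `model_type` field of each model's `config.json`.

```python
from typing import Dict, Optional, Type


class BaseVisionAddOn:
    """Hypothetical stand-in for a vision add-on (image -> embeddings)."""

    def __init__(self, model_path: str) -> None:
        self.model_path = model_path


class Gemma3VisionAddOn(BaseVisionAddOn):
    """Placeholder for the already-migrated Gemma3 add-on."""


class PixtralVisionAddOn(BaseVisionAddOn):
    """Placeholder for the already-migrated Pixtral add-on."""


class Qwen2_5_VLVisionAddOn(BaseVisionAddOn):
    """Placeholder for the add-on proposed in this issue."""


# Hypothetical registry: model type -> add-on class.
VISION_ADD_ON_MAP: Dict[str, Type[BaseVisionAddOn]] = {
    "gemma3": Gemma3VisionAddOn,
    "pixtral": PixtralVisionAddOn,
    "qwen2_5_vl": Qwen2_5_VLVisionAddOn,  # the new entry
}


def select_vision_add_on(model_type: str, model_path: str) -> Optional[BaseVisionAddOn]:
    """Return an add-on instance for migrated model types, else None."""
    add_on_cls = VISION_ADD_ON_MAP.get(model_type)
    if add_on_cls is None:
        return None  # caller falls back to the non-unified path
    return add_on_cls(model_path)
```

With a registry like this, supporting Qwen2VL as well would be one more entry (for example under `"qwen2_vl"`), provided the same add-on class can serve both model families.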
Relevant `mlx-vlm` components:

Relevant `mlx-lm` components:

This will likely look like:
- `mlx-lm` (including MRoPE; see https://arxiv.org/abs/2502.13923 for details, and "Apply PR #319 fixes to Qwen 2.5VL position id" #349 for in-progress work in mlx-vlm)
- `Qwen2_5_VLVisionAddOn`, wired into `ModelKit`
- `mlx-engine` tests still pass
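Since MRoPE is the part most likely to need care, the following is a rough sketch of how its three-component position ids are laid out for a prompt that mixes text and still images. It follows the scheme as I understand it from the Qwen2-VL and Qwen2.5-VL reports, and it is not the `mlx-lm` or `mlx-vlm` code. The function name and the segment format are made up for illustration, and video handling (where Qwen2.5-VL ties temporal ids to absolute time) is left out.

```python
from typing import List, Tuple, Union

# Hypothetical segment format:
#   ("text", n_tokens) or ("image", grid_h, grid_w)
# where grid_h x grid_w is assumed to be the image token grid
# after any spatial merging of patches.
Segment = Union[Tuple[str, int], Tuple[str, int, int]]


def mrope_position_ids(segments: List[Segment]) -> List[Tuple[int, int, int]]:
    """Return (temporal, height, width) position ids for each token."""
    positions: List[Tuple[int, int, int]] = []
    next_pos = 0  # first position id available to the next segment
    for seg in segments:
        if seg[0] == "text":
            n_tokens = seg[1]
            # Text tokens use the same id on all three axes,
            # which reduces to ordinary 1D RoPE.
            for i in range(n_tokens):
                p = next_pos + i
                positions.append((p, p, p))
            next_pos += n_tokens
        elif seg[0] == "image":
            grid_h, grid_w = seg[1], seg[2]
            # Image tokens share one temporal id; height and width ids
            # follow the token's row and column in the grid.
            for row in range(grid_h):
                for col in range(grid_w):
                    positions.append((next_pos, next_pos + row, next_pos + col))
            # The next segment starts after the largest id used so far.
            next_pos += max(grid_h, grid_w)
        else:
            raise ValueError(f"unknown segment kind: {seg[0]!r}")
    return positions
```

For example, `mrope_position_ids([("text", 2), ("image", 2, 3), ("text", 1)])` yields nine position triples: the two leading text tokens advance all three axes together, the six image tokens share a temporal id, and the trailing text token resumes after the largest id the image used.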