Repositories list
20 repositories
Agentic-ADK
Public

Pixelle-MCP
Public
An Open-Source Multimodal AIGC Solution based on ComfyUI + MCP + LLM https://pixelle.ai

Ovis
Public
A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

Marco-Voice
Public
- Awesome Unified Multimodal Models

Marco-Bench-MIF
Public

Ovis-U1
Public
A unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerful framework.

TeEFusion
Public
TeEFusion: Blending Text Embeddings to Distill Classifier-Free Guidance (ICCV 2025)

flashinfer
Public

UNIC-Adapter
Public

Parrot
Public
🎉 The code repository for "Parrot: Multilingual Visual Instruction Tuning" in PyTorch.

Marco-o1
Public

TransBench
Public

TG-LLaVA
Public

Wings
Public
The code repository for "Wings: Learning Multimodal LLMs without Text-only Forgetting" [NeurIPS 2024]

M3Bench
Public

Meissonic
Public

AutoGPTQ
Public