
†Work done at KlingAI ‡Project Leader ✉Corresponding Author
🎉 Accepted by ICML 2025 Spotlight 🎉
[📃 Paper ] [📦 Code ] [⚒️ Project ] [📅 Slide ]

TL;DR: We i) identify attention deficit disorder as a critical barrier hindering fine-grained content understanding in MLLMs; ii) introduce a modular duplex attention mechanism to mitigate modality bias and enhance attention score justification; and iii) develop MODA-based MLLMs that enable fine-grained multimodal understanding across perception, cognition, and emotion tasks.
- 🔥2025-07-10: Creating repository. The code is uploading ...
- 2025-05-01: MODA has been accepted to ICML 2025!