Shixin Jiang1∗,
Zerui Chen1,
Jiafeng Liang1,
Yanyan Zhao1,
Ming Liu1,2†,
Bing Qin1,2
1Harbin Institute of Technology, Harbin, China
2Peng Cheng Laboratory, Shenzhen, China
This repository contains the resources for EMNLP2024 paper Infrared-LLaVA: Enhancing Understanding of Infrared Images in Multi-Modal Large Language Models
For more details, please refer to the paper: Infrared-LLaVA: Enhancing Understanding of Infrared Images in Multi-Modal Large Language Models.
- 2025/04/19 We have updated our dataset.
- 2024/09/22 Our paper is accepted by EMNLP2024 Finding.