Infrared-LLaVA: Enhancing Understanding of Infrared Images in Multi-Modal Large Language Models [EMNLP 2024 Findings]

threegold116/Infrared-LLaVA

Infrared-LLaVA: Enhancing Understanding of Infrared Images in Multi-Modal Large Language Models

Shixin Jiang1∗, Zerui Chen1, Jiafeng Liang1, Yanyan Zhao1, Ming Liu1,2†, Bing Qin1,2
1Harbin Institute of Technology, Harbin, China
2Peng Cheng Laboratory, Shenzhen, China

This repository contains the resources for the EMNLP 2024 Findings paper "Infrared-LLaVA: Enhancing Understanding of Infrared Images in Multi-Modal Large Language Models".

[Figure: taxonomy]

For more details, please refer to the paper: Infrared-LLaVA: Enhancing Understanding of Infrared Images in Multi-Modal Large Language Models.

🎉 Updates

  • 2025/04/19 We have updated our dataset.
  • 2024/09/22 Our paper was accepted to EMNLP 2024 Findings.

Dataset

Benchmark
