GitHub - threegold116/Infrared-LLaVA: Infrared-LLaVA: Enhancing Understanding of Infrared Images in Multi-Modal Large Language Models[EMNLP2024 Finding]

Infrared-LLaVA: Enhancing Understanding of Infrared Images in Multi-Modal Large Language Models

Shixin Jiang^1∗, Zerui Chen¹, Jiafeng Liang¹, Yanyan Zhao¹, Ming Liu^1,2†, Bing Qin^1,2

¹Harbin Institute of Technology, Harbin, China

²Peng Cheng Laboratory, Shenzhen, China

This repository contains the resources for EMNLP2024 paper Infrared-LLaVA: Enhancing Understanding of Infrared Images in Multi-Modal Large Language Models

For more details, please refer to the paper: Infrared-LLaVA: Enhancing Understanding of Infrared Images in Multi-Modal Large Language Models.

🎉 Updates

2025/04/19 We have updated our dataset.
2024/09/22 Our paper is accepted by EMNLP2024 Finding.

Dataset

Benchmark

Infrared-Benchmark

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
figure		figure
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Infrared-LLaVA: Enhancing Understanding of Infrared Images in Multi-Modal Large Language Models

🎉 Updates

Dataset

Benchmark

About

Uh oh!

Releases

Packages

License

threegold116/Infrared-LLaVA

Folders and files

Latest commit

History

Repository files navigation

Infrared-LLaVA: Enhancing Understanding of Infrared Images in Multi-Modal Large Language Models

🎉 Updates

Dataset

Benchmark

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Packages