Effectiveness Assessment of Recent Large Vision-Language Models

This is the source code and result of our "Effectiveness Assessment of Recent Large Vision-Language Models".

Evaluation

Download the data

The data used in our testbed can be downloaded from the following links: DUTS, SOC, COD10K, Trans10K, ColonDB, ETIS, ISIC, MVTec AD, VisA.

Recognition

Modify the prompt in "client_shikra.py", "demo_v2_minigpt.py", and the file in LLaVA, and run these files to get the recognition results of Shikra, MiniGPT-v2, and LLaVA-1.5.
Evaluate the results using "CLS_Metrics.py".

Localization

Modify the prompt in "client_shikra.py", "demo_v2_minigpt.py", and the file in LLaVA, and run these files to get the detection results of Shikra, MiniGPT-v2, and LLaVA-1.5.
Evaluate the results using "Detection_Metrics.py" for MiniGPT-v2 and LLaVA-1.5, and "Detection_Metrics_Shikra.py" exclusively for Shikra.
Get segmentation results from these models using the above detection results and "seg_with_SAM.py".
These results are then evaluated using segmentation evaluation metrics.

Citation

Please cite our paper if you find the work useful:

    @article{jiang2024effectiveness,
    title={Effectiveness assessment of recent large vision-language models},
    author={Jiang, Yao and Yan, Xinyu and Ji, Ge-Peng and Fu, Keren and Sun, Meijun and Xiong, Huan and Fan, Deng-Ping and Khan, Fahad Shahbaz},
    journal={Visual Intelligence},
    volume={2},
    number={1},
    pages={17},
    year={2024},
    publisher={Springer}
    }

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
.vscode		.vscode
Object-Detection-Metrics-master		Object-Detection-Metrics-master
bbox_label		bbox_label
results		results
CLS_Metrics.py		CLS_Metrics.py
Detection_Metrics.py		Detection_Metrics.py
Detection_Metrics_Shikra.py		Detection_Metrics_Shikra.py
README.md		README.md
client_shikra.py		client_shikra.py
demo_v2_minigpt.py		demo_v2_minigpt.py
generate_box_from_mask.py		generate_box_from_mask.py
seg_with_SAM.py		seg_with_SAM.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Effectiveness Assessment of Recent Large Vision-Language Models

Evaluation

Download the data

Recognition

Localization

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Languages

jiangyao-scu/LVLMs-Evaluation

Folders and files

Latest commit

History

Repository files navigation

Effectiveness Assessment of Recent Large Vision-Language Models

Evaluation

Download the data

Recognition

Localization

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages