perception-evaluation

Here is 1 public repository matching this topic...

zeyofu / BLINK_Benchmark

This repo contains evaluation code for the paper "BLINK: Multimodal Large Language Models Can See but Not Perceive". https://arxiv.org/abs/2404.12390 [ECCV 2024]

benchmark natural-language-processing ai computer-vision perception multimodal-learning multimodal vision-and-language 3d-understanding multimodal-large-language-models perception-evaluation

Updated Jul 3, 2024
Python

Improve this page

Add a description, image, and links to the perception-evaluation topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the perception-evaluation topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perception-evaluation

Here is 1 public repository matching this topic...

zeyofu / BLINK_Benchmark

Improve this page

Add this topic to your repo