This repo contains the official PyTorch implementation of vLMIG: Improving Visual Commonsense in Language Models via Multiple Image Generation
deep-learning language-model vision-and-language multimodal-deep-learning visual-commonsense-reasoning visual-commonsense
Updated Jul 1, 2024 - Python