The official implementation of "Learning Compact Vision Tokens for Efficient Large Multimodal Models"
llama efficient-inference efficient-deep-learning llava large-vision-language-models large-multimodal-models token-fusion token-merging vision-token-merging
-
Updated
Jun 11, 2025 - Python