Skip to content
@DAMO-NLP-SG

Language Technology Lab at Alibaba DAMO Academy

Pinned Loading

  1. VideoLLaMA3 VideoLLaMA3 Public

    Frontier Multimodal Foundation Models for Image and Video Understanding

    Jupyter Notebook 1k 73

  2. DAMO-SeaLLMs DAMO-SeaLLMs Public

    [ACL 2024 Demo] SeaLLMs - Large Language Models for Southeast Asia

    JavaScript 173 17

  3. CoI-Agent CoI-Agent Public

    Official code for paper: Chain of Ideas: Revolutionizing Research via Novel Idea Development with LLM Agents

    Python 475 28

  4. Inf-CLIP Inf-CLIP Public

    [CVPR 2025 Highlight] The official CLIP training codebase of Inf-CL: "Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss". A super memory-efficiency CLIP training sc…

    Python 269 12

  5. multimodal_textbook multimodal_textbook Public

    [ICCV 2025 Highlight] The official repository for "2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining"

    Python 174 16

  6. VideoLLaMA2 VideoLLaMA2 Public

    VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

    Python 1.2k 83

Repositories

Showing 10 of 53 repositories

Top languages

Loading…

Most used topics

Loading…