    Repositories list

    • vlm-lens

      Public
      Extracting internal representations from vision-language models. Doc: https://compling-wat.github.io/vlm-lens/
      Python
      01081 Updated Aug 9, 2025
    • FORG3D

      Public
      Customizable 3D Rendering Tool
      Python
      0100 Updated Aug 2, 2025
    • [CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
      Python
      50000 Updated Jul 26, 2025
    • Preset Blender objects for the FORG3D toolkit
      0000 Updated Mar 28, 2025
    • Janus

      Public
      Janus-Series: Unified Multimodal Understanding and Generation Models
      Python
      2.2k000 Updated Feb 1, 2025
    • Practice tasks for the CompLING lab internship application.
      TeX
      01200 Updated Jan 6, 2025