Skip to content
Change the repository type filter

All

    Repositories list

    • LLaVA-NeXT architecture
      Jupyter Notebook
      412100Updated Oct 17, 2025Oct 17, 2025
    • Watching the Watchers: Exposing Gender Disparities in Machine Translation Quality Estimation
      Jupyter Notebook
      0000Updated Oct 6, 2025Oct 6, 2025
    • Code for the paper "Instituto de Telecomunicações at IWSLT 2025: Aligning Small-Scale Speech and Language Models for Speech-to-Text Learning"
      Python
      0100Updated Sep 30, 2025Sep 30, 2025
    • adasplash

      Public
      AdaSplash: Adaptive Sparse Flash Attention (aka Flash Entmax Attention)
      Python
      12620Updated Sep 30, 2025Sep 30, 2025
    • lmms-eval

      Public
      Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
      Python
      398001Updated Sep 26, 2025Sep 26, 2025
    • asentmax

      Public
      Code for Long-Context Generalization with Sparse Attention.
      0000Updated Sep 26, 2025Sep 26, 2025
    • Jupyter Notebook
      21100Updated Sep 25, 2025Sep 25, 2025
    • MF2

      Public
      Python
      0400Updated Sep 24, 2025Sep 24, 2025
    • A package for sampling from Gibbs distributions during inference with LLMs.
      Python
      2910Updated Aug 14, 2025Aug 14, 2025
    • Python
      0100Updated Jul 15, 2025Jul 15, 2025
    • Python
      43000Updated Jun 24, 2025Jun 24, 2025
    • Ongoing research training transformer models at scale
      Python
      3.2k101Updated Jun 20, 2025Jun 20, 2025
    • From a+b to sparsemax(QK^T)V in Triton!
      Jupyter Notebook
      02700Updated Jun 19, 2025Jun 19, 2025
    • zsb

      Public
      Python
      0500Updated Jun 9, 2025Jun 9, 2025
    • treqa

      Public
      LLM-based QAG framework for MT Evaluation
      Python
      1311Updated May 13, 2025May 13, 2025
    • Repository containing code to reproduce results of the paper "Sparse Activations as Conformal Predictors".
      Jupyter Notebook
      1210Updated Apr 27, 2025Apr 27, 2025
    • A PyTorch native library for large model training
      Python
      568000Updated Apr 1, 2025Apr 1, 2025
    • fy-vi

      Public
      Jupyter Notebook
      0000Updated Mar 21, 2025Mar 21, 2025
    • doce

      Public
      This is the a repo of DOCE
      Python
      0200Updated Mar 14, 2025Mar 14, 2025
    • latim

      Public
      Jupyter Notebook
      0600Updated Feb 24, 2025Feb 24, 2025
    • CHM-Net

      Public
      Modern Hopfield Networks with Continuous-Time Memories
      Python
      1200Updated Feb 21, 2025Feb 21, 2025
    • 0000Updated Feb 17, 2025Feb 17, 2025
    • \infty-Video: A Training-Free Approach to Long Video Understanding via Continuous-Time Memory Consolidation
      Python
      01810Updated Feb 14, 2025Feb 14, 2025
    • ssm-mt

      Public
      Jupyter Notebook
      0100Updated Feb 8, 2025Feb 8, 2025
    • Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks
      Python
      515000Updated Feb 4, 2025Feb 4, 2025
    • HFYN

      Public
      Hopfield-Fenchel-Young Networks: A Unified Framework for Associative Memory Retrieval
      Jupyter Notebook
      0100Updated Jan 31, 2025Jan 31, 2025
    • Jupyter Notebook
      0000Updated Oct 17, 2024Oct 17, 2024
    • Jupyter Notebook
      1300Updated Oct 15, 2024Oct 15, 2024
    • Python
      0200Updated Oct 10, 2024Oct 10, 2024
    • axolotl

      Public
      Go ahead and axolotl questions
      Python
      1.2k000Updated Sep 26, 2024Sep 26, 2024