Skip to content
@IST-DASLab

IST Austria Distributed Algorithms and Systems Lab

Popular repositories Loading

  1. gptq gptq Public

    Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

    Python 2.2k 179

  2. marlin marlin Public

    FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

    Python 872 71

  3. sparsegpt sparsegpt Public

    Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

    Python 821 109

  4. PanzaMail PanzaMail Public

    Python 292 19

  5. qmoe qmoe Public

    Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".

    Python 277 22

  6. QUIK QUIK Public

    Repository for the QUIK project, enabling the use of 4bit kernels for generative inference - EMNLP 2024

    C++ 181 13

Repositories

Showing 10 of 63 repositories
  • FP-Quant Public
    IST-DASLab/FP-Quant’s past year of commit activity
    Python 23 0 1 1 Updated Aug 3, 2025
  • EvoPress Public
    IST-DASLab/EvoPress’s past year of commit activity
    Python 26 2 0 0 Updated Jul 30, 2025
  • qutlass Public

    QuTLASS: CUTLASS-Powered Quantized BLAS for Deep Learning

    IST-DASLab/qutlass’s past year of commit activity
    C++ 60 Apache-2.0 2 1 0 Updated Jul 15, 2025
  • IST-DASLab/ISTA-DASLab-Optimizers’s past year of commit activity
    Python 9 Apache-2.0 0 0 0 Updated Jun 30, 2025
  • QuEST Public

    Work in progress.

    IST-DASLab/QuEST’s past year of commit activity
    Jupyter Notebook 70 MIT 6 2 0 Updated Jun 29, 2025
  • Quartet Public
    IST-DASLab/Quartet’s past year of commit activity
    Jupyter Notebook 76 MIT 6 4 0 Updated Jun 27, 2025
  • Yolov8-Pose-Detection-on-Browser Public Forked from akbartus/Yolov8-Pose-Detection-on-Browser

    Example of YOLOv8 pose detection (estimation) on browser. It shows implementations powered by ONNX and TFJS served through JavaScript without any frameworks. It demonstrates pose detection (estimation) on image as well as live web camera,

    IST-DASLab/Yolov8-Pose-Detection-on-Browser’s past year of commit activity
    HTML 0 MIT 3 0 0 Updated Jun 13, 2025
  • MoE-Quant Public

    Code for data-aware compression of DeepSeek models

    IST-DASLab/MoE-Quant’s past year of commit activity
    Python 42 6 2 0 Updated Jun 10, 2025
  • influence_distillation Public

    Official implementation of Influence Distillation: https://www.arxiv.org/abs/2505.19051

    IST-DASLab/influence_distillation’s past year of commit activity
    Python 3 0 1 0 Updated May 29, 2025
  • PanzaMail Public
    IST-DASLab/PanzaMail’s past year of commit activity
    Python 292 Apache-2.0 19 4 6 Updated Apr 8, 2025

Most used topics

Loading…