Skip to content
View kir152's full-sized avatar
👾
👾

Block or report kir152

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. vllm vllm Public

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python

  2. flex_head_fa flex_head_fa Public

    Forked from xiayuqing0622/flex_head_fa

    Fast and memory-efficient exact attention

    Python

  3. sglang sglang Public

    Forked from sgl-project/sglang

    SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.

    Python

  4. MoBA MoBA Public

    Forked from MoonshotAI/MoBA

    MoBA: Mixture of Block Attention for Long-Context LLMs

    Python

  5. s1 s1 Public

    Forked from simplescaling/s1

    s1: Simple test-time scaling

    Python

  6. SelfCite SelfCite Public

    Implementation of "SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models"

    Python 4