kir152

kiran kir152

2 followers · 0 following

Pinned

vllm Public

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python

flex_head_fa Public

Forked from xiayuqing0622/flex_head_fa

Fast and memory-efficient exact attention

Python

sglang Public

Forked from sgl-project/sglang

SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.

Python

MoBA Public

Forked from MoonshotAI/MoBA

MoBA: Mixture of Block Attention for Long-Context LLMs

Python

s1 Public

Forked from simplescaling/s1

s1: Simple test-time scaling

Python

SelfCite Public

Implementation of "SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models"

Python 4