Skip to content
View Knarf04's full-sized avatar

Block or report Knarf04

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. Mamba-Megatron-DeepSpeed Mamba-Megatron-DeepSpeed Public

    Forked from deepspeedai/Megatron-DeepSpeed

    Ongoing research training transformer language models at scale, including: BERT & GPT-2

    Python

  2. LongRoPE LongRoPE Public

    Forked from microsoft/LongRoPE

    LongRoPE is a novel method that can extends the context window of pre-trained LLMs to an impressive 2048k tokens.

    Python

  3. transformers transformers Public

    Forked from huggingface/transformers

    🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

    Python

  4. long_context_eval long_context_eval Public

    Evaluation scripts for long context tasks.

    Python 1

  5. RULER RULER Public

    Forked from NVIDIA/RULER

    This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

    Python

  6. vllm vllm Public

    Forked from vllm-project/vllm

    A high-throughput and memory-efficient inference and serving engine for LLMs

    Python