    Repositories list

    • Python
      0100 Updated Aug 22, 2025
    • Python
      409000 Updated Aug 21, 2025
    • Python
      0000 Updated Aug 21, 2025
    • Friendli Suite python SDK
      Python
      1100 Updated Aug 20, 2025
    • examples

      Public
      FriendliAI Example and Tutorial Code
      Jupyter Notebook
      0600 Updated Aug 6, 2025
    • FlashInfer: Kernel Library for LLM Serving
      Cuda
      461000 Updated Jul 8, 2025
    • cutlass

      Public
      CUDA Templates for Linear Algebra Subroutines
      C++
      1.4k000 Updated Jul 6, 2025
    • Ruby
      0000 Updated Jul 4, 2025
    • friendli-client

      Public archive
      [⛔️ DEPRECATED] Friendli: the fastest serving engine for generative AI
      Python
      74821 Updated Jun 25, 2025
    • TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that orchestrate the inference execution in performant way.
      C++
      1.7k000 Updated Jun 23, 2025
    • 0000 Updated Jun 20, 2025
    • Python
      29000 Updated Jun 5, 2025
    • litellm

      Public
      Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)
      Python
      3.9k000 Updated May 27, 2025
    • Golang Playground Driven by AI, Created from FriendliAI Hackathon
      TypeScript
      0800 Updated May 12, 2025
    • ai

      Public
      Build AI-powered applications with React, Svelte, Vue, and Solid
      TypeScript
      2.8k001 Updated May 9, 2025
    • gorilla

      Public
      Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)
      Python
      1.2k000 Updated Apr 5, 2025
    • langchain

      Public
      🦜🔗 Build context-aware reasoning applications
      Jupyter Notebook
      19k000 Updated Apr 5, 2025
    • LlamaIndex (formerly GPT Index) is a data framework for your LLM applications
      Python
      6.3k000 Updated Apr 5, 2025
    • 🦜🔗 Build context-aware reasoning applications 🦜🔗
      TypeScript
      2.7k000 Updated Apr 5, 2025
    • Public repo for HF blog posts
      Jupyter Notebook
      903100 Updated Jan 22, 2025
    • FMO (Friendli Model Optimizer)
      Python
      31231 Updated Jan 8, 2025
    • A framework for few-shot evaluation of autoregressive language models.
      Python
      2.7k100 Updated Jan 2, 2025
    • FriendliAI LLM Hackathon tutorial scripts
      Jupyter Notebook
      0601 Updated Dec 2, 2024
    • 0000 Updated Nov 12, 2024
    • Python
      04710 Updated Sep 7, 2024
    • Website for the Weaviate vector database
      MDX
      140000 Updated Aug 28, 2024
    • weaviate

      Public
      Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database.
      Go
      1k000 Updated Aug 22, 2024
    • Private fork of ariga/atlas-go-sdk
      Go
      0000 Updated Aug 21, 2024
    • aipm

      Public archive
      AI Agent who manages your Jira project
      Python
      22000 Updated Jun 23, 2024
    • A Locust metrics exporter for Prometheus
      Go
      38000 Updated Mar 18, 2024