    Repositories list

    • moshi

      Public
      Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
      Python
      7728.8k5212 Updated Aug 13, 2025
    • Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.
      Python
      2082.2k200 Updated Aug 13, 2025
    • unmute

      Public
      Make text LLMs listen and speak
      Python
      126800230 Updated Aug 11, 2025
    • sphn

      Public
      Python bindings for symphonia/opus: read various audio formats from Python and write Opus files
      Rust
      76510 Updated Jul 23, 2025
    • Python
      2927970 Updated Jul 7, 2025
    • Swift
      911110 Updated Jun 26, 2025
    • yomikomi

      Public
      A small Rust-based data loader
      Rust
      13100 Updated Jun 9, 2025
    • dactory

      Public
      Python
      34100 Updated Apr 30, 2025
    • hibiki

      Public
      Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation, where one waits for the end of the source utterance to start translating, Hibiki adapts its flow to accumulate just enough context to produce a correct translation in real time, chunk by chunk.
      Rust
      941.3k81 Updated Apr 15, 2025
    • moshivis

      Public
      Kyutai with an "eye"
      Python
      2721610 Updated Mar 26, 2025
    • kaudio

      Public
      Rust crate for some audio utilities
      Rust
      02600 Updated Mar 8, 2025
    • Proof of concept for running moshi/hibiki using WebRTC
      Rust
      12000 Updated Feb 28, 2025
    • JAX bindings for the flash-attention2 kernels
      C++
      0900 Updated Jan 16, 2025
    • ogg-table

      Public
      Ogg Vorbis reader with fast random access
      Rust
      1600 Updated Aug 29, 2024
    • JAX bindings for the flash-attention3 kernels
      C++
      11201 Updated Aug 6, 2024