Skip to content
Change the repository type filter

All

    Repositories list

    • malaya

      Public
      Natural Language Toolkit for Malaysian language, https://malaya.readthedocs.io/
      Jupyter Notebook
      132501421Updated Aug 18, 2025Aug 18, 2025
    • Speech Toolkit for Malaysian language, https://malaya-speech.readthedocs.io/
      Jupyter Notebook
      4626540Updated Aug 18, 2025Aug 18, 2025
    • A Neural Audio Codec (NAC) for Universal Audio
      Python
      4000Updated Aug 8, 2025Aug 8, 2025
    • We gather Malaysian dataset! https://malaysian-dataset.readthedocs.io/
      Jupyter Notebook
      11032262Updated Aug 6, 2025Aug 6, 2025
    • Currently this chat widget optimized for https://nous.my, but to change to use your own should be super easy to do it.
      Vue
      3800Updated Jul 24, 2025Jul 24, 2025
    • [ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling
      Python
      100000Updated Jun 23, 2025Jun 23, 2025
    • [ACL 2025 Main] UniCodec: a unified audio codec with a single codebook to support multi-domain audio data, including speech, music, and sound
      Python
      7400Updated Jun 23, 2025Jun 23, 2025
    • A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      9.5k000Updated Jun 20, 2025Jun 20, 2025
    • A TTS model capable of generating ultra-realistic dialogue in one pass.
      Python
      1.5k100Updated May 29, 2025May 29, 2025
    • trl-fix

      Public
      Train transformer language models with reinforcement learning.
      Python
      2.1k000Updated May 26, 2025May 26, 2025
    • Emilia

      Public
      Fork open-mmlab/Amphion Emilia
      Python
      0000Updated May 25, 2025May 25, 2025
    • Fused kernel chunk loss to include LoRA to reduce memory, support DeepSpeed ZeRO3.
      Python
      1120Updated Apr 23, 2025Apr 23, 2025
    • csm

      Public
      A Conversational Speech Generation Model
      Python
      1.4k000Updated Mar 27, 2025Mar 27, 2025
    • 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
      Python
      30k100Updated Mar 15, 2025Mar 15, 2025
    • 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
      Python
      1.2k000Updated Feb 26, 2025Feb 26, 2025
    • CCE for LoRA LM Head
      Python
      43000Updated Feb 5, 2025Feb 5, 2025
    • High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese, Korean and Malay.
      Python
      925300Updated Feb 5, 2025Feb 5, 2025
    • StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
      Python
      604300Updated Feb 5, 2025Feb 5, 2025
    • Inference and training library for high-quality TTS models.
      Python
      570200Updated Feb 3, 2025Feb 3, 2025
    • Train transformer language models with reinforcement learning.
      Python
      2.1k500Updated Feb 2, 2025Feb 2, 2025
    • Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)
      Python
      44100Updated Jan 25, 2025Jan 25, 2025
    • Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions
      Python
      54000Updated Jan 23, 2025Jan 23, 2025
    • cookbook

      Public
      cookbooks 📖 for Mesolitica products!
      Jupyter Notebook
      1900Updated Jan 20, 2025Jan 20, 2025
    • F5-TTS

      Public
      Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
      Python
      1.9k100Updated Jan 17, 2025Jan 17, 2025
    • 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
      Python
      30k100Updated Jan 14, 2025Jan 14, 2025
    • 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
      Python
      30k000Updated Dec 13, 2024Dec 13, 2024
    • vocos

      Public
      Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
      Python
      114000Updated Dec 13, 2024Dec 13, 2024
    • CCE for Whisper
      Python
      43300Updated Dec 11, 2024Dec 11, 2024
    • 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
      Python
      30k200Updated Nov 25, 2024Nov 25, 2024
    • Brand new TTS solution
      Python
      1.9k000Updated Nov 18, 2024Nov 18, 2024