Mesolitica

All

48 repositories

malaya
Public
Natural Language Toolkit for Malaysian language, https://malaya.readthedocs.io/
natural-language-processing sentiment-analysis tensorflow language-detection entity-framework normalizer ner emotion-analysis pos-tagging malay
Jupyter Notebook
•
MIT License
•132•501•4•21•Updated Aug 18, 2025Aug 18, 2025
malaya-speech
Public
Speech Toolkit for Malaysian language, https://malaya-speech.readthedocs.io/
Jupyter Notebook
•
MIT License
•46•265•4•0•Updated Aug 18, 2025Aug 18, 2025
DistilCodec
Public
A Neural Audio Codec (NAC) for Universal Audio
Python
•4•0•0•0•Updated Aug 8, 2025Aug 8, 2025
malaysian-dataset
Public
We gather Malaysian dataset! https://malaysian-dataset.readthedocs.io/
text-mining corpus malaysia bahasa-melayu manglish malay-dataset
Jupyter Notebook
•
Apache License 2.0
•110•322•6•2•Updated Aug 6, 2025Aug 6, 2025
nous-chat-widget
Public
Currently this chat widget optimized for https://nous.my, but to change to use your own should be super easy to do it.
Vue
•3•8•0•0•Updated Jul 24, 2025Jul 24, 2025
WavTokenizer-package
Public
[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling
Python
•
MIT License
•100•0•0•0•Updated Jun 23, 2025Jun 23, 2025
UniCodec-fix
Public
[ACL 2025 Main] UniCodec: a unified audio codec with a single codebook to support multi-domain audio data, including speech, music, and sound
Python
•7•4•0•0•Updated Jun 23, 2025Jun 23, 2025
vllm-llmaudio
Public
A high-throughput and memory-efficient inference and serving engine for LLMs
Python
•
Apache License 2.0
•9.5k•0•0•0•Updated Jun 20, 2025Jun 20, 2025
dia-fix-compile
Public
A TTS model capable of generating ultra-realistic dialogue in one pass.
Python
•
Apache License 2.0
•1.5k•1•0•0•Updated May 29, 2025May 29, 2025
trl-fix
Public
Train transformer language models with reinforcement learning.
Python
•
Apache License 2.0
•2.1k•0•0•0•Updated May 26, 2025May 26, 2025
Emilia
Public
Fork open-mmlab/Amphion Emilia
Python
•0•0•0•0•Updated May 25, 2025May 25, 2025
Chunk-loss-LoRA
Public
Fused kernel chunk loss to include LoRA to reduce memory, support DeepSpeed ZeRO3.
Python
•1•1•2•0•Updated Apr 23, 2025Apr 23, 2025
csm
Public
A Conversational Speech Generation Model
Python
•
Apache License 2.0
•1.4k•0•0•0•Updated Mar 27, 2025Mar 27, 2025
initial-paged-flash-attention
Public
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python
•
Apache License 2.0
•30k•1•0•0•Updated Mar 15, 2025Mar 15, 2025
accelerate-torch-compile-speechlm
Public
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
Python
•
Apache License 2.0
•1.2k•0•0•0•Updated Feb 26, 2025Feb 26, 2025
ml-cross-entropy-lora-lm-head
Public
CCE for LoRA LM Head
Python
•
Other
•43•0•0•0•Updated Feb 5, 2025Feb 5, 2025
MeloTTS-MS
Public
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese, Korean and Malay.
Python
•
MIT License
•925•3•0•0•Updated Feb 5, 2025Feb 5, 2025
StyleTTS2-MS
Public
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Python
•
MIT License
•604•3•0•0•Updated Feb 5, 2025Feb 5, 2025
async-parler-tts
Public
Inference and training library for high-quality TTS models.
Python
•
Apache License 2.0
•570•2•0•0•Updated Feb 3, 2025Feb 3, 2025
memory-efficient-grpo
Public
Train transformer language models with reinforcement learning.
Python
•
Apache License 2.0
•2.1k•5•0•0•Updated Feb 2, 2025Feb 2, 2025
AuxiliaryASR-Phonemizer
Public
Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)
Python
•
MIT License
•44•1•0•0•Updated Jan 25, 2025Jan 25, 2025
PL-BERT-MS
Public
Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions
Python
•
MIT License
•54•0•0•0•Updated Jan 23, 2025Jan 23, 2025
cookbook
Public
cookbooks 📖 for Mesolitica products!
Jupyter Notebook
•
MIT License
•1•9•0•0•Updated Jan 20, 2025Jan 20, 2025
F5-TTS
Public
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
Python
•
MIT License
•1.9k•1•0•0•Updated Jan 17, 2025Jan 17, 2025
qwen2audio-multipack
Public
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python
•
Apache License 2.0
•30k•1•0•0•Updated Jan 14, 2025Jan 14, 2025
llama-flex-attention-multipack
Public
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python
•
Apache License 2.0
•30k•0•0•0•Updated Dec 13, 2024Dec 13, 2024
vocos
Public
Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis
Python
•
MIT License
•114•0•0•0•Updated Dec 13, 2024Dec 13, 2024
ml-cross-entropy-whisper
Public
CCE for Whisper
Python
•
Other
•43•3•0•0•Updated Dec 11, 2024Dec 11, 2024
t5-sdpa-multipack
Public
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python
•
Apache License 2.0
•30k•2•0•0•Updated Nov 25, 2024Nov 25, 2024
fish-speech
Public
Brand new TTS solution
Python
•
Other
•1.9k•0•0•0•Updated Nov 18, 2024Nov 18, 2024