AMAAI Lab

All

51 repositories

t2m-inferalign
Public
Improving Symbolic Music Generation with Inference-Time Alignment
midi genai text2midi inference-alignment
Python
•
MIT License
•0•15•1•0•Updated Aug 2, 2025Aug 2, 2025
PreBit
Public
This is the repo accompanying the paper: "A multimodal model with Twitter FinBERT embeddings for extreme price movement prediction of Bitcoin"
Jupyter Notebook
•4•9•0•0•Updated Jul 29, 2025Jul 29, 2025
SonicVerse
Public
SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning
Python
•
MIT License
•2•42•2•0•Updated Jul 28, 2025Jul 28, 2025
Music2Emotion
Public
Music2Emo: Towards Unified Music Emotion Recognition across Dimensional and Categorical Models
Python
•
MIT License
•6•25•0•0•Updated Jul 6, 2025Jul 6, 2025
to-embody-or-not
Public
Repo for paper: To Embody or Not: The Effect Of Embodiment On User Perception Of LLM-based Conversational Agents
avatar ai embodied llm embodiedai
Python
•
Apache License 2.0
•1•1•0•0•Updated Jun 4, 2025Jun 4, 2025
mustango
Public
Mustango: Toward Controllable Text-to-Music Generation
diffusion-models text-to-audio text-to-music large-language-models
Python
•
MIT License
•30•372•8•0•Updated Jun 2, 2025Jun 2, 2025
MelodySim
Public
MelodySim: Measuring Melody-aware Music Similarity for Plagiarism Detection
similarity melody representation-learning plagiarism
Python
•
MIT License
•0•7•0•0•Updated May 29, 2025May 29, 2025
JamendoMaxCaps
Public
JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks
music captions dataset
Python
•
MIT License
•0•38•0•0•Updated May 24, 2025May 24, 2025
awesome-MER
Public
A curated list of Datasets, Models and Papers for Music Emotion Recognition (MER)
3•38•0•0•Updated Apr 27, 2025Apr 27, 2025
DART
Public
Demo for DART, Audio Imagination workshop submission in NeurIPS 2024
Python
•2•11•2•0•Updated Apr 15, 2025Apr 15, 2025
Text2midi
Public
Text2midi is the first end-to-end model for generating MIDI files from textual descriptions. By leveraging pretrained large language models and a powerful autoregressive transformer decoder, text2midi allows users to create symbolic music that aligns with detailed textual prompts, including musical attributes like chords, tempo, and style.
music ai midi
Python
•
MIT License
•12•102•2•0•Updated Feb 28, 2025Feb 28, 2025
mirflex
Public
Music Information Retrieval Feature Library for Extraction
Python
•
MIT License
•8•20•0•0•Updated Nov 14, 2024Nov 14, 2024
megamusicaps
Public
Python
•
MIT License
•0•11•0•1•Updated Nov 14, 2024Nov 14, 2024
cross-dataset-emotion-alignment
Public
code for Leveraging LLM Embeddings for Cross Dataset Label Alignment and Zero Shot Music Emotion Prediction
Python
•
MIT License
•0•8•1•0•Updated Oct 16, 2024Oct 16, 2024
Audio-Music-AI-Research-Resources
Public
0•1•0•0•Updated Sep 3, 2024Sep 3, 2024
survey-music-nlp
Public
Repository for "Natural Language Processing Methods for Symbolic Music Generation and Information Retrieval Systems: a Survey"
1•1•0•0•Updated Aug 20, 2024Aug 20, 2024
IAMM
Public
An exploration of how generative text-to-music AI models can be used for emotion guidance
0•1•0•0•Updated Jul 31, 2024Jul 31, 2024
Video2Music
Public
Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model
ai deep-learning music-generation affective-computing multimodal
Python
•
MIT License
•23•180•1•0•Updated Jul 30, 2024Jul 30, 2024
MidiCaps
Public
A large-scale dataset of caption-annotated MIDI files.
Python
•
MIT License
•3•70•1•0•Updated Jul 23, 2024Jul 23, 2024
DisfluencySpeech
Public
Resources for DisfluencySpeech
MIT License
•0•8•0•0•Updated Jul 15, 2024Jul 15, 2024
midi-miner
Public
Python MIDI track classifier and tonal tension calculation based on spiral array theory
Python
•
GNU General Public License v3.0
•23•0•0•0•Updated Jun 18, 2024Jun 18, 2024
Accented-TTS-MLVAE-ADV
Public
Python
•0•6•0•0•Updated Jun 5, 2024Jun 5, 2024
CVAE-Tacotron
Public
Conditional VAE for Accented Speech Generation
HTML
•7•1•0•0•Updated Jun 4, 2024Jun 4, 2024
CM-HRNN
Public
Hierarchical Recurrent Neural Networks for Conditional Melody Generation with Long-term Structure
Python
•2•1•0•0•Updated May 31, 2024May 31, 2024
emotionweb
Public
Website emotion guidance
JavaScript
•0•1•0•0•Updated Mar 14, 2024Mar 14, 2024
genmusic_demo_list
Public
a list of demo websites for automatic music generation research
52•1•0•0•Updated Nov 15, 2023Nov 15, 2023
ai-audio-datasets-list
Public
This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc.
MIT License
•77•3•0•0•Updated Oct 31, 2023Oct 31, 2023
kylo-ren-app
Public
Web interface for AI music generation models
JavaScript
•2•1•0•0•Updated Oct 19, 2023Oct 19, 2023
singapore-music-classifier
Public
Code for paper A dataset and classification model for Malay, Hindi, Tamil and Chinese music
music classification singapore ismir
Jupyter Notebook
•0•1•0•0•Updated Oct 19, 2023Oct 19, 2023
FundamentalMusicEmbedding
Public
Fundamental Music Embedding, FME
Python
•10•0•0•0•Updated Oct 16, 2023Oct 16, 2023