    Repositories list

    • Improving Symbolic Music Generation with Inference-Time Alignment
      Python
      Updated Aug 2, 2025
    • PreBit
      This is the repo accompanying the paper: "A multimodal model with Twitter FinBERT embeddings for extreme price movement prediction of Bitcoin"
      Jupyter Notebook
      Updated Jul 29, 2025
    • SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning
      Python
      Updated Jul 28, 2025
    • Music2Emo: Towards Unified Music Emotion Recognition across Dimensional and Categorical Models
      Python
      Updated Jul 6, 2025
    • Repo for paper: To Embody or Not: The Effect Of Embodiment On User Perception Of LLM-based Conversational Agents
      Python
      Updated Jun 4, 2025
    • mustango
      Mustango: Toward Controllable Text-to-Music Generation
      Python
      Updated Jun 2, 2025
    • MelodySim
      MelodySim: Measuring Melody-aware Music Similarity for Plagiarism Detection
      Python
      Updated May 29, 2025
    • JamendoMaxCaps is a large-scale dataset of 362,000 instrumental Creative Commons tracks
      Python
      Updated May 24, 2025
    • A curated list of Datasets, Models and Papers for Music Emotion Recognition (MER)
      Updated Apr 27, 2025
    • DART
      Demo for DART, a submission to the Audio Imagination workshop at NeurIPS 2024
      Python
      Updated Apr 15, 2025
    • Text2midi
      Text2midi is the first end-to-end model for generating MIDI files from textual descriptions. By leveraging pretrained large language models and a powerful autoregressive transformer decoder, text2midi allows users to create symbolic music that aligns with detailed textual prompts, including musical attributes like chords, tempo, and style.
      Python
      Updated Feb 28, 2025
    • mirflex
      Music Information Retrieval Feature Library for Extraction
      Python
      Updated Nov 14, 2024
    • Python
      Updated Nov 14, 2024
    • Code for "Leveraging LLM Embeddings for Cross Dataset Label Alignment and Zero Shot Music Emotion Prediction"
      Python
      Updated Oct 16, 2024
    • Updated Sep 3, 2024
    • Repository for "Natural Language Processing Methods for Symbolic Music Generation and Information Retrieval Systems: a Survey"
      Updated Aug 20, 2024
    • IAMM
      An exploration of how generative text-to-music AI models can be used for emotion guidance
      Updated Jul 31, 2024
    • Video2Music: Suitable Music Generation from Videos using an Affective Multimodal Transformer model
      Python
      Updated Jul 30, 2024
    • MidiCaps
      A large-scale dataset of caption-annotated MIDI files.
      Python
      Updated Jul 23, 2024
    • Resources for DisfluencySpeech
      Updated Jul 15, 2024
    • Python MIDI track classifier and tonal tension calculation based on spiral array theory
      Python
      Updated Jun 18, 2024
    • Python
      Updated Jun 5, 2024
    • Conditional VAE for Accented Speech Generation
      HTML
      Updated Jun 4, 2024
    • CM-HRNN
      Hierarchical Recurrent Neural Networks for Conditional Melody Generation with Long-term Structure
      Python
      Updated May 31, 2024
    • Website emotion guidance
      JavaScript
      Updated Mar 14, 2024
    • A list of demo websites for automatic music generation research
      Updated Nov 15, 2023
    • This is a list of datasets consisting of speech, music, and sound effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications. It is mainly used for speech recognition, speech synthesis, singing voice synthesis, music information retrieval, music generation, etc.
      Updated Oct 31, 2023
    • Web interface for AI music generation models
      JavaScript
      Updated Oct 19, 2023
    • Code for the paper "A dataset and classification model for Malay, Hindi, Tamil and Chinese music"
      Jupyter Notebook
      Updated Oct 19, 2023
    • Fundamental Music Embedding, FME
      Python
      Updated Oct 16, 2023