-
Notifications
You must be signed in to change notification settings - Fork 5
Open
Description
Hello,
Thank you very much for this great work. I have few questions about the paper/code.
1- Have you tried training with Wavcaps or a larger dataset? From the wavcaps paper, it seems that using more data significantly improved the results
2- Are the results reported in the paper use the checkpoint with the highest validation score?
3- From the ablations, it seems that MCM does not contribute much to the results (Cider drops by only 0.02 points), I am wondering if you have performed any ablation on the audiocaps dataset, especially with regard to the main components (MCM, and CLAP)
Thank you very much for your help.
Metadata
Metadata
Assignees
Labels
No labels