Music understanding amaai-lab/SonicVerse Audio-Text-to-Text • Updated Jun 19 • 409 • 13 SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning Paper • 2506.15154 • Published Jun 18 • 8
SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning Paper • 2506.15154 • Published Jun 18 • 8
Midi Tools and datasets for generative MIDI models. Incl. MidiCaps dataset and the Text2midi model. amaai-lab/MidiCaps Viewer • Updated Mar 15 • 168k • 2.66k • 42 MidiCaps -- A large-scale MIDI dataset with text captions Paper • 2406.02255 • Published Jun 4, 2024 • 1 amaai-lab/text2midi Updated Jan 10 • 14 Text2midi: Generating Symbolic Music from Captions Paper • 2412.16526 • Published Dec 21, 2024 • 3
MidiCaps -- A large-scale MIDI dataset with text captions Paper • 2406.02255 • Published Jun 4, 2024 • 1
Text-to-Music Mustango model, MusicBench Dataset,... amaai-lab/JamendoMaxCaps Viewer • Updated Jun 17 • 344k • 3.54k • 19 declare-lab/mustango Text-to-Audio • Updated Dec 17, 2023 • 177 • 40 declare-lab/mustango-pretrained Text-to-Audio • Updated Dec 17, 2023 • 7 • 11 amaai-lab/MusicBench Viewer • Updated Mar 20 • 53.6k • 3.26k • 51
Speech Tools, models and datasets related to generative speech models amaai-lab/DisfluencySpeech Viewer • Updated Jun 27, 2024 • 5k • 268 • 15
Music understanding amaai-lab/SonicVerse Audio-Text-to-Text • Updated Jun 19 • 409 • 13 SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning Paper • 2506.15154 • Published Jun 18 • 8
SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning Paper • 2506.15154 • Published Jun 18 • 8
Text-to-Music Mustango model, MusicBench Dataset,... amaai-lab/JamendoMaxCaps Viewer • Updated Jun 17 • 344k • 3.54k • 19 declare-lab/mustango Text-to-Audio • Updated Dec 17, 2023 • 177 • 40 declare-lab/mustango-pretrained Text-to-Audio • Updated Dec 17, 2023 • 7 • 11 amaai-lab/MusicBench Viewer • Updated Mar 20 • 53.6k • 3.26k • 51
Midi Tools and datasets for generative MIDI models. Incl. MidiCaps dataset and the Text2midi model. amaai-lab/MidiCaps Viewer • Updated Mar 15 • 168k • 2.66k • 42 MidiCaps -- A large-scale MIDI dataset with text captions Paper • 2406.02255 • Published Jun 4, 2024 • 1 amaai-lab/text2midi Updated Jan 10 • 14 Text2midi: Generating Symbolic Music from Captions Paper • 2412.16526 • Published Dec 21, 2024 • 3
MidiCaps -- A large-scale MIDI dataset with text captions Paper • 2406.02255 • Published Jun 4, 2024 • 1
Speech Tools, models and datasets related to generative speech models amaai-lab/DisfluencySpeech Viewer • Updated Jun 27, 2024 • 5k • 268 • 15