MusicLDM2Melodiff / index.html

JanBabela

Update index.html

105058f verified 24 days ago

	<!doctype html>
	<html>
	<head>
	<meta charset="utf-8" />
	<meta name="viewport" content="width=device-width" />
	<title>Melodiff MusicLDM v2</title>
	<link rel="stylesheet" href="style.css" />
	</head>
	<body>
	<div class="card">
	<h1>Melodiff MusicLDM v2</h1>
	<p>This is the next version after <a href="https://huggingface.co/spaces/JanBabela/Riffusion-Melodiff-v1" target="_blank">Melodiff Riffusion v1</a>.</p>
	<p>Melodiff MusicLDM continues to explore the idea of using the audio-to-audio pipeline of Stable Diffusion audio models to create cover versions of songs.</p>
	<p><br>Melodiff MusicLDM uses the <a href="https://huggingface.co/ucsd-reach/musicldm" target="_blank">MusicLDM model</a> as the base model for audio generation.</p>
	<p>What was done and what is presented here: the base pipeline was deconstructed and reassembled to support audio-to-audio modifications.</p>
	<p>No model was trained or fine-tuned; only the base pipeline was modified.</p>
	<p><br>MusicLDM generates better-quality audio than the Riffusion model used in v1. It generates 10 s samples, compared to the 5 s samples of the previous model.</p>
	<p>Generation speed has also improved: previously it took about 8 s to generate a 5 s sample of mono audio. Now it takes about 8 s to generate a 10 s sample of stereo audio.</p>
	<p>Consistency has improved as well. Previously only about 30% of modified samples were good (or OK), and some experimenting with prompts and seeds was needed to find good sound quality.</p>
	<p>Now about 70% of modified samples are good (or OK).</p>
	<p>As before, longer modifications are possible by splitting the audio into samples, modifying each one, and concatenating them back together.</p>
	<p>The underlying MusicLDM model is two years old. It would be interesting to try newer models, which have notably better quality.</p>
	<p><br> Examples of music generated by modifying the underlying song: <br></p>
	<p>
	Bella Ciao, originally played by saxophone, modified to be played by electric guitar
	<audio controls>
	<source src="BellaElGuitar.wav" type="audio/wav">
	Your browser does not support the audio element.
	</audio>
	</p>
	<p>
	Bella Ciao, originally played by violin, modified to be played by piano
	<audio controls>
	<source src="BellaPiano.wav" type="audio/wav">
	Your browser does not support the audio element.
	</audio>
	</p>
	<p>
	Iko Iko, originally played by saxophone, modified to be played by violin
	<audio controls>
	<source src="IkoViolin.wav" type="audio/wav">
	Your browser does not support the audio element.
	</audio>
	</p>
	<p>
	When the Saints, originally played by saxophone, modified to be played by strings
	<audio controls>
	<source src="SaintsStrings.wav" type="audio/wav">
	Your browser does not support the audio element.
	</audio>
	</p>
	<p><br> Examples of an original sample alongside its modified versions: <br></p>
	<p>
	Saxophone solo, original
	<audio controls>
	<source src="MindscapeResampled.wav" type="audio/wav">
	Your browser does not support the audio element.
	</audio>
	</p>
	<p>
	Modified to be played by violin
	<audio controls>
	<source src="MindScapeViolin.wav" type="audio/wav">
	Your browser does not support the audio element.
	</audio>
	</p>
	<p>
	Modified to be played by electric guitar
	<audio controls>
	<source src="MindScapeElguitar.wav" type="audio/wav">
	Your browser does not support the audio element.
	</audio>
	</p>
	</div>
	</body>
	</html>