new paper! 🗣️Sketch2Sound💥
Sketch2Sound can create sounds from sonic imitations (i.e., a vocal imitation or a reference sound) via interpretable, time-varying control signals.
paper: https://arxiv.org/abs/2412.08550
web: https://hugofloresgarcia.art/sketch2sound
Sketch2Sound can be implemented on top of any text-to-audio DiT, and it needs just 40k steps of fine-tuning plus a single linear layer per control!
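a rough sketch of what "a single linear layer per control" could look like, in plain Python so it runs anywhere. this is not the paper's code: the names `LinearControl` and `condition_latents`, the control names, the zero initialization, and the choice to add the projected controls to the latents frame by frame are all assumptions made for illustration.

```python
class LinearControl:
    """Hypothetical per-control projection: control features -> latent width."""

    def __init__(self, in_dim, latent_dim):
        # Assumption: start from zeros so the pretrained model's behavior
        # is unchanged before any fine-tuning has happened.
        self.weight = [[0.0] * in_dim for _ in range(latent_dim)]
        self.bias = [0.0] * latent_dim

    def __call__(self, frames):
        # frames: list of T frames, each a list of in_dim floats.
        return [
            [sum(w * x for w, x in zip(row, frame)) + b
             for row, b in zip(self.weight, self.bias)]
            for frame in frames
        ]


def condition_latents(latents, controls, layers):
    """Add each projected control signal to the latents, frame by frame."""
    out = [list(frame) for frame in latents]  # copy, leave the input intact
    for name, frames in controls.items():
        projected = layers[name](frames)
        if len(projected) != len(out):
            raise ValueError(f"control {name!r} has a different number of frames")
        for t, proj_frame in enumerate(projected):
            for d, value in enumerate(proj_frame):
                out[t][d] += value
    return out


# Toy data: 2 time frames, latent width 3, two scalar controls per frame.
latents = [[0.5, -0.2, 0.1], [0.3, 0.0, -0.4]]
controls = {
    "loudness": [[-12.0], [-9.0]],   # illustrative control name and values
    "centroid": [[0.4], [0.6]],      # illustrative control name and values
}
layers = {name: LinearControl(in_dim=1, latent_dim=3) for name in controls}
conditioned = condition_latents(latents, controls, layers)
```

in this sketch each control adds only `in_dim * latent_dim + latent_dim` parameters, and fine-tuning would be what moves the weights away from zero.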
in collaboration w/
@urinieto.bsky.social, @justinsalamon.bsky.social, #bryanpardo and the legendary @pseeth.bsky.social!
2. your vocal imitations are everything ❤️