We generate a soundtrack for a silent video, given a text prompt! For example, we can make a cat's meow sound like a lion's roar or a typewriter sound like a piano.

Paper: https://arxiv.org/abs/2411.17698
Webpage: https://ificl.github.io/MultiFoley/

Led by @czyang.bsky.social!

https://bsky.app/profile/czyang.bsky.social/post/3lbvklevtbk27
