CVPR2025

Video-Guided Foley Sound Generation with Multimodal Controls

Ziyang Chen, Prem Seetharaman, Bryan C. Russell, Oriol Nieto, David Bourgin, Andrew Owens, Justin Salamon

摘要

2 Adobe Research https://ificl.github.io/MultiFoley/ (a) Foley with text control (b) Foley with audio control (c) Foley audio extension + "lion roaring" -"cat meowing"