ICLR2025
Cafe-Talk: Generating 3D Talking Face Animation with Multimodal Coarse- and Fine-grained Control
Hejia Chen, Haoxian Zhang, Shoulong Zhang, Xiaoqiang Liu, Sisi Zhuang, Yuan Zhang, Pengfei Wan, Di Zhang, Shuai Li
摘要
3 Zhongguancun Laboratory ♣ Equal contribution ♠ Intern at Kuaishou Technology ♡ Corresponding author "Angry" & "A senior actor" "Laugh" … how to gh! <mute> Ha! Ha! Something like this? laucoarse ctrl speech render video fine ctrl language portrait label Figure 1: Adding multimodal coarse-and fine-grained control enables more flexible animations: Scenario: A senior actor is arguing with the director about how to smile. Action: The actor responds with anger and concludes with a sudden sarcastic laugh.