CVPR2025

InsTaG: Learning Personalized 3D Talking Head from Few-Second Video

Jiahe Li, Jiawei Zhang, Xiao Bai, Jin Zheng, Jun Zhou, Lin Gu

摘要

Figure 1. With only 5-second video data, InsTaG outperforms the state-of-the-arts [28, 52, 53] by delivering high-quality personalized lip synchronization and realistic rendering with the fastest adaptation, meanwhile attaining low memory overhead and real-time inference.