CVPR2023

Local 3D Editing via 3D Distillation of CLIP Knowledge

Junha Hyung, Sungwon Hwang, Daejin Kim, Hyunji Lee, Jaegul Choo

摘要

Figure 1 . Our local editing NeRF (LENeRF) enables users to edit specific areas of 3D assets based on textual prompts by estimating a 3D mask for tri-plane features. For instance, given an original 3D radiance field (a), users can define their desired area to edit (underlined text prompt, e.g., "eyes"). LENeRF then generates a 3D mask, which is employed for feature fusion, allowing for targeted modifications that adhere to the editing prompt (e.g., "blue eyes") (b). Additionally, as illustrated in (c), the 3D mask itself can be rendered and visualized for further analysis.