CVPR2025

OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints

Mingjie Pan, Jiyao Zhang, Tianshu Wu, Yinghao Zhao, Wenlong Gao, Hao Dong

Abstract

3 AgiBot https://omnimanip.github.io "Press button with hammer" "Press green button" "Insert flower into vase" "Insert pen into holder" "Close drawer" "Recycle battery" "Pour tea into the cup" "Pick up cup on dish" "Open drawer" "Put the lid on the teapot" "Close lid of laptop" "Open bottle" Passive Active Pour tea into the cup. Figure 1 . We proposed OmniManip, an open-vocabulary manipulation method that bridges the gap between the high-level reasoning of vision-language models (VLM) and the low-level precision, featuring closed-loop capabilities in both planning and execution.