CVPR 2023

Few-Shot Referring Relationships in Videos

Yogesh Kumar, Anand Mishra

Abstract

github.io/projects/refRelations/

[Figure: (a) Problem Setup, showing a support set (video-1 to video-4), a test video, and the output, with example relationships <helicopter, fly above, train>, <plane, fly above, truck>, <bird, fly above, person>, <plane, fly above, plane>, and <plane, fly above, person>; (b) Predicate Distribution; (c) Accuracy Distribution. Axis labels: Frequency, Predicate, Accuracy. Predicate labels visible in the plots include "fall off" and "ride".]

[...] a belief propagation-based message passing on the random field to obtain the spatiotemporal localization or subject and object trajectories. We perform extensive experiments using two public benchmarks, namely ImageNet-VidVRD and VidOR, and compare the proposed approach with competitive baselines to assess its efficacy.
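As a rough illustration of the belief propagation-based message passing mentioned above, the following sketch runs min-sum message passing over a chain of frames, selecting one candidate box per frame so that the total cost is minimal. It is not the authors' implementation. The function name min_sum_chain, the chain structure (edges only between consecutive frames), the single-entity simplification (one chain rather than joint subject and object candidates), and all cost values are assumptions made for this example; in the paper the potentials are described as learned, whereas here they are hand-picked numbers.

```python
from typing import List, Tuple


def min_sum_chain(
    unary: List[List[float]],
    pairwise: List[List[List[float]]],
) -> Tuple[float, List[int]]:
    """Min-sum message passing on a chain of T frames.

    unary[t][k]       : hypothetical cost of picking candidate k in frame t.
    pairwise[t][i][j] : hypothetical cost of linking candidate i in frame t
                        to candidate j in frame t + 1.
    Returns the minimal total cost and the chosen candidate index per frame.
    """
    # msg[k] holds the cost of the best partial trajectory ending at
    # candidate k of the current frame.
    msg = list(unary[0])
    back_pointers: List[List[int]] = []

    for t in range(1, len(unary)):
        new_msg: List[float] = []
        pointers: List[int] = []
        for j, cost_j in enumerate(unary[t]):
            # Message into candidate j: minimize over candidates of frame t-1.
            incoming = [msg[i] + pairwise[t - 1][i][j] for i in range(len(msg))]
            best_i = min(range(len(incoming)), key=incoming.__getitem__)
            new_msg.append(incoming[best_i] + cost_j)
            pointers.append(best_i)
        msg = new_msg
        back_pointers.append(pointers)

    # Backtrack from the cheapest candidate in the last frame.
    last = min(range(len(msg)), key=msg.__getitem__)
    path = [last]
    for pointers in reversed(back_pointers):
        path.append(pointers[path[-1]])
    path.reverse()
    return msg[last], path


# Toy input: 3 frames, 2 candidate boxes per frame (made-up costs).
unary_costs = [[4.0, 1.0], [3.0, 1.0], [1.0, 6.0]]
# Staying on the same candidate index is free; switching costs 2.0.
switch = [[0.0, 2.0], [2.0, 0.0]]
pairwise_costs = [switch, switch]

total_cost, trajectory = min_sum_chain(unary_costs, pairwise_costs)
print(total_cost, trajectory)
```

On a chain, this forward pass followed by backtracking yields the exact minimizer; the returned index list plays the role of a trajectory, that is, one selected box per frame of the test video.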