CVPR 2023
Few-Shot Referring Relationships in Videos
Yogesh Kumar, Anand Mishra
Abstract
github.io/projects/refRelations/

[Figure: (a) Problem Setup, showing a support set (video-1 to video-4), a test video, and the output, with the example relationships <helicopter, fly above, train>, <plane, fly above, truck>, <bird, fly above, person>, <plane, fly above, plane>, and <plane, fly above, person>; (b) Predicate Distribution; (c) Accuracy Distribution. Axis labels: Predicate, Frequency, Accuracy. Example predicates shown: fall off, ride.]

... a belief propagation-based message passing on the random field to obtain the spatiotemporal localization or subject and object trajectories. We perform extensive experiments using two public benchmarks, namely ImageNet-VidVRD and VidOR, and compare the proposed approach with competitive baselines to assess its efficacy.
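To illustrate how message passing over a per-frame random field can yield a trajectory, the following is a minimal sketch and not the authors' implementation. It assumes a simplified chain structure in which each frame holds a few candidates (each candidate standing in for a hypothetical subject-object box pair), a per-candidate cost per frame, and a linking cost between candidates in consecutive frames. The function name `min_sum_chain`, the cost layout, and the toy numbers are all assumptions made for illustration; on a chain, min-sum message passing of this kind reduces to dynamic programming.

```python
def min_sum_chain(unary, pairwise):
    """Min-sum message passing over a chain of T frames (illustrative only).

    unary[t][k]       : assumed cost of picking candidate k in frame t
    pairwise[t][i][j] : assumed cost of linking candidate i in frame t
                        to candidate j in frame t + 1
    Returns (path, cost): one candidate index per frame and the total cost.
    """
    # msg[k] = lowest cost of any partial trajectory ending at candidate k
    msg = list(unary[0])
    backptrs = []
    for t in range(1, len(unary)):
        new_msg, bp = [], []
        for j in range(len(unary[t])):
            # Cost of reaching candidate j from each candidate i in frame t - 1
            costs = [msg[i] + pairwise[t - 1][i][j] for i in range(len(msg))]
            best_i = min(range(len(costs)), key=costs.__getitem__)
            bp.append(best_i)
            new_msg.append(costs[best_i] + unary[t][j])
        backptrs.append(bp)
        msg = new_msg

    # Pick the cheapest final candidate, then follow back-pointers
    k = min(range(len(msg)), key=msg.__getitem__)
    cost = msg[k]
    path = [k]
    for bp in reversed(backptrs):
        k = bp[k]
        path.append(k)
    path.reverse()
    return path, cost


# Toy example: 3 frames, 2 candidates per frame, switching candidates costs 10
unary = [[0, 5], [5, 0], [0, 5]]
switch_penalty = [[0, 10], [10, 0]]
path, cost = min_sum_chain(unary, [switch_penalty, switch_penalty])
```

In this toy setup the linking cost discourages jumping between candidates from frame to frame, so the selected path favors temporal consistency even when a single frame prefers a different candidate.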