CVPR2020

What You See is What You Get: Exploiting Visibility for 3D Object Detection

Peiyun Hu, Jason Ziglar, David Held, Deva Ramanan

摘要

What is a good representation for 3D sensor data? We visualize a bird's-eye-view LiDAR scene and highlight two regions that may contain an object. Many contemporary deep networks process 3D point clouds, making it hard to distinguish the two regions (left). But depth sensors provide more than 3D points -they provide estimates of freespace in between the sensor and the measured 3D point. We visualize freespace by raycasting (right), where green is free and white is unknown. In this paper, we introduce deep 3D networks that leverage freespace to significantly improve 3D object detection accuracy.