A New Monocular 3D Object Detection with Neural Network

摘要：

　　Coherently recognizing objects and localizing its 3D position is vital in a large number of practical applications such as robot,autonomous unmanned systems,etc.To tackle this problem,we present a new framework composed of two coupled deep networks.The framework involves three tight steps: first,by implanting a pyramid strategy into the end-to-end detection network,the goal of detecting and recognizing objects with large or small scales is attained in 2D pictures; second,by a novel lost function we proposed for the depth networks,the 3D points of the objects are accurately estimated; and finally,we propose a windowed point cloud segmentation: combing 3D norm direction,curvature and color information,outlier 3D points are removed from windows,thus the clean object is extracted with its 3D position.The obvious advantage of the framework lies on that most the steps are implemented by the fused networks,and this makes the frame-work efficient,accurate and robust provided with enough training data.The framework is evaluated on the KITTI benchmark,and the efficiency is superior to the state-of-the-art methods while the accuracy is comparable.Our method is several tens of times faster than most of previous methods and capable of applications with real time constraint.In addition,we only use weak supervision learning methods in our framework that need less manual annotations.

关键词： 3D Object Detection Deep network Windowed point cloud segmentation

作者: Weijie Hong Yiguang Liu Yunan Zheng Ying Wang Xuelei Shi

作者单位: Vision and Image Processing Lab(VIPL),College of Computer Science,Sichuan University,Chengdu 610065,Peoples Republic of China

会议类型: 国际会议

会议名称: 中国模式识别与计算机视觉大会(PRCV2018)

会议地点: 广州

会议语种:英文

页码: 174-185

在线出版日期: 2018-11-23（万方平台首次上网日期，不代表论文的发表时间）

会议专题

A New Monocular 3D Object Detection with Neural Network