Background: While detecting the object directly from the 3D point cloud, the natural 3D patterns and invariance of 3D data are often obscure.
Objective: In this work, we aimed at studying the 3D object detection from discrete, disordered and sparse 3D point clouds.
Methods: The CNN comprises the frustum sequence module, 3D instance segmentation module SNET, 3D point cloud transformation module T-NET, and 3D boundary box estimation module ENET. The search space of the object is determined by the frustum sequence module. The instance segmentation of the point cloud is performed by the 3D instance segmentation module. The 3D coordinates of the object are confirmed by the transformation module and the 3D bounding box estimation module.
Results: Evaluated on KITTI benchmark dataset, our method outperforms state of the art by remarkable margins while having real-time capability.
Conclusion: We achieve real-time 3D object detection by proposing an improved Convolutional Neural Network (CNN) based on image-driven point clouds.
Keywords: Convolutional neural network, 3D object detection, convolutional neural network, image-driven, autonomous driving, feature fusion.