Paper
22 December 2022 Improved point-voxel region convolutional neural network for small object detection
Author Affiliations +
Proceedings Volume 12508, International Symposium on Artificial Intelligence and Robotics 2022; 125080S (2022) https://doi.org/10.1117/12.2657106
Event: Seventh International Symposium on Artificial Intelligence and Robotics 2022, 2022, Shanghai, China
Abstract
With the widespread use of LiDAR sensors, 3D object detection through 3D point cloud data processing has become a research target in robotics and autonomous driving. However, the disorder and sparsity of point cloud data are the problems in traditional point cloud data processing. It is challenging to detect objects using a large amount of point cloud data. Conventional 3D object detectors have mainly grid-based methods and point-based methods. PV-RCNN proposed a framework that combines voxel-based and point-based techniques, and object features are extracted using 3D voxel CNNs. However, the resolution reduction caused by the CNN affects the localization of objects. This study aims to improve the detection accuracy of more minor things by feeding not only a single output of the voxel CNN but also multiple outputs, including high-resolution outputs, to the RPN. We came out with a new network that introduces the Multi-Scale Region Proposal Network to reduce the effect of resolution degradation. Our network has better recognition accuracy for small objects like bicycles than the original PV-RCNN. In extensive experiments, we demonstrate that our model has a 5% improvement for small things, such as cyclists training on the KITTI dataset.
© (2022) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Zherui Xie, Masaaki Tsuzaki, Huimin Lu, and Seiichi Serikawa "Improved point-voxel region convolutional neural network for small object detection", Proc. SPIE 12508, International Symposium on Artificial Intelligence and Robotics 2022, 125080S (22 December 2022); https://doi.org/10.1117/12.2657106
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Voxels

Point clouds

Object detection

Machine learning

Feature extraction

Education and training

3D acquisition

Back to Top