Improved point-voxel region convolutional neural network for small object detection

Zherui Xie; Masaaki Tsuzaki; Huimin Lu; Seiichi Serikawa

doi:10.1117/12.2657106

22 December 2022 Improved point-voxel region convolutional neural network for small object detection

Zherui Xie, Masaaki Tsuzaki, Huimin Lu, Seiichi Serikawa

Proceedings Volume 12508, International Symposium on Artificial Intelligence and Robotics 2022; 125080S (2022) https://doi.org/10.1117/12.2657106
Event: Seventh International Symposium on Artificial Intelligence and Robotics 2022, 2022, Shanghai, China

Abstract

With the widespread use of LiDAR sensors, 3D object detection through 3D point cloud data processing has become a research target in robotics and autonomous driving. However, the disorder and sparsity of point cloud data are the problems in traditional point cloud data processing. It is challenging to detect objects using a large amount of point cloud data. Conventional 3D object detectors have mainly grid-based methods and point-based methods. PV-RCNN proposed a framework that combines voxel-based and point-based techniques, and object features are extracted using 3D voxel CNNs. However, the resolution reduction caused by the CNN affects the localization of objects. This study aims to improve the detection accuracy of more minor things by feeding not only a single output of the voxel CNN but also multiple outputs, including high-resolution outputs, to the RPN. We came out with a new network that introduces the Multi-Scale Region Proposal Network to reduce the effect of resolution degradation. Our network has better recognition accuracy for small objects like bicycles than the original PV-RCNN. In extensive experiments, we demonstrate that our model has a 5% improvement for small things, such as cyclists training on the KITTI dataset.

Citation Download Citation

Zherui Xie, Masaaki Tsuzaki, Huimin Lu, and Seiichi Serikawa "Improved point-voxel region convolutional neural network for small object detection", Proc. SPIE 12508, International Symposium on Artificial Intelligence and Robotics 2022, 125080S (22 December 2022); https://doi.org/10.1117/12.2657106

ACCESS THE FULL ARTICLE

INSTITUTIONAL
Select your institution to access the SPIE Digital Library.

SELECT YOUR INSTITUTION

PERSONAL
Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.

PERSONAL SIGN IN

No SPIE Account? Create one

PURCHASE THIS CONTENT

SUBSCRIBE TO DIGITAL LIBRARY

50 downloads per 1-year subscription

Members: $195

Non-members: $335 ADD TO CART

25 downloads per 1 - year subscription

Members: $145

Non-members: $250 ADD TO CART

PURCHASE SINGLE ARTICLE

Includes PDF, HTML & Video, when available

Members: $17.00

Non-members: $21.00 ADD TO CART

PROCEEDINGS
16 PAGES

DOWNLOAD PAPER SAVE TO MY LIBRARY

GET CITATION

RIGHTS & PERMISSIONS

Get copyright permission Get copyright permission on Copyright Marketplace

KEYWORDS

Voxels

Point clouds

Object detection

Machine learning

Feature extraction

Education and training

3D acquisition

Show All Keywords

Keywords/Phrases

Search In:

Publication Years