Paper
1 August 2023 Object detection algorithm based on mixed dilated convolution pyramid
Tingjian Yu, Zemin Yuan, Tao Huang, Xiang Fu
Author Affiliations +
Proceedings Volume 12754, Third International Conference on Computer Vision and Pattern Analysis (ICCPA 2023); 127542F (2023) https://doi.org/10.1117/12.2684527
Event: 2023 3rd International Conference on Computer Vision and Pattern Analysis (ICCPA 2023), 2023, Hangzhou, China
Abstract
To address the problem that FPN in the one-stage object detection algorithm loses a large amount of semantic information after upsampling and downsampling fusion, leading to difficulties in detecting small objects in dense scenes. This paper proposes a new FPN structure, Residual dilated FPN (RdFPN), compared to the traditional FPN, which uses multiple Dilated Block instead of the standard convolution in the FPN horizontal structure to mix the receptive fields of different layers in the feature map and expand the object size range of the feature map, while improving the network's ability to extract features at different scales. Experimental results show that RdFPN improves the mAP of the one-stage object detection algorithms RetinaNet, FCOS, and GFL by 0.9%, 0.7%, and 0.8% on the MS COCO dataset, enough to improve the overall performance of the detector.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Tingjian Yu, Zemin Yuan, Tao Huang, and Xiang Fu "Object detection algorithm based on mixed dilated convolution pyramid", Proc. SPIE 12754, Third International Conference on Computer Vision and Pattern Analysis (ICCPA 2023), 127542F (1 August 2023); https://doi.org/10.1117/12.2684527
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Object detection

Convolution

Detection and tracking algorithms

Feature fusion

Image fusion

Deep learning

Image processing

Back to Top