Paper
31 July 2019 MBNet: multi-scale bilinear convolutional neural networks for fine-grained visual classification towards real-time tasks
Tingqiang Deng, Rui Li, Chunguo Li, Rutian Liao, Yang Liu, Zhe Yang, Luxi Yang
Author Affiliations +
Proceedings Volume 11198, Fourth International Workshop on Pattern Recognition; 1119806 (2019) https://doi.org/10.1117/12.2540365
Event: Fourth International Workshop on Pattern Recognition, 2019, Nanjing, China
Abstract
Fine-grained visual classification (FGVC) is difficult due to the under-utilization of low-level features. This paper proposes a real-time method MBNet based on multi-stream multi-scale cross bilinear CNN that contributes to solving the problem. First, each layer of the multi-stream CNN is extracted by basic network such as VGGNet and others, followed by calculating multi-stream cross bilinear vector and bottom bilinear vector of low and high level features respectively. The FGVC results are predicted after feature fusion, which solves the problem that small and low-level details in the original image are easily overlooked. In the widely used datasets Caltech-UCSD Birds, Stanford Cars and Aircraft, the proposed method shows that the accuracy is significantly improved compared to the existing methods, reaching to state of the art level of 88.51%, 94.73% and 92.41%. It also meets the requirements of real-time tasks.
© (2019) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Tingqiang Deng, Rui Li, Chunguo Li, Rutian Liao, Yang Liu, Zhe Yang, and Luxi Yang "MBNet: multi-scale bilinear convolutional neural networks for fine-grained visual classification towards real-time tasks", Proc. SPIE 11198, Fourth International Workshop on Pattern Recognition, 1119806 (31 July 2019); https://doi.org/10.1117/12.2540365
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Visualization

Data modeling

Convolutional neural networks

Computer vision technology

Feature extraction

Lithium

Detection and tracking algorithms

RELATED CONTENT

Model recommendation for pedestrian detection
Proceedings of SPIE (August 29 2016)
Suggestive modeling for machine vision
Proceedings of SPIE (November 01 1992)
Object Recognition by a Hopfield Neural Network
Proceedings of SPIE (March 01 1990)
Relevance feedback-based building recognition
Proceedings of SPIE (July 14 2010)

Back to Top