Multi-channel and multi-scale mid-level image representation for scene classification

Jinfu Yang; Fei Yang; Guanghui Wang; Mingai Li

doi:10.1117/1.JEI.26.2.023018

11 April 2017 Multi-channel and multi-scale mid-level image representation for scene classification

Jinfu Yang, Fei Yang, Guanghui Wang, Mingai Li

Author Affiliations +

Journal of Electronic Imaging, Vol. 26, Issue 2, 023018 (April 2017). https://doi.org/10.1117/1.JEI.26.2.023018

Abstract

Convolutional neural network (CNN)-based approaches have received state-of-the-art results in scene classification. Features from the output of fully connected (FC) layers express one-dimensional semantic information but lose the detailed information of objects and the spatial information of scene categories. On the contrary, deep convolutional features have been proved to be more suitable for describing an object itself and the spatial relations among objects in an image. In addition, the feature map from each layer is max-pooled within local neighborhoods, which weakens the invariance of global consistency and is unfavorable to scenes with highly complicated variation. To cope with the above issues, an orderless multi-channel mid-level image representation on pre-trained CNN features is proposed to improve the classification performance. The mid-level image representation of two channels from the FC layer and the deep convolutional layer are integrated at multi-scale levels. A sum pooling approach is also employed to aggregate multi-scale mid-level image representation to highlight the importance of the descriptors beneficial for scene classification. Extensive experiments on SUN397 and MIT 67 indoor datasets demonstrate that the proposed method achieves promising classification performance.

Citation Download Citation

Jinfu Yang, Fei Yang, Guanghui Wang, and Mingai Li "Multi-channel and multi-scale mid-level image representation for scene classification," Journal of Electronic Imaging 26(2), 023018 (11 April 2017). https://doi.org/10.1117/1.JEI.26.2.023018

Received: 27 September 2016; Accepted: 20 March 2017; Published: 11 April 2017

ACCESS THE FULL ARTICLE

INSTITUTIONAL
Select your institution to access the SPIE Digital Library.

SELECT YOUR INSTITUTION

PERSONAL
Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.

PERSONAL SIGN IN

No SPIE Account? Create one

PURCHASE THIS CONTENT

SUBSCRIBE TO DIGITAL LIBRARY

50 downloads per 1-year subscription

Members: $195

Non-members: $335 ADD TO CART

25 downloads per 1 - year subscription

Members: $145

Non-members: $250 ADD TO CART

PURCHASE SINGLE ARTICLE

Includes PDF, HTML & Video, when available