Paper
22 March 2019 Image recognition using multi-layer sparse feature extraction with ADMM
Tomoya Hirakawa, Kuntopng Wararatpanya, Yoshimistu Kuroki
Author Affiliations +
Proceedings Volume 11049, International Workshop on Advanced Image Technology (IWAIT) 2019; 110493W (2019) https://doi.org/10.1117/12.2521348
Event: 2019 Joint International Workshop on Advanced Image Technology (IWAIT) and International Forum on Medical Imaging in Asia (IFMIA), 2019, Singapore, Singapore
Abstract
Being motivated by the Saak (Subspace approximation with augmented kernels) transform, we propose an image recognition scheme using multi-layer sparse feature extraction with a convex solver ADMM (Alternating Direction Method of Multipliers). The Saak transform consists of a multi-layer PCA (Principal Component Analysis) and S/P (Sign-to-Position) conversion to avoid sign confusion. This paper adopts sparse representation instead of PCA and also compares the S/P conversion with the activation function ReLU (Rectified Linear Unit), which is realized by involving the projection mapping onto the non-negative set in convex formulas. The Saak transform uses PCA not only for feature extraction but also for dimension compression of feature vectors. We expect that our method does not need the dimension compression since sparse representation compresses features more than PCA. Experimental results on the MNIST and Fashion-MNIST dataset show that the proposed method is equivalent to the Saak transform in recognition accuracy, and that our method can make features more sparse and extract features that have high discriminant power locally.
© (2019) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Tomoya Hirakawa, Kuntopng Wararatpanya, and Yoshimistu Kuroki "Image recognition using multi-layer sparse feature extraction with ADMM", Proc. SPIE 11049, International Workshop on Advanced Image Technology (IWAIT) 2019, 110493W (22 March 2019); https://doi.org/10.1117/12.2521348
Lens.org Logo
CITATIONS
Cited by 1 scholarly publication.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Feature extraction

Principal component analysis

Optical spheres

Computer programming

Computer programming languages

Convolutional neural networks

Image classification

Back to Top