Paper
27 November 2019 Video action recognition based on improved 3D convolutional network and sparse representation classification
Author Affiliations +
Proceedings Volume 11321, 2019 International Conference on Image and Video Processing, and Artificial Intelligence; 1132115 (2019) https://doi.org/10.1117/12.2542195
Event: The Second International Conference on Image, Video Processing and Artifical Intelligence, 2019, Shanghai, China
Abstract
In view of the problem that the typical convolutional neural networks fail to model actions at their full temporal extent, a novel video action recognition algorithm, which is based on improved 3D Convolutional Network (iC3D) architecture with K-means keyframes extraction and sparse representation classification (SRC), is proposed in this study. During the feature extraction process, the K-means keyframes extraction is constrained to reduce redundant information generated by continuous video frames and increase the temporal acceptance region. Meanwhile, to improve the noise immunity, sparse coding and its reconstruction errors are used for classification. The proposed method has 96.5% recognition accuracy on the typical video action classification dataset UCF101 that outperforms other competing methods. In addition, we built a wild test dataset to verify the generalization performance of the proposed model.
© (2019) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Wang Liu, Qi Fu, Yuqiu Lu, Jinyu Sun, and Shiwei Ma "Video action recognition based on improved 3D convolutional network and sparse representation classification", Proc. SPIE 11321, 2019 International Conference on Image and Video Processing, and Artificial Intelligence, 1132115 (27 November 2019); https://doi.org/10.1117/12.2542195
Lens.org Logo
CITATIONS
Cited by 1 scholarly publication.
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Video

RGB color model

Feature extraction

Detection and tracking algorithms

3D modeling

Convolution

Data modeling

RELATED CONTENT


Back to Top