Paper
8 November 2023 Application of multimodal speech recognition based on deep neural networks in interpretation teaching
Ruihua Nai, Hanita Hassan
Author Affiliations +
Proceedings Volume 12923, Third International Conference on Artificial Intelligence, Virtual Reality, and Visualization (AIVRV 2023); 129230N (2023) https://doi.org/10.1117/12.3011751
Event: 3rd International Conference on Artificial Intelligence, Virtual Reality and Visualization (AIVRV 2023), 2023, Chongqing, China
Abstract
In recent years, although speech recognition technology has been widely used, it also faces some problems. This paper studies multimodal speech recognition in interpreting based on deep neural network. Firstly, the deep learning method and its related theoretical basis are introduced. Then, the advantages of speech corpus denoising based on acoustic expert feature extraction and training algorithm, convolution decomposition method and interpretation element analysis are described. Finally, through the experimental verification, it is proved that the recognition system can effectively improve students’ interpretation efficiency and accuracy, and the accuracy rate is more than 93%.
(2023) Published by SPIE. Downloading of the abstract is permitted for personal use only.
Ruihua Nai and Hanita Hassan "Application of multimodal speech recognition based on deep neural networks in interpretation teaching", Proc. SPIE 12923, Third International Conference on Artificial Intelligence, Virtual Reality, and Visualization (AIVRV 2023), 129230N (8 November 2023); https://doi.org/10.1117/12.3011751
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Speech recognition

Education and training

Neural networks

Artificial neural networks

Data modeling

Statistical modeling

Back to Top