Application of multimodal speech recognition based on deep neural networks in interpretation teaching

Ruihua Nai; Hanita Hassan

doi:10.1117/12.3011751

8 November 2023 Application of multimodal speech recognition based on deep neural networks in interpretation teaching

Ruihua Nai, Hanita Hassan

Proceedings Volume 12923, Third International Conference on Artificial Intelligence, Virtual Reality, and Visualization (AIVRV 2023); 129230N (2023) https://doi.org/10.1117/12.3011751
Event: 3rd International Conference on Artificial Intelligence, Virtual Reality and Visualization (AIVRV 2023), 2023, Chongqing, China

Abstract

In recent years, although speech recognition technology has been widely used, it also faces some problems. This paper studies multimodal speech recognition in interpreting based on deep neural network. Firstly, the deep learning method and its related theoretical basis are introduced. Then, the advantages of speech corpus denoising based on acoustic expert feature extraction and training algorithm, convolution decomposition method and interpretation element analysis are described. Finally, through the experimental verification, it is proved that the recognition system can effectively improve students’ interpretation efficiency and accuracy, and the accuracy rate is more than 93%.

(2023) Published by SPIE. Downloading of the abstract is permitted for personal use only.

Citation Download Citation

Ruihua Nai and Hanita Hassan "Application of multimodal speech recognition based on deep neural networks in interpretation teaching", Proc. SPIE 12923, Third International Conference on Artificial Intelligence, Virtual Reality, and Visualization (AIVRV 2023), 129230N (8 November 2023); https://doi.org/10.1117/12.3011751

ACCESS THE FULL ARTICLE

INSTITUTIONAL
Select your institution to access the SPIE Digital Library.

SELECT YOUR INSTITUTION

PERSONAL
Sign in with your SPIE account to access your personal subscriptions or to use specific features such as save to my library, sign up for alerts, save searches, etc.

PERSONAL SIGN IN

No SPIE Account? Create one

PURCHASE THIS CONTENT

SUBSCRIBE TO DIGITAL LIBRARY

50 downloads per 1-year subscription

Members: $195

Non-members: $335 ADD TO CART

25 downloads per 1 - year subscription

Members: $145

Non-members: $250 ADD TO CART

PURCHASE SINGLE ARTICLE

Includes PDF, HTML & Video, when available

Members: $17.00

Non-members: $21.00 ADD TO CART

PROCEEDINGS
7 PAGES

DOWNLOAD PAPER SAVE TO MY LIBRARY

GET CITATION

RIGHTS & PERMISSIONS

Get copyright permission Get copyright permission on Copyright Marketplace

KEYWORDS

Speech recognition

Education and training

Neural networks

Artificial neural networks

Data modeling

Statistical modeling

RELATED CONTENT

The performance and error analysis of LSTM model combined with...
Proceedings of SPIE (March 08 2023)

Research on image classification algorithm of environmental art design effect...
Proceedings of SPIE (August 10 2023)

Construction of electric emergency materials storage system based on BP...
Proceedings of SPIE (June 16 2023)

Research on hard points height prediction method based on BP...
Proceedings of SPIE (August 10 2023)

Short term wind power prediction based on optimized BP neural...
Proceedings of SPIE (September 25 2023)

Research on genetic algorithm back propagation neural network photovoltaic daily...
Proceedings of SPIE (December 07 2023)

A method of flavors and fragrances identification based on hybrid...
Proceedings of SPIE (April 08 2024)

Subscribe to Digital Library

Receive Erratum Email Alert

Keywords/Phrases

Search In:

Publication Years