Universiti Teknologi Malaysia Institutional Repository

Application of multimodal speech recognition based on deep neural networks in interpretation teaching

Nai, Ruihua and Hassan, Hanita (2023) Application of multimodal speech recognition based on deep neural networks in interpretation teaching. In: 3rd International Conference on Artificial Intelligence, Virtual Reality, and Visualization, AIVRV 2023, 7 July 2023 - 9 July 2023, Chongqing, China.

Full text not available from this repository.

Official URL: http://dx.doi.org/10.1117/12.3011751

Abstract

In recent years, although speech recognition technology has been widely used, it also faces some problems. This paper studies multimodal speech recognition in interpreting based on deep neural network. Firstly, the deep learning method and its related theoretical basis are introduced. Then, the advantages of speech corpus denoising based on acoustic expert feature extraction and training algorithm, convolution decomposition method and interpretation element analysis are described. Finally, through the experimental verification, it is proved that the recognition system can effectively improve students’ interpretation efficiency and accuracy, and the accuracy rate is more than 93%.

Item Type:Conference or Workshop Item (Paper)
Uncontrolled Keywords:deep neural networks, interpretation teaching, multimodal speech recognition
Subjects:L Education > L Education (General)
P Language and Literature > P Philology. Linguistics
Divisions:Language Academy
ID Code:108403
Deposited By: Yanti Mohd Shah
Deposited On:28 Oct 2024 10:05
Last Modified:28 Oct 2024 10:05

Repository Staff Only: item control page