Nai, Ruihua and Hassan, Hanita (2023) Application of multimodal speech recognition based on deep neural networks in interpretation teaching. In: 3rd International Conference on Artificial Intelligence, Virtual Reality, and Visualization, AIVRV 2023, 7 July 2023 - 9 July 2023, Chongqing, China.
Full text not available from this repository.
Official URL: http://dx.doi.org/10.1117/12.3011751
Abstract
In recent years, although speech recognition technology has been widely used, it also faces some problems. This paper studies multimodal speech recognition in interpreting based on deep neural network. Firstly, the deep learning method and its related theoretical basis are introduced. Then, the advantages of speech corpus denoising based on acoustic expert feature extraction and training algorithm, convolution decomposition method and interpretation element analysis are described. Finally, through the experimental verification, it is proved that the recognition system can effectively improve students’ interpretation efficiency and accuracy, and the accuracy rate is more than 93%.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Uncontrolled Keywords: | deep neural networks, interpretation teaching, multimodal speech recognition |
Subjects: | L Education > L Education (General) P Language and Literature > P Philology. Linguistics |
Divisions: | Language Academy |
ID Code: | 108403 |
Deposited By: | Yanti Mohd Shah |
Deposited On: | 28 Oct 2024 10:05 |
Last Modified: | 28 Oct 2024 10:05 |
Repository Staff Only: item control page