Ahmad, Abdul Manan and Goh, Kia Eng and Mohamed Shaharoun, Awaluddin and Tan, Chiu Yeek and Jarni, Muhamad Hafiz (2004) An isolated speech endpoint detector using multiple speech features. In: Tencon 2004 - 2004 IEEE Region 10 Conference, Vols A-D, Proceedings - Analog And Digital Techniques in Electrical Engineering. TENCON-IEEE Region 10 Conference Proceedings, B . IEEE, USA, pp. 403-406. ISBN 0-7803-8560-8
|
PDF
1MB |
Abstract
Energy and zero crossing rate of the speech signal have been the two most widely used features for detecting the endpoints of an utterance. This paper proposed a new approach for locating the endpoint for isolated speech, which significantly improve the endpoint detector performance. The proposed algorithm relies on multiple speech features: root mean square energy (rmse), zero crossing rate (zcr) and cepstral coefficient (cepstrum) where the Euclidean distance measure is adopted to accurately detect the endpoint of an isolated utterance. This algorithm offers better performance than conventional algorithm which using energy only. The vocabulary for the experiment includes English digit from 1 to 9. These experimental results were conducted by 360 utterances from a male speaker. Experimental results show that the accuracy of the algorithm is quite acceptable.
Item Type: | Book Section |
---|---|
Additional Information: | ISBN: 0-7803-8560-8 IEEE Region 10 Conference on Analog and Digital Techniques in Electrical Engineering, 21-24 Nov. 2004, Chiang Mai, THAILAND. |
Uncontrolled Keywords: | algorithms, performance, speech, speech processing, euclidance distance, speech endpoint detector, speech features, vocabulary, speech recognition |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Divisions: | Computer Science and Information System |
ID Code: | 9804 |
Deposited By: | Zalinda Shuratman |
Deposited On: | 25 Mar 2010 02:45 |
Last Modified: | 07 Mar 2011 07:28 |
Repository Staff Only: item control page