Universiti Teknologi Malaysia Institutional Repository

A 3-level endpoint detection algorithm for isolated speech and frequency-based features

Goh, K. E. and Ahmad, A. M. (2004) A 3-level endpoint detection algorithm for isolated speech and frequency-based features. In: International conference on Control, Automation And system, 2004, The Shangri-La Hotel, Bangkok, Thailand.

[img]
Preview
PDF
1MB

Official URL: https://scienceon.kisti.re.kr/srch/selectPORSrchAr...

Abstract

This paper proposed a new approach for endpoint detection of isolated speech, which proves to significantly improve the endpoint detection performance. The proposed algorithm relies on the root mean square energy (rms energy), zero crossing rate and spectral characteristics of the speech signal where the Euclidean distance measure is adopted using cepstral coefficients to accurately detect the endpoint of isolated speech. The algorithm offers better performance than traditional energy-based algorithm. The vocabulary for the experiment includes English digit from one to nine. These experimental results were conducted by 360 utterances from a male speaker. Experimental results show that the accuracy of the algorithm is quite acceptable. Moreover, the computation overload of this algorithm is low since the cepstral coefficients parameters will be used in feature extraction later of speech recognition procedure.

Item Type:Conference or Workshop Item (Paper)
Uncontrolled Keywords:endpoint detection, speech recognition, rms energy
Subjects:Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions:Computing
ID Code:20757
Deposited By: Narimah Nawil
Deposited On:26 May 2014 09:05
Last Modified:28 Feb 2022 12:18

Repository Staff Only: item control page