Universiti Teknologi Malaysia Institutional Repository

Fuzzy phoneme classification using multi-speaker vocal tract length normalization

Jing Lung, Jensen Wong and Salam, Md. Sah and Rehman, Amjad and Mohd. Rahim, Mohd. Shafry and Saba, Tanzila (2014) Fuzzy phoneme classification using multi-speaker vocal tract length normalization. IETE Technical Review, 31 (2). pp. 128-136. ISSN 0256-4602

Full text not available from this repository.

Official URL: http://dx.doi.org/10.1080/02564602.2014.892669

Abstract

The overall success of automatic speech recognition (ASR) depends on efficient phoneme recognition performance and quality of speech signal received in ASR. However, dissimilar inputs of speakers affect the overall recognition performance. One of the main problems that affect recognition performance is inter-speaker variability. Vocal tract length normalization (VTLN) is introduced to compensate inter-speaker variation on the speaker signal by applying speaker-specific warping of the frequency scale of a filter bank. Instead of measuring the performance on word level with speaker-specific warping, this research focuses on direct tackling at the phoneme level and applying VTLN on all speakers' speech signals to analyse the best setting for the highest recognition performance. This research seeks to compare each phoneme recognition results from warping factor between 0.74 and 1.54 with 0.02 increments on nine different ranges of frequency warping boundary. The warp factor and frequency warping range that provides the highest phoneme recognition performance is applied on word recognition. The results show an improved performance in phoneme recognition by 0.7% and spoken word recognition by 0.5% using warp factor of 1.40 on frequency range of 300-5000 Hz in comparison to baseline results.

Item Type:Article
Uncontrolled Keywords:fuzzy phoneme recognition, inter-speaker variability, multi-speaker frequency warping, vocal tract length normalization, warp factor
Subjects:Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions:Computing
ID Code:52950
Deposited By: Siti Nor Hashidah Zakaria
Deposited On:01 Feb 2016 03:52
Last Modified:19 Jul 2018 07:22

Repository Staff Only: item control page