Universiti Teknologi Malaysia Institutional Repository

Implementation of vocal tract length normalization for phoneme recognition on timit speech corpus

Wong, Jensen Jing Lung and Salam, Md. Sah and Mohd. Rahim, Mohd. Shafry and Ahmad, Abdul Manan (2011) Implementation of vocal tract length normalization for phoneme recognition on timit speech corpus. In: Proceedings Of 2011 International Conference On Information Communication And Management (Icicm 2011).

Full text not available from this repository.

Abstract

Inter-speaker variability, one of the problems faced in speech recognition system, has caused the performance degradation in recognizing varied speech spoken by different speakers. Vocal Tract Length Normalization (VTLN) method is known to improve the recognition performances by compensating the speech signal using specific warping factor. Experiments are conducted using TIMIT speech corpus and Hidden Markov Model Toolkit (HTK) together with the implementation of VTLN method in order to show improvement in speaker independent phoneme recognition. The results show better recognition performance using Bigram Language Model compared to Unigram Language Model, with Phoneme Error Rate (PER) 28.8% as the best recognition performance for Bigram and PER 38.09% for Unigram. The best warp factor used for normalization in this experiment is 1.40.

Item Type:Conference or Workshop Item (Paper)
Uncontrolled Keywords:phoneme recognition
Divisions:Computing
ID Code:45933
Deposited By: Haliza Zainal
Deposited On:10 Jun 2015 03:00
Last Modified:30 Aug 2017 05:16

Repository Staff Only: item control page