Universiti Teknologi Malaysia Institutional Repository

Support vector machine with dynamic shifting window for continuous speech recognition

Ahmad, Abdul Manan and Salam, Md. Sah and Samaon, Den Fairol (2006) Support vector machine with dynamic shifting window for continuous speech recognition. In: Proc. Postgraduate Annual Research Seminar 2006 (PARS 2006), 2006, UTM.

Official URL: http://comp.utm.my/pars/files/2013/04/SUPPORT-VECT...

Abstract

Support Vector Machines (SVMs) excel at classification owing to their discriminative nature. Automatic speech recognition (ASR), among many areas of pattern recognition, could well benefit from this property. The conventional method in continuous speech recognition (CSR) uses a frame-based statistical framework, namely the Hidden Markov Model (HMM). The frame-based approach promises accurate segmentation of sub-word units (e.g. phonemes, syllables), which in turn contributes to recognition accuracy. Despite these advantages, because each frame is very short, the system spends considerable time building up or recognizing an individual word from several frames. Although segment-based HMMs overcome this hindrance, they increase both the complexity of the training procedure and the computational load. Our solution is to extend the acoustic event incrementally so that it spans several frames, thus making the SVM dynamic. We refer to this procedure as SVM-DSW (dynamic shifting window). Recognition is determined by voting for the highest posterior probability score provided by the SVM for a word segment. Word segmentation is obtained as a by-product of SVM-DSW: when the score deteriorates, it signifies the start of a new word, which is how the segmentation module works. Based on preliminary results on a subset of 16 Malay sentences, we manage to outperform HTK’s HMM in terms of both recognition and segmentation.
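
The abstract gives no implementation details, so the following is only a minimal sketch of how a growing window with a "score deteriorates" stopping rule might be organized. Everything named here is an assumption made for illustration: the function dsw_segment, the one-frame growth step, the greedy stop at the first drop in score, and the toy_scorer with its TEMPLATES, which is a stand-in for an SVM posterior estimate and is not an SVM.

```python
from typing import Callable, List, Sequence, Tuple

Frame = Sequence[float]
# Hypothetical scorer interface: window of frames -> (best word, score in (0, 1]).
Scorer = Callable[[Sequence[Frame]], Tuple[str, float]]


def dsw_segment(
    frames: List[Frame], score_window: Scorer, min_frames: int = 1
) -> List[Tuple[int, int, str, float]]:
    """Greedy growing-window segmentation (illustrative assumption only).

    Returns (start, end, word, score) tuples with `end` exclusive.
    """
    if min_frames < 1:
        raise ValueError("min_frames must be at least 1")
    segments = []
    start, n = 0, len(frames)
    while start < n:
        end = min(start + min_frames, n)
        best_word, best_score = score_window(frames[start:end])
        best_end = end
        while end < n:
            end += 1  # extend the window by one frame
            word, score = score_window(frames[start:end])
            if score < best_score:
                break  # score deteriorated: assume a new word starts here
            best_word, best_score, best_end = word, score, end
        segments.append((start, best_end, best_word, best_score))
        start = best_end  # shift the window to the detected boundary
    return segments


# Toy stand-in for an SVM posterior: one 1-D template per word (made-up data).
TEMPLATES = {"satu": [0.0, 0.0, 0.0], "dua": [1.0, 1.0]}


def toy_scorer(window: Sequence[Frame]) -> Tuple[str, float]:
    best = ("", 0.0)
    for word, template in TEMPLATES.items():
        overlap = min(len(window), len(template))
        dist = sum(abs(window[i][0] - template[i]) for i in range(overlap))
        dist += abs(len(window) - len(template))  # penalize length mismatch
        score = 1.0 / (1.0 + dist)
        if score > best[1]:
            best = (word, score)
    return best


if __name__ == "__main__":
    demo = [[0.0], [0.0], [0.0], [1.0], [1.0]]
    print(dsw_segment(demo, toy_scorer))
```

On the five toy frames, the window grows until the score for the first template peaks, and the boundary is placed where the score first drops. Stopping at the first drop is the simplest possible rule and is sensitive to noisy scores; the original work may use a different criterion.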

Item Type: Conference or Workshop Item (Paper)
Uncontrolled Keywords: continuous speech recognition, support vector machines, dynamic shifting window
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
?? QA79 ??
Divisions: Computer Science and Information System (Formerly known)
ID Code: 25604
Deposited By: Liza Porijo
Deposited On: 20 May 2012 06:48
Last Modified: 10 Jun 2014 07:48
