Universiti Teknologi Malaysia Institutional Repository

Recurrent neural network with backpropagation through time for speech recognition

Ahmad, A. M. and Ismail, S. and Samaon, D. F. (2004) Recurrent neural network with backpropagation through time for speech recognition. In: IEEE International Symposium on Communications and Information Technology, 2004. ISCIT 2004.

Abstract

Speech recognition and understanding have been studied for many years. Neural networks are well known as a technique for classifying nonlinear problems, and much research has applied them to speech recognition, including recognition of Arabic speech. Arabic poses a number of challenges for speech recognition. We propose a fully-connected hidden layer between the input and state nodes and the output, and we show that this hidden layer makes the learning of complex classification tasks more efficient. We also investigate the difference between LPCC (linear predictive cepstrum coefficients) and MFCC (Mel-frequency cepstral coefficients) in the feature extraction process. The aim of the study is to observe the differences among the 29 letters of the Arabic alphabet, from "alif" to "ya". The purpose of this research is to improve the knowledge and understanding of the Arabic alphabet and Arabic words using a fully-connected recurrent neural network (FCRNN) trained with the backpropagation through time (BPTT) learning algorithm. Speech from six speakers (a mixture of male and female), recorded in a quiet environment, is used for training.
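
The following is a minimal sketch of backpropagation through time for a small recurrent classifier, intended only to illustrate the kind of training the abstract refers to. It is not the authors' implementation. Everything beyond the abstract is an assumption made for illustration: the state nodes are taken to hold the previous hidden activation (an Elman-style layout), the hidden units use tanh, the output is a softmax read after the last frame, biases are omitted, and the layer sizes, random input frames (standing in for LPCC or MFCC vectors), and all function names are hypothetical.

```python
import math
import random

def matvec(W, v):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, v)) for row in W]

def init(rows, cols, rng):
    """Small random weight matrix (illustrative initialisation)."""
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]

def forward(params, xs):
    """Run the network over the frames xs; return all hidden states and class probabilities."""
    Wx, Wh, Wy = params
    hs = [[0.0] * len(Wh)]  # assumed: state nodes start at zero
    for x in xs:
        a = matvec(Wx, x)        # contribution of the current input frame
        r = matvec(Wh, hs[-1])   # contribution of the state nodes (previous hidden activation)
        hs.append([math.tanh(ai + ri) for ai, ri in zip(a, r)])
    logits = matvec(Wy, hs[-1])  # classify from the final hidden state
    m = max(logits)
    exps = [math.exp(l - m) for l in logits]
    total = sum(exps)
    return hs, [e / total for e in exps]

def loss(params, xs, target):
    """Cross-entropy loss for one labelled sequence."""
    _, probs = forward(params, xs)
    return -math.log(probs[target])

def bptt(params, xs, target):
    """Gradients of the loss with respect to Wx, Wh, Wy, accumulated backwards over all time steps."""
    Wx, Wh, Wy = params
    hs, probs = forward(params, xs)
    dlogits = probs[:]
    dlogits[target] -= 1.0  # softmax + cross-entropy gradient
    dWy = [[d * h for h in hs[-1]] for d in dlogits]
    dWx = [[0.0] * len(Wx[0]) for _ in Wx]
    dWh = [[0.0] * len(Wh[0]) for _ in Wh]
    # Gradient reaching the last hidden state: Wy^T * dlogits
    dh = [sum(Wy[k][j] * dlogits[k] for k in range(len(Wy))) for j in range(len(Wh))]
    for t in range(len(xs), 0, -1):
        # Back through tanh: d/dz tanh(z) = 1 - tanh(z)^2
        dz = [g * (1.0 - h * h) for g, h in zip(dh, hs[t])]
        for j, d in enumerate(dz):
            for i, x in enumerate(xs[t - 1]):
                dWx[j][i] += d * x
            for i, h in enumerate(hs[t - 1]):
                dWh[j][i] += d * h
        # Pass the gradient one step further back in time: Wh^T * dz
        dh = [sum(Wh[j][i] * dz[j] for j in range(len(Wh))) for i in range(len(Wh))]
    return dWx, dWh, dWy

# Toy setup (hypothetical sizes, random data in place of real speech features)
rng = random.Random(0)
n_in, n_hidden, n_classes = 3, 4, 2
params = (init(n_hidden, n_in, rng), init(n_hidden, n_hidden, rng), init(n_classes, n_hidden, rng))
xs = [[rng.uniform(-1, 1) for _ in range(n_in)] for _ in range(5)]
grads = bptt(params, xs, target=1)
```

A gradient-descent step would subtract a small multiple of each returned gradient from the matching weight matrix. The same recurrent weights are reused at every frame, which is why their gradients are summed across time steps in the backward loop.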

Item Type: Conference or Workshop Item (Paper)
Subjects: T Technology > TK Electrical engineering. Electronics. Nuclear engineering
Divisions: Computer Science and Information System (Formerly known)
ID Code: 2016
Deposited By: Dr Zaharuddin Mohamed
Deposited On: 06 Apr 2007 01:02
Last Modified: 01 Jun 2010 03:00