Universiti Teknologi Malaysia Institutional Repository

Arabic script documents language identifications using fuzzy ART

Selamat, Ali and Ching, Ng Choon (2008) Arabic script documents language identifications using fuzzy ART. In: Proceedings - 2nd Asia International Conference on Modelling and Simulation, AMS 2008. IEEE, New York, 528-533. ISBN 978-076953136-6

Full text not available from this repository.

Official URL: http://dx.doi.org/10.1109/AMS.2008.47

Abstract

The volume of information available on the internet, intranets, digital libraries and newsgroups has increased dramatically in recent years. There is therefore growing interest in helping users find, filter, and manage these resources. Language identification, which determines the language a text document is written in, is the first step in understanding that document, and it is usually a module within a multilingual application. In this paper, we introduce language identification of Arabic script documents by letter frequency. The technique used for identification is fuzzy adaptive resonance theory (ART), which belongs to the family of neural network architectures that perform incremental unsupervised learning. Documents in Arabic-script languages such as Arabic, Persian and Urdu were used in the language identification experiments. The experiments show that fuzzy ART is particularly promising in terms of language identification accuracy.
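The approach named in the abstract, letter-frequency features clustered by fuzzy ART, can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the full text is not available here, so the function and class names, the parameter values (vigilance, choice, learning rate), and the toy inputs are all assumptions. It follows the standard fuzzy ART formulation with complement coding.

```python
from collections import Counter


def letter_frequencies(text, alphabet):
    """Relative frequency of each alphabet letter in text (values in [0, 1]).

    `alphabet` is a hypothetical, caller-supplied string of letters.
    """
    counts = Counter(ch for ch in text if ch in alphabet)
    total = sum(counts.values())
    if total == 0:
        return [0.0] * len(alphabet)
    return [counts[ch] / total for ch in alphabet]


class FuzzyART:
    """Minimal fuzzy ART clusterer (illustrative parameter defaults)."""

    def __init__(self, vigilance=0.75, choice=0.001, learning_rate=1.0):
        self.rho = vigilance        # minimum match required for resonance
        self.alpha = choice         # small constant in the choice function
        self.beta = learning_rate   # 1.0 corresponds to fast learning
        self.weights = []           # one weight vector per category

    @staticmethod
    def _complement_code(x):
        # Complement coding: append 1 - x so the input norm stays constant.
        return list(x) + [1.0 - v for v in x]

    def train_one(self, x):
        """Present one input in [0, 1]^M and return its category index."""
        i = self._complement_code(x)
        norm_i = sum(i)
        # Score every category with the choice function |I ^ w| / (alpha + |w|),
        # where ^ is the element-wise minimum (fuzzy AND).
        scored = []
        for j, w in enumerate(self.weights):
            match = sum(min(a, b) for a, b in zip(i, w))
            scored.append((match / (self.alpha + sum(w)), match, j))
        scored.sort(key=lambda t: -t[0])
        # Search categories in order of choice value until one passes vigilance.
        for _, match, j in scored:
            if match / norm_i >= self.rho:
                w = self.weights[j]
                self.weights[j] = [
                    self.beta * min(a, b) + (1.0 - self.beta) * b
                    for a, b in zip(i, w)
                ]
                return j
        # No category resonates: commit a new one initialised to the input.
        self.weights.append(i[:])
        return len(self.weights) - 1
```

In use, each document would be turned into a frequency vector with `letter_frequencies` and presented to `train_one`; documents with similar letter distributions tend to fall into the same category, and the vigilance value controls how fine-grained the categories are. Because the learning is unsupervised, mapping categories to language names would need a separate labelling step.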

Item Type: Book Section
Additional Information: ISBN: 978-076953136-6; 2nd Asia International Conference on Modelling and Simulation, AMS 2008; Kuala Lumpur; 13 May 2008 through 15 May 2008
Uncontrolled Keywords: adaptive neural networks, arabic script, language identification, letter frequency
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Computer Science and Information System (Formerly known)
ID Code: 12503
Deposited By: Liza Porijo
Deposited On: 06 Jun 2011 08:43
Last Modified: 06 Jun 2011 08:43
