Selamat, Ali and Ching, Ng Choon (2008) Arabic script documents language identifications using fuzzy ART. In: Proceedings - 2nd Asia International Conference on Modelling and Simulation, AMS 2008. IEEE, New York, pp. 528-533. ISBN 978-076953136-6
Full text not available from this repository.
Official URL: http://dx.doi.org/10.1109/AMS.2008.47
The volume of information available on the internet, intranets, digital libraries and newsgroups has increased dramatically in recent years. There is therefore growing interest in helping users find, filter, and manage these resources. Language identification, that is, determining the language in which a text document is written, is the first step in understanding the document. It is usually a module within a multilingual application. In this paper, we introduce language identification of Arabic script documents by letter frequency. The technique used for identification is fuzzy adaptive resonance theory (ART), which belongs to the family of neural network architectures that perform incremental unsupervised learning. Arabic script documents in languages such as Arabic, Persian and Urdu were used in the language identification experiments. From the experiments, we found that fuzzy ART is particularly promising in terms of language identification accuracy.
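The approach described in the abstract (letter-frequency features clustered by fuzzy ART) can be illustrated with a minimal sketch. This is not the authors' implementation: the helper `letter_frequencies`, the class `FuzzyART`, and the parameter values `rho`, `alpha` and `beta` are assumptions chosen for illustration, following the standard fuzzy ART formulation with complement coding. Because learning is unsupervised, the returned category indices are cluster ids; mapping them to language names would be a separate step not shown here.

```python
from collections import Counter


def letter_frequencies(text, alphabet):
    """Relative frequency of each letter of `alphabet` in `text` (values in [0, 1])."""
    counts = Counter(ch for ch in text if ch in alphabet)
    total = sum(counts.values())
    if total == 0:
        return [0.0] * len(alphabet)
    return [counts[ch] / total for ch in alphabet]


class FuzzyART:
    """Minimal fuzzy ART clusterer (illustrative; parameter values are assumptions)."""

    def __init__(self, rho=0.75, alpha=0.001, beta=1.0):
        self.rho = rho      # vigilance: higher values create more, tighter categories
        self.alpha = alpha  # choice parameter
        self.beta = beta    # learning rate (1.0 = fast learning)
        self.weights = []   # one weight vector per learned category

    @staticmethod
    def _complement_code(a):
        # Complement coding: I = (a, 1 - a), so |I| is constant for every input.
        return list(a) + [1.0 - x for x in a]

    def train(self, a):
        """Present one feature vector; return the index of the category it joins."""
        i = self._complement_code(a)
        norm_i = sum(i)
        scored = []
        for j, w in enumerate(self.weights):
            match = [min(x, y) for x, y in zip(i, w)]  # fuzzy AND of input and weights
            choice = sum(match) / (self.alpha + sum(w))
            scored.append((choice, j, match))
        # Try categories in order of decreasing choice value.
        for _, j, match in sorted(scored, key=lambda s: -s[0]):
            if sum(match) / norm_i >= self.rho:  # vigilance test passed: resonance
                w = self.weights[j]
                self.weights[j] = [
                    self.beta * m + (1.0 - self.beta) * x for m, x in zip(match, w)
                ]
                return j
        # No existing category is close enough: commit a new one.
        self.weights.append(i)
        return len(self.weights) - 1


# Toy two-dimensional "frequency" vectors standing in for real documents.
net = FuzzyART(rho=0.75)
labels = [net.train(v) for v in ([0.9, 0.1], [0.88, 0.12], [0.1, 0.9])]
print(labels)  # [0, 0, 1]
```

In practice, each document would first be converted with `letter_frequencies(text, alphabet)`, where `alphabet` is a string of the Arabic script letters of interest, and the resulting vector passed to `train`. The vigilance value controls how many categories emerge and would need tuning on real data.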
|Item Type:||Book Section|
|Additional Information:||ISBN: 978-076953136-6; 2nd Asia International Conference on Modelling and Simulation, AMS 2008; Kuala Lumpur; 13 May 2008 through 15 May 2008|
|Uncontrolled Keywords:||adaptive neural networks, arabic script, language identification, letter frequency|
|Subjects:||Q Science > QA Mathematics > QA75 Electronic computers. Computer science|
|Divisions:||Computer Science and Information System|
|Deposited By:||Liza Porijo|
|Deposited On:||06 Jun 2011 08:43|
|Last Modified:||06 Jun 2011 08:43|