Universiti Teknologi Malaysia Institutional Repository

Design consideration of Malay text stemmer using structured approach

Kassim, Mohamad Nizam and Mat Jali, Shaiful Hisham and Maarof, Mohd Aizaini and Zainal, Anazida and Abdul Wahab, Amirudin (2020) Design consideration of Malay text stemmer using structured approach. In: 3rd International Conference on Smart Trends for Information Technology and Computer Communications, SmartCom 2019, 24 - 25 January 2019, Bangkok, Thailand.

Full text not available from this repository.

Official URL: http://dx.doi.org/10.1007/978-981-15-0077-0_43

Abstract

Word stemmer (or text stemmer) is used to remove bound morphemes from derived words so that various morphological variants are mapped into common base forms. It is usually used as one of the preprocessing tools in text classification, text mining, and information retrieval tasks. Therefore, the design of an effective text stemmer is crucial for ensuring text stemming process maps morphological variants into correct base forms. This paper investigates the design consideration of an effective text stemmer from the perspective of the Malay language. These design considerations are based on current challenges faced by previous researchers in performing text stemming against Malay texts. By adopting these considerations, an effective text stemmer is expected to address common stemming errors and also, expected to produce promising stemming accuracy.

Item Type:Conference or Workshop Item (Paper)
Uncontrolled Keywords:Text stemming, Word stemmer
Subjects:Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions:Computing
ID Code:92523
Deposited By: Widya Wahid
Deposited On:30 Sep 2021 15:12
Last Modified:30 Sep 2021 15:12

Repository Staff Only: item control page