Universiti Teknologi Malaysia Institutional Repository

An analysis of fuzzy clustering algortihms for suggestion of supervisor and examiner of thesis title

Suhaimi, Azrina (2005) An analysis of fuzzy clustering algortihms for suggestion of supervisor and examiner of thesis title. Masters thesis, Universiti Teknologi Malaysia, Faculty of Computer Science and Information System.

[img] PDF (Full text)
Restricted to Repository staff only

1556Kb
[img] PDF
61Kb
[img] PDF
61Kb
[img] PDF
90Kb

Abstract

Document clustering has been investigated for use in a number of different areas of information retrieval. In this project, the use of Fuzzy clustering techniques for suggestion of supervisors and examiners of thesis in School of Postgraduate Studies at Faculty of Computer Science and Information Technology are studied. The aim of this project is to assist the administration in assigning supervisors and examiners to each post graduate student for their project. Preprocessing tasks for document clustering that are applied in this project are commonly used in the Information Retrieval field, which are stemming, stopword removal, and indexing. Document is represented using the Vector Space Model. The index terms are then clustered using Fuzzy clustering algorithms based on similarity. The selected algorithms for Fuzzy are Fuzzy C-means and Gustafson Kessel. The clustering results are evaluated in terms of classification accuracy to predict the thesis supervisor(s) or examiner(s). Experiments show that Fuzzy C-means gives better result compared to Gustafson Kessel. However, the performances of both techniques are not at the top level. Hence, these techniques are not suitable for use in suggestion of supervisors and examiners. Nevertheless, to get a better performance, a larger dataset, thorough experiments and detailed evaluation has to be carried out and this will take longer time

Item Type:Thesis (Masters)
Additional Information:Thesis (Master of Science (Computer Science)) - Universiti Teknologi Malaysia, 2005
Uncontrolled Keywords:Document clustering; Fuzzy clustering techniques; information retrieval field
Subjects:Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions:Computer Science and Information System (Formerly known)
ID Code:2709
Deposited By: Ms Zalinda Shuratman
Deposited On:24 May 2007 04:28
Last Modified:25 Jul 2012 02:30

Repository Staff Only: item control page