Universiti Teknologi Malaysia Institutional Repository

Modelling kernel methods for unsupervised learning of micro array data

Md. Sap, Mohd. Noor (2008) Modelling kernel methods for unsupervised learning of micro array data. Project Report. Faculty of Computer Science and Information System, Skudai, Johor. (Unpublished)

Abstract

Unsupervised learning, mostly represented by data clustering methods, is an important machine learning technique. Cluster analysis has been extensively applied to extract information from microarray gene expression data. However, finding good-quality clusters in gene expression data is challenging because of its peculiar characteristics, such as nonlinear separability, outliers, high dimensionality, and diverse structures. Therefore, this study aims to combine kernel methods, which can both handle high dimensionality and discover nonlinear relationships in the data, with the approximate reasoning offered by the fuzzy approach. To this end, a robust Weighted Kernel Fuzzy C-Means incorporating local approximation (WKFCM) is presented. In WKFCM, the fuzzy membership of each object is approximated from the memberships of its neighbouring objects. This brings together the strengths of partitioning and density-based clustering approaches and provides a substantial improvement in the analysis of the data using unsupervised learning. Comparative analysis with K-means, hierarchical clustering, fuzzy C-means and fuzzy self-organizing maps showed that, although different types of datasets are better partitioned by different algorithms, WKFCM displays the best overall performance, can capture nonlinear relationships and non-globular clusters, and can identify cluster outliers.
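
The idea of approximating each object's membership from its neighbours can be illustrated with a minimal sketch. This is not the WKFCM algorithm of the report, whose weighting scheme is not given in the abstract. It is a standard Gaussian-kernel fuzzy C-means with an added smoothing step, and everything specific to it is an assumption made for illustration: the function name kernel_fcm_local, the blend factor alpha, the neighbour count n_neighbors, the kernel width sigma, and the farthest-first initialisation.

```python
import math


def sq_dist(x, y):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(x, y))


def gaussian_kernel(x, y, sigma):
    """K(x, y) = exp(-||x - y||^2 / (2 * sigma^2))."""
    return math.exp(-sq_dist(x, y) / (2.0 * sigma ** 2))


def farthest_first(points, n_clusters):
    """Deterministic initial centres (an assumption, not from the report)."""
    centers = [list(points[0])]
    while len(centers) < n_clusters:
        nxt = max(points, key=lambda p: min(sq_dist(p, c) for c in centers))
        centers.append(list(nxt))
    return centers


def kernel_fcm_local(points, n_clusters, m=2.0, sigma=2.0,
                     n_neighbors=2, alpha=0.5, n_iter=30):
    """Kernel fuzzy C-means with neighbour-averaged memberships (sketch)."""
    n, dim = len(points), len(points[0])
    centers = farthest_first(points, n_clusters)
    # Indices of each point's nearest neighbours in input space.
    neighbors = [
        sorted((j for j in range(n) if j != i),
               key=lambda j: sq_dist(points[i], points[j]))[:n_neighbors]
        for i in range(n)
    ]
    memberships = []
    for _ in range(n_iter):
        # Membership update. For a Gaussian kernel the feature-space
        # distance is 2 * (1 - K(x, v)); the factor 2 cancels on normalising.
        raw = []
        for p in points:
            dists = [max(1.0 - gaussian_kernel(p, c, sigma), 1e-12)
                     for c in centers]
            w = [d ** (-1.0 / (m - 1.0)) for d in dists]
            total = sum(w)
            raw.append([v / total for v in w])
        # Local approximation (assumed form): blend each membership row
        # with the mean row of the point's neighbours. Rows still sum to 1.
        memberships = [
            [(1.0 - alpha) * raw[i][c]
             + alpha * sum(raw[j][c] for j in neighbors[i]) / len(neighbors[i])
             for c in range(n_clusters)]
            for i in range(n)
        ]
        # Centre update: kernel-weighted mean of the points.
        for c in range(n_clusters):
            weights = [memberships[i][c] ** m
                       * gaussian_kernel(points[i], centers[c], sigma)
                       for i in range(n)]
            total = sum(weights)
            if total > 0.0:
                centers[c] = [sum(w * p[d] for w, p in zip(weights, points)) / total
                              for d in range(dim)]
    return memberships, centers
```

Calling kernel_fcm_local(points, 2) on a list of coordinate tuples returns one membership row per point and the final centres; a hard label can be read off as the index of the largest entry in each row. Setting alpha to 0 recovers plain kernel fuzzy C-means, which makes the effect of the neighbour averaging easy to compare.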

Item Type: Monograph (Project Report)
Uncontrolled Keywords: Clustering; Kernel methods; Pattern recognition; microarray data analysis; gene expression data; Fuzzy C-means clustering (FCM)
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: Computer Science and Information System
ID Code: 5818
Deposited By: Noor Aklima Harun
Deposited On: 02 Jul 2008 06:37
Last Modified: 10 Aug 2017 01:37
