Universiti Teknologi Malaysia Institutional Repository

Random forest for gene selection and microarray data classification

Moorthy, Kohbalan and Mohammad, Mohd. Saberi (2012) Random forest for gene selection and microarray data classification. In: Communications in Computer and Information Science. Springer, Berlin, pp. 174-183. ISBN 978-364232825-1 (Print); 978-364232826-8 (Electronic)

Full text not available from this repository.

Official URL: http://dx.doi.org/10.1007/978-3-642-32826-8_18

Abstract

A random forest method has been selected to perform both gene selection and classification of the microarray data. The goal of this research is to develop and improve the random forest gene selection method. Hence, improved gene selection method using random forest has been proposed to obtain the smallest subset of genes as well as biggest subset of genes prior to classification. In this research, ten datasets that consists of different classes are used, which are Adenocarcinoma, Brain, Breast (Class 2 and 3), Colon, Leukemia, Lymphoma, NCI60, Prostate and Small Round Blue-Cell Tumor (SRBCT). Enhanced random forest gene selection has performed better in terms of selecting the smallest subset as well as biggest subset of informative genes through gene selection. Furthermore, the classification performed on the selected subset of genes using random forest has lead to lower prediction error rates compared to existing method and other similar available methods.

Item Type:Book Section
Additional Information:Indexed by Scopus
Uncontrolled Keywords:cancer classification, classification, gene expression data, gene selection, microarray data, random forest
Subjects:Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions:Computer Science and Information System
ID Code:35811
Deposited By: Fazli Masari
Deposited On:11 Nov 2013 09:42
Last Modified:06 Aug 2017 04:40

Repository Staff Only: item control page