Universiti Teknologi Malaysia Institutional Repository

Feature selection of high dimensional data using Hybrid FSA-IG

Mohd. Rosely, Nur Fatin Liyana and Mohd. Zain, Azlan and Yusoff, Yusliza (2020) Feature selection of high dimensional data using Hybrid FSA-IG. In: 2nd Joint Conference on Green Engineering Technology and Applied Computing 2020, IConGETech 2020 and International Conference on Applied Computing 2020, ICAC 2020, 4 February 2020 - 5 February 2020, Bangkok, Thailand.

[img]
Preview
PDF
467kB

Official URL: http://dx.doi.org/10.1088/1757-899X/864/1/012066

Abstract

Feature selection (FS) is a process of selecting a subset of relevant features depends on the specific target variables especially when dealing with high dimensional dataset. The aim of this paper is to investigate the performance comparison of different feature selection techniques on high dimensional datasets. The techniques used are filter, wrapper and hybrid. Information gain (IG) represents the filter, Fish Swarm Algorithm (FSA) represents metaheuristics wrapper and Hybrid FSA-IG represents the hybrid technique. Five datasets with different number of features are used in these techniques. The dataset used are breast cancer, lung cancer, ovarian cancer, mixed-lineage leukaemia (MLL) and small round blue cell tumors (SRBCT). The result shown Hybrid FSA-IG managed to select least feature that represent significant feature for every dataset with improved performance of accuracy from 4.868% to 33.402% and 1.706% to 25.154% compared to IG and FSA respectively.

Item Type:Conference or Workshop Item (Paper)
Uncontrolled Keywords:Hybrid FSA-IG, high dimensional datasets, FS
Subjects:Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions:Computing
ID Code:92504
Deposited By: Yanti Mohd Shah
Deposited On:30 Sep 2021 15:14
Last Modified:30 Sep 2021 15:14

Repository Staff Only: item control page