Universiti Teknologi Malaysia Institutional Repository

Optimal feature selection using novel flamingo search algorithm for classification of COVID-19 patients from clinical text

Mahdi, Amir Yasseen and Yuhaniz, Siti Sophiayati (2023) Optimal feature selection using novel flamingo search algorithm for classification of COVID-19 patients from clinical text. Mathematical Biosciences and Engineering, 20 (3). pp. 5268-5297. ISSN 1547-1063

[img] PDF
829kB

Official URL: http://dx.doi.org/10.3934/mbe.2023244

Abstract

Though several AI-based models have been established for COVID-19 diagnosis, the machine-based diagnostic gap is still ongoing, making further efforts to combat this epidemic imperative. So, we tried to create a new feature selection (FS) method because of the persistent need for a reliable system to choose features and to develop a model to predict the COVID-19 virus from clinical texts. This study employs a newly developed methodology inspired by the flamingo’s behavior to find a near-ideal feature subset for accurate diagnosis of COVID-19 patients. The best features are selected using a two-stage. In the first stage, we implemented a term weighting technique, which that is RTF-C-IEF, to quantify the significance of the features extracted. The second stage involves using a newly developed feature selection approach called the improved binary flamingo search algorithm (IBFSA), which chooses the most important and relevant features for COVID-19 patients. The proposed multi-strategy improvement process is at the heart of this study to improve the search algorithm. The primary objective is to broaden the algorithm's capabilities by increasing diversity and support exploring the algorithm search space. Additionally, a binary mechanism was used to improve the performance of traditional FSA to make it appropriate for binary FS issues. Two datasets, totaling 3053 and 1446 cases, were used to evaluate the suggested model based on the Support Vector Machine (SVM) and other classifiers. The results showed that IBFSA has the best performance compared to numerous previous swarm algorithms. It was noted, that the number of feature subsets that were chosen was also drastically reduced by 88% and obtained the best global optimal features.

Item Type:Article
Uncontrolled Keywords:binary flamingo search algorithm, clinical text classification, COVID-19, feature selection, natural language processing
Subjects:Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions:Razak School of Engineering and Advanced Technology
ID Code:105648
Deposited By: Yanti Mohd Shah
Deposited On:08 May 2024 06:04
Last Modified:08 May 2024 06:04

Repository Staff Only: item control page