Universiti Teknologi Malaysia Institutional Repository

Assessing time-dependent performance of a feature selection method using correlation sharing t-statistics (corT) for heart failure data classification.

Ibrahim, Nurain and Kamarudin, Adina Najwa (2023) Assessing time-dependent performance of a feature selection method using correlation sharing t-statistics (corT) for heart failure data classification. In: 5th ISM International Statistical Conference 2021: Statistics in the Spotlight: Navigating the New Norm, ISM 2021, 17 August 2021 - 19 August 2021, Johor Bahru, Johor, Malaysia - Virtual, Online.

Full text not available from this repository.

Official URL: http://dx.doi.org/10.1063/5.0109918

Abstract

World Health Organization (WHO) has announced that cardiovascular disease is one of the leading causes of death with approximately 18 million people died per year and this specifically because of heart failure (HF). Many predictive models of HF were proposed using various feature selection methods, but none was found performing the appropriate external validation. The performance of the models in the literature was assessed considering disease status as fixed over time. However, disease status and biomarker values of an individual may change over time thus time-dependent accuracy measures is more appropriate. This paper aims to propose a predictive model of HF mortality using a feature selection method that accounting the relationship between variables (biomarkers or factors) and assess the performance of the model at different time points. A published dataset consists of 299 HF patients collected from Faisalabad Institute of Cardiology and the Allied Hospital in Faisalabad (Punjab, Pakistan), during April-December 2015 is used to illustrate our proposed model. The most important factors are determined using corT statistics on the 80% of the dataset, as training set, and the selected factors are used to fit a logistic regression model as the classifier on the remaining 20% as testing set. The performance of the model is then assessed by calculating the accuracy measures at several prediction times. The dynamic model performance may guide medical decision in collecting the best measurements along the follow-up time period.

Item Type:Conference or Workshop Item (Paper)
Uncontrolled Keywords:Classification; Data Mining; Feature Selection; Heart failure; Survival analysis; time-dependent AUC.
Subjects:T Technology > T Technology (General) > T58.5-58.64 Information technology
Divisions:Science
ID Code:107964
Deposited By: Muhamad Idham Sulong
Deposited On:16 Oct 2024 06:31
Last Modified:16 Oct 2024 06:31

Repository Staff Only: item control page