Hassan, H. and Ahmad, N. B. and Sallehuddin, R. (2021) An empirical study to improve multiclass classification using hybrid ensemble approach for students' performance prediction. In: 7th International Conference on Computational Science and Technology, ICCST 2020, 29 August 2020 - 30 August 2020, Pattaya, Thailand.
Full text not available from this repository.
Abstract
Improving machine learning algorithms has been the interest of data scientists and researchers for the past few years. Among the performance problems raised is the classification imbalance issues listed as the top ten. The present study makes comparison and analysis of 5 state-of-art classifiers, 5 ensembles classifiers and 10 resampling techniques for data imbalance. This is done via the used 4413 instances consisting of demographic, economic, and behavioural data from student information systems and e-learning, as well as engineering faculty for the first semester 2017/2018. The use of three sampling types was adapted for the analysis: oversampling, undersampling and hybrid. The experimental results prove to model students’ behaviour from e-learning data and bagging decision tree ensemble classifier produces the highest results. Lastly, a hybrid resampling technique, SMOTEENN consistently shows the top result compared to other resampling techniques.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Uncontrolled Keywords: | machine learning, multiclass classification, sampling |
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science |
Divisions: | Computing |
ID Code: | 95767 |
Deposited By: | Narimah Nawil |
Deposited On: | 31 May 2022 13:18 |
Last Modified: | 31 May 2022 13:18 |
Repository Staff Only: item control page