Universiti Teknologi Malaysia Institutional Repository

Dynamic extraction of initial behavior for evasive malware detection

Aboaoja, Faitouri A. and Zainal, Anazida and Ali, Abdullah Marish and Ghaleb, Fuad A. and Alsolami, Fawaz Jaber and Rassam, Murad A. (2023) Dynamic extraction of initial behavior for evasive malware detection. Mathematics, 11 (2). pp. 1-23. ISSN 2227-7390

[img] PDF
498kB

Official URL: http://dx.doi.org/10.3390/math11020416

Abstract

Recently, malware has become more abundant and complex as the Internet has become more widely used in daily services. Achieving satisfactory accuracy in malware detection is a challenging task since malicious software exhibit non-relevant features when they change the performed behaviors as a result of their awareness of the analysis environments. However, the existing solutions extract features from the entire collected data offered by malware during the run time. Accordingly, the actual malicious behaviors are hidden during the training, leading to a model trained using unrepresentative features. To this end, this study presents a feature extraction scheme based on the proposed dynamic initial evasion behaviors determination (DIEBD) technique to improve the performance of evasive malware detection. To effectively represent evasion behaviors, the collected behaviors are tracked by examining the entropy distributions of APIs-gram features using the box-whisker plot algorithm. A feature set suggested by the DIEBD-based feature extraction scheme is used to train machine learning algorithms to evaluate the proposed scheme. Our experiments’ outcomes on a dataset of benign and evasive malware samples show that the proposed scheme achieved an accuracy of 0.967, false positive rate of 0.040, and F1 of 0.975.

Item Type:Article
Uncontrolled Keywords:box-whisker plot algorithm, evasive malware, feature extraction methods, machine learning-based malware detection models, malware analysis approaches
Subjects:Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions:Computing
ID Code:105649
Deposited By: Yanti Mohd Shah
Deposited On:08 May 2024 06:05
Last Modified:08 May 2024 06:05

Repository Staff Only: item control page