Do, Nguyet Quang and Selamat, Ali and Lim, Kok Cheng and Krejcar, Ondrej (2022) Malicious URL detection with distributed representation and deep learning. In: 21st International Conference on New Trends in Intelligent Software Methodologies, Tools and Techniques, SoMeT 2022, 20 - 22 September 2022, Kitakyushu, Japan.
Full text not available from this repository.
Official URL: http://dx.doi.org/10.3233/FAIA220248
Abstract
There exist numerous solutions to detect malicious URLs based on Natural Language Processing and machine learning technologies. However, there is a lack of comparative analysis among approaches using distributed representation and deep learning. To solve this problem, this paper performs a comparative study on phishing URL detection based on text embedding and deep learning algorithms. Specifically, character-level and word-level embedding were combined to learn the feature representations from the webpage URLs. In addition, three deep learning models, including Convolutional Neural Network (CNN), Bidirectional Gated Recurrent Unit (BiGRU), and Bidirectional Long Short-Term Memory (BiLSTM), were constructed for effective classification of phishing websites. Several experiments were conducted and various evaluation metrics were used to assess the performance of these deep learning models. The findings obtained from the experiments indicated that the combination of the character-level and word-level embedding approach produced better results than the individual text representation methods. Also, the CNN-based model outperformed the other two deep learning algorithms in terms of both detection accuracy and execution time.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Uncontrolled Keywords: | deep learning, distributed representation, Malicious URL, phishing detection |
Subjects: | Q Science > QA Mathematics > QA76 Computer software |
Divisions: | Malaysia-Japan International Institute of Technology |
ID Code: | 100544 |
Deposited By: | Widya Wahid |
Deposited On: | 17 Apr 2023 06:53 |
Last Modified: | 17 Apr 2023 06:53 |
Repository Staff Only: item control page