Buntoro, Ghulam Asrofi and Arifin, Rizal and Syaifuddiin, Gus Nanang and Selamat, Ali and Krejcar, Ondrej and Fujita, Hamido (2021) Implementation of a machine learning algorithm for sentiment analysis of Indonesia‘s 2019 Presidential election. IIUM Engineering Journal, 22 (1). pp. 78-92. ISSN 1511-788X
|
PDF
1MB |
Official URL: http://dx.doi.org/10.31436/IIUMEJ.V22I1.1532
Abstract
In 2019, citizens of Indonesia participated in the democratic process of electing a new president, vice president, and various legislative candidates for the country. The 2019 Indonesian presidential election was very tense in terms of the candidates' campaigns in cyberspace, especially on social media sites such as Facebook, Twitter, Instagram, Google+, Tumblr, LinkedIn, etc. The Indonesian people used social media platforms to express their positive, neutral, and also negative opinions on the respective presidential candidates. The campaigning of respective social media users on their choice of candidates for regents, governors, and legislative positions up to presidential candidates was conducted via the Internet and online media. Therefore, the aim of this paper is to conduct sentiment analysis on the candidates in the 2019 Indonesia presidential election based on Twitter datasets. The study used datasets on the opinions expressed by the Indonesian people available on Twitter with the hashtags (#) containing “Jokowi and Prabowo.” We conducted data pre-processing using a selection of comments, data cleansing, text parsing, sentence normalization and tokenization based on the given text in the Indonesian language, determination of class attributes, and, finally, we classified the Twitter posts with the hashtags (#) using Naïve Bayes Classifier (NBC) and a Support Vector Machine (SVM) to achieve an optimal and maximum optimization accuracy. The study provides benefits in terms of helping the community to research opinions on Twitter that contain positive, neutral, or negative sentiments. Sentiment Analysis on the candidates in the 2019 Indonesian presidential election on Twitter using non-conventional processes resulted in cost, time, and effort savings. This research proved that the combination of the SVM machine learning algorithm and alphabetic tokenization produced the highest accuracy value of 79.02%. While the lowest accuracy value in this study was obtained with a combination of the NBC machine learning algorithm and N-gram tokenization with an accuracy value of 44.94%.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Indonesia, naïve Bayes classifier, president, sentiment analysis, support vector machine |
Subjects: | T Technology > T Technology (General) |
Divisions: | Malaysia-Japan International Institute of Technology |
ID Code: | 97858 |
Deposited By: | Widya Wahid |
Deposited On: | 07 Nov 2022 10:03 |
Last Modified: | 07 Nov 2022 10:03 |
Repository Staff Only: item control page