Universiti Teknologi Malaysia Institutional Repository

Fake news data exploration and analytics

Awan, Mazhar Javed and Yasin, Awais and Nobanee, Haitham and Ali, Ahmed Abid and Shahzad, Zain and Nabeel, Muhammad and Mohd. Zain, Azlan and Shahzad, Hafiz Muhammad Faisal (2021) Fake news data exploration and analytics. Electronics (Switzerland), 10 (19). pp. 1-15. ISSN 2079-9292

Official URL: http://dx.doi.org/10.3390/electronics10192326

Abstract

Before the internet, people got their news from radio, television, and newspapers. With the internet, news moved online, and suddenly anyone could post information on websites such as Facebook and Twitter. The spread of fake news has increased along with social media, and it has become one of the most significant issues of this century. People use fake news to damage the reputation of well-regarded organizations for their own benefit. The main motivation for this project is to build a tool that uses machine learning to examine the language patterns that characterize fake and real news. This paper proposes machine learning models that can successfully detect fake news. These models identify whether a news item is real or fake and indicate how accurate that classification is, even in a complex environment. After data preprocessing and exploration, we applied three machine learning models: random forest classifier, logistic regression, and term frequency-inverse document frequency (TF-IDF) vectorizer. The accuracy of the TF-IDF vectorizer, logistic regression, random forest classifier, and decision tree classifier models was approximately 99.52%, 98.63%, 99.63%, and 99.68%, respectively. Machine learning models can be considered a strong choice for obtaining reality-based results, and they can be applied to other unstructured data for various sentiment analysis applications.
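
To make the TF-IDF step named in the abstract concrete, here is a minimal sketch in plain Python. It is not the authors' implementation: the abstract does not state which library, tokenizer, or weighting variant was used, so the whitespace tokenizer, the smoothed inverse document frequency formula, the function names, and the toy headlines are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (a deliberately naive, assumed tokenizer)."""
    return text.lower().split()


def tfidf_vectors(documents):
    """Return (vocabulary, vectors) with weight = tf * (ln((1 + N) / (1 + df)) + 1).

    This smoothed IDF is one common variant, assumed here for illustration.
    """
    tokenized = [tokenize(doc) for doc in documents]
    vocabulary = sorted({term for tokens in tokenized for term in tokens})
    n_docs = len(documents)
    # Document frequency: how many documents contain each term at least once.
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    idf = {t: math.log((1 + n_docs) / (1 + doc_freq[t])) + 1.0 for t in vocabulary}
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        total = len(tokens) or 1  # guard against empty documents
        vectors.append([counts[t] / total * idf[t] for t in vocabulary])
    return vocabulary, vectors


# Hypothetical toy headlines, not data from the paper.
headlines = [
    "scientists confirm water found on mars",
    "aliens secretly run the government insiders claim",
    "government confirm new budget",
]
vocabulary, vectors = tfidf_vectors(headlines)
```

Each headline becomes a numeric vector in which terms that are rare across the collection receive more weight than terms shared by many documents. Vectors of this kind are what a classifier such as logistic regression or a random forest would take as input.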

Item Type: Article
Uncontrolled Keywords: analytics, big data, data exploration, detection, fake news
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science; T Technology > T Technology (General) > T58.5-58.64 Information technology
Divisions: Computing
ID Code: 94492
Deposited By: Yanti Mohd Shah
Deposited On: 31 Mar 2022 15:46
Last Modified: 31 Mar 2022 15:46
