Shaffiei, Zatul Alwani and Amir Hamzah, Amir Syafiq Syamin Syah and Rashid, Shaikh Mariyam Harunor and Oshima, Naoki (2023) Role of text mining in extracting valuable information from text data. Journal of Advanced Research in Applied Sciences and Engineering Technology, 32 (1). pp. 263-271. ISSN 2462-1943
Official URL: http://dx.doi.org/10.37934/ARASET.32.1.263271
Abstract
Text mining has become a popular field with the rapid development of information technology and the growth of unstructured text data such as web pages, social network sites and technical documentation. This data contains a great deal of information that is extremely difficult to handle because of its huge volume and varied forms. Extracting and analysing important information from massive data, for example in the automotive industry, has become a major problem. The main aim of text mining is to extract important information from massive text data that is difficult to handle manually without error. In this paper, the fundamental concept is based on Euclidean distance for finding the similarity between words. Finally, a data set is used to describe the similarities, distances and frequencies between several words. A word cloud, bar plot, dendrogram and co-occurrence network are also presented to illustrate the behaviour of the text data.
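As a rough illustration of the Euclidean-distance idea mentioned in the abstract, the sketch below represents each word by its counts across a handful of documents and measures the distance between those count vectors. The abstract does not say how the paper builds its word vectors, so the term-document representation, the helper names `term_document_counts` and `euclidean_distance`, and the toy automotive sentences are all assumptions made for illustration only.

```python
import math
from collections import Counter


def term_document_counts(documents, vocabulary):
    """Map each word to its list of counts, one count per document.

    Assumption: whitespace tokenization and lowercasing; the paper's
    actual preprocessing is not described in the abstract.
    """
    per_document = [Counter(doc.lower().split()) for doc in documents]
    return {word: [counts[word] for counts in per_document] for word in vocabulary}


def euclidean_distance(u, v):
    """Euclidean distance between two equal-length count vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


# Hypothetical toy corpus, not data from the paper.
documents = [
    "engine brake engine",
    "brake wheel",
    "engine wheel wheel",
]
vocabulary = ["engine", "brake", "wheel"]

vectors = term_document_counts(documents, vocabulary)

# Smaller distance means the two words are distributed more similarly
# across the documents.
for first, second in [("engine", "brake"), ("engine", "wheel"), ("brake", "wheel")]:
    distance = euclidean_distance(vectors[first], vectors[second])
    print(f"{first} - {second}: {distance:.3f}")
```

Under these assumptions, a pairwise distance matrix of this kind is the sort of input that a hierarchical clustering routine would use to draw a dendrogram, while the raw counts would feed a bar plot or word cloud.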
| Item Type: | Article |
|---|---|
| Uncontrolled Keywords: | Co-occurrence network; Dendrogram; Euclidean distance; Text mining; Word cloud. |
| Subjects: | T Technology > TK Electrical engineering. Electronics Nuclear engineering > TK7885-7895 Computer engineer. Computer hardware |
| Divisions: | Malaysia-Japan International Institute of Technology |
| ID Code: | 106143 |
| Deposited By: | Muhamad Idham Sulong |
| Deposited On: | 06 Jun 2024 08:49 |
| Last Modified: | 06 Jun 2024 08:49 |