Shaffiei, Zatul Alwani and Syah Amir Hamzah, Amir Syafiq Syamin and Harunor Rashid, Shaikh Mariyam and Oshima, Naoki (2023) Role of text mining in extracting valuable information from text data. Journal of Advanced Research in Applied Sciences and Engineering Technology, 32 (1). pp. 263-271. ISSN 2462-1943
PDF
2MB |
Official URL: http://dx.doi.org/10.37934/araset.32.1.263271
Abstract
Text mining has become a popular field with the rapid development of information technology and the extensive amounts of unstructured text data such as web pages, social network sites and technical documentations. This data contains a lot of information, which is extremely difficult to deal with the huge number and various forms. Extracting and analysing important information from massive data for example in automotive industries has become our major problem. The main aim of text mining is to extract important information from massive text data that are difficult to be handled manually with error-free. In this paper, the fundamental concept is based on Euclidean distance in finding the similarity between words. Finally, a set of data is used to describe the similarities, distances and frequencies between several words. Word cloud, bar plot, dendrogram and co-occurrence network are also presented to illustrate the behaviour of the text data.
Item Type: | Article |
---|---|
Uncontrolled Keywords: | Text mining, Word cloud, Dendrogram, Co-occurrence network, Euclidean distance. |
Subjects: | T Technology > TK Electrical engineering. Electronics Nuclear engineering > TK7885-7895 Computer engineer. Computer hardware |
Divisions: | Malaysia-Japan International Institute of Technology |
ID Code: | 108555 |
Deposited By: | Muhamad Idham Sulong |
Deposited On: | 17 Nov 2024 09:51 |
Last Modified: | 17 Nov 2024 09:51 |
Repository Staff Only: item control page