Universiti Teknologi Malaysia Institutional Repository

Tools in data science for better processing

Hussien, Nur Syahela and Sulaiman, Sarina and Shamsuddin, Siti Mariyam (2015) Tools in data science for better processing. In: Simposium Kebangsaan Sains Matematik Ke-23, 24-26 Nov,2015, Johor Bahru, Johor.

[img]
Preview
PDF
474kB

Abstract

Analysing the data is an important part of a research in data science. There are many tools that can be used in analysing a data set to get the experiment results for classification, clustering and others. However, the researchers are concerned about how to increase the efficiency in analysing a data set. In this paper, three open source tools which are the Waikato Environment for Knowledge Analysis (WEKA), Konstanz Information Miner (KNIME) and Salford Predictive Modular (SPM) were compared to identify the better processing tools in evaluating the presented data. All of these tools have their own different characteristics. WEKA can handle pre-processing of data and then analyses it based on different algorithms. It is suitable to be used for classification, regression, clustering, association rules, and visualisation. The algorithms can be applied directly to a data set or called from its own Java code. KNIME is more inclined towards producing graphical view, while SPM is a highly accurate and ultra-fast analytics which also data mines platforms for any sizes, complexity or organisation. The results illustrate the tools capability in analysing data sets and evaluators in an efficient and effective manner.

Item Type:Conference or Workshop Item (Paper)
Uncontrolled Keywords:data science, efficiency processing
Subjects:Q Science > QA Mathematics
Divisions:Computing
ID Code:62121
Deposited By: Widya Wahid
Deposited On:09 May 2017 08:54
Last Modified:21 Aug 2017 07:07

Repository Staff Only: item control page