Universiti Teknologi Malaysia Institutional Repository

The challenges of extract, transform and load (ETL) for data integration in near real-time environment

Sabtu, A. and Azmi, N. F. M. and Sjarif, N. N. A. and Ismail, S. A. and Yusop, O. M. and Sarkan, H. and Chuprat, S. (2017) The challenges of extract, transform and load (ETL) for data integration in near real-time environment. Journal of Theoretical and Applied Information Technology, 95 (22). pp. 6314-6322. ISSN 1992-8645

[img]
Preview
PDF
194kB

Official URL: https://www.scopus.com/inward/record.uri?eid=2-s2....

Abstract

Organization with considerable investment into data warehousing, the influx of various data types and forms require certain ways of prepping data and staging platform that support fast, efficient and volatile data to reach its targeted audiences or users of different business needs. Extract, Transform and Load (ETL) system proved to be a choice standard for managing and sustaining the movement and transactional process of the valued big data assets. However, traditional ETL system can no longer accommodate and effectively handle streaming or near real-time data and stimulating environment which demands high availability, low latency and horizontal scalability features for functionality. This paper identifies the challenges of implementing ETL system for streaming or near real-time data which needs to evolve and streamline itself with the different requirements. Current efforts and solution approaches to address the challenges are presented. The classification of ETL system challenges are prepared based on near real-time environment features and ETL stages to encourage different perspectives for future research.

Item Type:Article
Uncontrolled Keywords:Low latency, Near real-time environment
Subjects:T Technology > T Technology (General)
Divisions:Advanced Informatics School
ID Code:76644
Deposited By: Fazli Masari
Deposited On:30 Apr 2018 13:46
Last Modified:30 Apr 2018 13:46

Repository Staff Only: item control page