Universiti Teknologi Malaysia Institutional Repository

Deep learning algorithms-based object detection and localization revisited

Waheed, Safa Riyadh and Mohd. Suaib, Norhaida and Mohd. Rahim, Mohd. Shafry and Adnan, Myasar Mundher and Salim, A. A. (2021) Deep learning algorithms-based object detection and localization revisited. In: International Laser Technology and Optics Symposium in Conjunction with Photonics Meeting 2020, ILATOSPM 2020, 22 - 23 October 2020, Johor, Malaysia.

[img]
Preview
PDF
1MB

Official URL: http://dx.doi.org/10.1088/1742-6596/1892/1/012001

Abstract

The computer vision (CV) is an emerging area with sundry promises. This communication encompasses the past development, recent trends and future directions of the CV in the context of deep learning (DL) algorithms-based object detections and localizations techniques. To identify the object location inside an image and recognize it by a computer program as fast as the human brain the machine learning and DL techniques have been evolved. However, the main limitations of the machine are related to the prolonged time consumption to handle vast amount of data to perform the same task as the human brain. To overcome these shortcomings, the convolution neural networks (NNs)-based deep NN has been developed, which detects and classifies the object with high precision. To train the deep NNs, massive amount of data (in the form of images and videos) and time is needed, making the computational cost of the CV very high. Thus, transfer learning techniques have been proposed wherein a model trained on one task can be reused on another linked task, thereby producing excellent outcomes. In this spirit, diverse DL-based algorithms have been introduced to detect and classify the object. These algorithms include the region-based convolutional NN (R-CNN), fast R-CNN, Faster R-CNN, mask E-CNN and You Only Look Once. A comparative evaluation among these techniques has been made to reveal their merits and demerits in the CV.

Item Type:Conference or Workshop Item (Paper)
Uncontrolled Keywords:deep learning, algorithms
Subjects:Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions:Computing
ID Code:94337
Deposited By: Widya Wahid
Deposited On:31 Mar 2022 15:14
Last Modified:31 Mar 2022 15:14

Repository Staff Only: item control page