Mashudi, Nurul Amirah and Ahmad, Norulhusna and Mohd. Noor, Norliza (2022) LiWGAN: A light method to improve the performance of generative adversarial network. IEEE Access, 10 (NA). pp. 93155-93167. ISSN 2169-3536
Official URL: http://dx.doi.org/10.1109/ACCESS.2022.3203065
Abstract
Generative adversarial networks (GANs) have grown tremendously owing to their potency and efficiency in producing realistic samples. This study proposes a light-weight GAN (LiWGAN) to learn non-image synthesis with minimal computational time for low-power computing. To this end, LiWGAN enhances a new skip-layer channel-wise excitation (SLE) module and a self-supervised discriminator design for non-synthesis performance using the facemask dataset. Wearing a facemask is one of the preventative strategies brought to the fore by the COVID-19 pandemic. LiWGAN performs non-image synthesis of facemasks, which could help researchers identify individuals on lower-power devices, address occlusion challenges in face recognition, and alleviate accuracy problems caused by limited datasets. The study evaluates processing time across batch sizes and image resolutions using the facemask dataset. The Fréchet inception distance (FID) was also measured on the facemask images to evaluate the quality of the images augmented with LiWGAN. For 3000 generated images, LiWGAN reached an FID score of 220.43, close to StyleGAN's 219.97, with a significantly shorter processing time of 1.03 s per iteration. A further experiment on the CelebA dataset compared LiWGAN with GL-GAN and DRAGAN to show that it is applicable to other datasets; LiWGAN outperformed both, with an FID score of 91.31 and a processing time of 3.50 s per iteration. Future work could aim to bring the FID score closer to zero with less processing time by using different datasets.
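The abstract reports image quality as FID scores, where lower is better and zero means the two feature distributions match. As a rough illustration of what that number measures, the following is a minimal sketch of the Fréchet distance between Gaussians fitted to two feature sets. It is not the authors' evaluation code: the function name `frechet_distance` and the random stand-in features are assumptions, and a real FID computation would use features extracted by an Inception network.

```python
import numpy as np


def frechet_distance(feats_real, feats_gen):
    """Frechet distance between Gaussians fitted to two feature sets.

    Each argument is an (n_samples, n_features) array. For a real FID
    score the rows would be Inception features of real and generated
    images; here they are arbitrary arrays (illustrative assumption).
    """
    mu_r = feats_real.mean(axis=0)
    mu_g = feats_gen.mean(axis=0)
    sigma_r = np.atleast_2d(np.cov(feats_real, rowvar=False))
    sigma_g = np.atleast_2d(np.cov(feats_gen, rowvar=False))

    # Tr((sigma_r sigma_g)^(1/2)) equals the sum of the square roots of
    # the eigenvalues of sigma_r @ sigma_g, which are real and
    # non-negative for covariance matrices (up to rounding error).
    eigvals = np.linalg.eigvals(sigma_r @ sigma_g)
    tr_sqrt = np.sqrt(np.clip(eigvals.real, 0.0, None)).sum()

    diff = mu_r - mu_g
    return float(diff @ diff + np.trace(sigma_r) + np.trace(sigma_g) - 2.0 * tr_sqrt)


# Stand-in features: two sample sets whose means differ by 0.5 per dimension.
rng = np.random.default_rng(0)
real = rng.normal(0.0, 1.0, size=(500, 8))
fake = rng.normal(0.5, 1.0, size=(500, 8))

print(frechet_distance(real, real))  # close to zero for identical sets
print(frechet_distance(real, fake))  # larger when the distributions differ
```

The eigenvalue route avoids a dedicated matrix square root routine, which keeps the sketch dependent on NumPy alone.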
Item Type: | Article |
---|---|
Uncontrolled Keywords: | data augmentation, deep learning, generative adversarial network, Non-image synthesis, self-supervised discriminator |
Subjects: | T Technology > T Technology (General) |
Divisions: | Razak School of Engineering and Advanced Technology |
ID Code: | 104422 |
Deposited By: | Widya Wahid |
Deposited On: | 04 Feb 2024 09:58 |
Last Modified: | 04 Feb 2024 09:58 |