Universiti Teknologi Malaysia Institutional Repository

Siamese networks for speaker identification on resource-constrained platforms

Lim, Jun Jie and Ahmad Zabidi, Muhammad Mun’im and Abdul Manan, Shahidatul Sadiah and Ab. Rahman, Ab. Al-Hadi (2023) Siamese networks for speaker identification on resource-constrained platforms. In: 1st International Conference on Electronic and Computer Engineering, ECE 2023, 4 July 2023 - 5 July 2023, Virtual, UTM Johor Bahru, Johor, Malaysia.

[img] PDF
1MB

Official URL: http://dx.doi.org/10.1088/1742-6596/2622/1/012014

Abstract

This paper investigates the implementation of a lightweight Siamese neural network for enhancing speaker identification accuracy and inference speed in embedded systems. Integrating speaker identification into embedded systems can improve portability and versatility. Siamese neural networks achieve speaker identification by comparing input voice samples to reference voices in a database, effectively extracting features and classifying speakers accurately. Considering the trade-off between accuracy and complexity, as well as hardware constraints in embedded systems, various neural networks could be applied to speaker identification. This paper compares the incorporation of CNN architectures targeted for embedded systems, MCUNet, SqueezeNet and MobileNetv2, to implement Siamese neural networks on a Raspberry Pi. Our experiments demonstrate that MCUNet achieves 85% accuracy with a 0.23-second inference time. In comparison, the larger MobileNetv2 attains 84.5% accuracy with a 0.32-second inference time. Additionally, contrastive loss was superior to binary cross-entropy loss in the Siamese neural network. The system using contrastive loss had almost 68% lower loss scores, resulting in a more stable performance and more accurate predictions. In conclusion, this paper establishes that an appropriate lightweight Siamese neural network, combined with contrastive loss, can significantly improve speaker identification accuracy, and enable efficient deployment on resource-constrained platforms.

Item Type:Conference or Workshop Item (Paper)
Uncontrolled Keywords:embedded-system, entropy loss, extracting features
Subjects:T Technology > TK Electrical engineering. Electronics Nuclear engineering
Divisions:Electrical Engineering
ID Code:107836
Deposited By: Yanti Mohd Shah
Deposited On:08 Oct 2024 06:13
Last Modified:08 Oct 2024 06:13

Repository Staff Only: item control page