IAES International Journal of Artificial Intelligence (IJ-AI)
Vol 12, No 4: December 2023

Improving Indonesian multietnics speaker recognition using pitch shifting data augmentation

Kristiawan Nugroho (Universitas Stikubank)
Isworo Nugroho (Universitas Stikubank)
De Rosal Igniatus Moses Setiadi (Universitas Dian Nuswantoro)
Omar Farooq (A. M. U. Aligarh)



Article Info

Publish Date
01 Dec 2023

Abstract

Speaker recognition to recognize multiethnic speakers is an interesting research topic. Various studies involving many ethnicities require the right approach to achieve optimal model performance. The deep learning approach has been used in speaker recognition research involving many classes to achieve high accuracy results with promising results. However, multi-class and imbalanced datasets are still obstacles encountered in various studies using the deep learning method which cause overfitting and decreased accuracy. Data augmentation is an approach model used in overcoming the problem of small amounts of data and multiclass problems. This approach can improve the quality of research data according to the method applied. This study proposes a data augmentation method using pitch shifting with a deep neural network called pitch shifting data augmentation deep neural network (PSDA-DNN) to identify multiethnic Indonesian speakers. The results of the research that has been done prove that the PSDA-DNN approach is the best method in multi-ethnic speaker recognition where the accuracy reaches 99.27% and the precision, recall, F1 score is 97.60%.

Copyrights © 2023






Journal Info

Abbrev

IJAI

Publisher

Subject

Computer Science & IT Engineering

Description

IAES International Journal of Artificial Intelligence (IJ-AI) publishes articles in the field of artificial intelligence (AI). The scope covers all artificial intelligence area and its application in the following topics: neural networks; fuzzy logic; simulated biological evolution algorithms (like ...