Ayvaz, Uğur; Gürüler, Hüseyin; Khan, Faheem; Ahmed, Naveed; Taegkeun Whangbo; Bobomirzaevich, Abdusalomov Akmalbek

doi:10.32604/cmc.2022.023278

Title: Automatic Speaker Recognition Using Mel-Frequency Cepstral Coefficients Through Machine Learning.
Authors: Ayvaz, Uğur; Gürüler, Hüseyin; Khan, Faheem; Ahmed, Naveed; Taegkeun Whangbo; Bobomirzaevich, Abdusalomov Akmalbek
Abstract: Automatic speaker recognition (ASR) systems belong to the field of human-machine interaction, and researchers have been using feature extraction and feature matching methods to analyze and synthesize speech signals. One of the most commonly used feature extraction methods is Mel-Frequency Cepstral Coefficients (MFCCs). Recent research shows that MFCCs process voice signals with high accuracy. MFCCs represent a sequence of features specific to the voice signal. This experimental analysis aims to distinguish Turkish speakers by extracting MFCCs from speech recordings. Because human perception of sound is not linear, after the filterbank step of the MFCC method we converted the resulting log filterbank energies into decibel (dB) feature-based spectrograms without applying the Discrete Cosine Transform (DCT). A new dataset was created by converting each spectrogram into a 2-D array. Several learning algorithms were implemented with 10-fold cross-validation to identify the speaker. The highest accuracy, 90.2%, was achieved using a Multi-layer Perceptron (MLP) with the tanh activation function. The most important output of this study is the inclusion of human voice as a new feature set.
Subjects: MACHINE learning; DISCRETE cosine transforms; AUTOMATIC speech recognition; SIGNAL processing; FEATURE extraction; AUDITORY perception
Publication: Computers, Materials & Continua, 2022, Vol 71, Issue 3, p5511
ISSN: 1546-2218
Publication type: Article
DOI: 10.32604/cmc.2022.023278
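
The feature step described in the abstract (mel filterbank energies converted to dB, with the DCT skipped) can be sketched roughly as follows. This is an illustrative sketch and not the authors' implementation: the function names, the common mel formula 2595·log10(1 + f/700), the Hamming window, the frame length, hop size, filter count, and dB floor are all assumptions chosen for the example, and a naive DFT from the standard library stands in for a real FFT.

```python
import cmath
import math


def hz_to_mel(hz):
    # Common mel-scale formula; the paper may use a different variant.
    return 2595.0 * math.log10(1.0 + hz / 700.0)


def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters spaced evenly on the mel scale, one row per filter."""
    n_bins = n_fft // 2 + 1
    top = hz_to_mel(sample_rate / 2.0)
    # n_filters + 2 edge points: each filter spans three consecutive points.
    edges_hz = [mel_to_hz(top * i / (n_filters + 1)) for i in range(n_filters + 2)]
    bin_hz = [k * sample_rate / n_fft for k in range(n_bins)]
    bank = []
    for m in range(1, n_filters + 1):
        lo, mid, hi = edges_hz[m - 1], edges_hz[m], edges_hz[m + 1]
        row = []
        for f in bin_hz:
            rising = (f - lo) / (mid - lo)
            falling = (hi - f) / (hi - mid)
            row.append(max(0.0, min(rising, falling)))
        bank.append(row)
    return bank


def power_spectrum(frame):
    """One-sided power spectrum of a Hamming-windowed frame (naive DFT)."""
    n = len(frame)
    windowed = [x * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
                for i, x in enumerate(frame)]
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                  for i, x in enumerate(windowed))
        spectrum.append(abs(acc) ** 2 / n)
    return spectrum


def log_mel_db(frame, sample_rate, n_filters=26, floor=1e-10):
    """Filterbank energies in dB; the DCT that would yield MFCCs is skipped."""
    power = power_spectrum(frame)
    bank = mel_filterbank(n_filters, len(frame), sample_rate)
    energies = [sum(w * p for w, p in zip(row, power)) for row in bank]
    # The floor (an assumed value) avoids log10(0) on silent frames.
    return [10.0 * math.log10(max(e, floor)) for e in energies]


def spectrogram_db(signal, sample_rate, frame_len=256, hop=128, n_filters=26):
    """2-D array (frames x filters) of dB features."""
    return [log_mel_db(signal[s:s + frame_len], sample_rate, n_filters)
            for s in range(0, len(signal) - frame_len + 1, hop)]
```

Calling `spectrogram_db(samples, 8000)` on a list of audio samples returns one row of dB values per frame, which is the kind of 2-D array the abstract describes feeding to the learning algorithms. In practice a library FFT would replace the naive DFT, which is used here only to keep the sketch dependency-free.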
