Prasanna Kumar, M.; Kumaraswamy, R.

doi:10.1007/s10772-015-9309-1

Back to matches

Your institution may have access to this item. Find your institution then sign in to continue.

Title: Supervised and unsupervised separation of convolutive speech mixtures using f and formant frequencies.
Authors: Prasanna Kumar, M.; Kumaraswamy, R.
Abstract: In this paper we discuss the role of fundamental frequency f and formants F, F and F of the speech signal in supervised and unsupervised source separation of real recorded convolutive speech mixtures. Initially supervised source separation is discussed where it is assumed that sources are known a priori. The supervised source separation is discussed by considering (1) only fundamental frequency f, (2) only formants F, F and F, (3) both f and formants F, F and F. It is observed that last case which involves both f and formants gives most accurate separation results and is used as ideal case or reference to compare the separation results obtained for unsupervised source separation. The unsupervised source separation is discussed, where there is no knowledge about the sources a priori. The unsupervised source separation is discussed using (1) cross correlation of formants of different frames along with f and (2) standard deviation of magnitude of frequency components in F, F and F regions of the spectrogram. It is observed that separation results obtained using both unsupervised methods are very close to the ideal case in supervised source separation. The results show that this method works better than some of the classical blind source separation algorithms like independent component analysis and non negative matrix factorization which works well only for the case of instantaneous mixtures where delay is neglected.
Subjects: FORMANTS (Speech); VOICE frequency; SIGNAL convolution; ALGORITHMS; STANDARD deviations; CEPSTRUM analysis (Mechanics)
Publication: International Journal of Speech Technology, 2015, Vol 18, Issue 4, p649
ISSN: 1381-2416
Publication type: Article
DOI: 10.1007/s10772-015-9309-1

We found a match

Supervised and unsupervised separation of convolutive speech mixtures using f and formant frequencies.

Prasanna Kumar, M.; Kumaraswamy, R.

FORMANTS (Speech); VOICE frequency; SIGNAL convolution; ALGORITHMS; STANDARD deviations; CEPSTRUM analysis (Mechanics)

International Journal of Speech Technology, 2015, Vol 18, Issue 4, p649

1381-2416

Article

10.1007/s10772-015-9309-1