We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
Supervised and unsupervised separation of convolutive speech mixtures using f and formant frequencies.
- Authors
Prasanna Kumar, M.; Kumaraswamy, R.
- Abstract
In this paper we discuss the role of fundamental frequency f and formants F, F and F of the speech signal in supervised and unsupervised source separation of real recorded convolutive speech mixtures. Initially supervised source separation is discussed where it is assumed that sources are known a priori. The supervised source separation is discussed by considering (1) only fundamental frequency f, (2) only formants F, F and F, (3) both f and formants F, F and F. It is observed that last case which involves both f and formants gives most accurate separation results and is used as ideal case or reference to compare the separation results obtained for unsupervised source separation. The unsupervised source separation is discussed, where there is no knowledge about the sources a priori. The unsupervised source separation is discussed using (1) cross correlation of formants of different frames along with f and (2) standard deviation of magnitude of frequency components in F, F and F regions of the spectrogram. It is observed that separation results obtained using both unsupervised methods are very close to the ideal case in supervised source separation. The results show that this method works better than some of the classical blind source separation algorithms like independent component analysis and non negative matrix factorization which works well only for the case of instantaneous mixtures where delay is neglected.
- Subjects
FORMANTS (Speech); VOICE frequency; SIGNAL convolution; ALGORITHMS; STANDARD deviations; CEPSTRUM analysis (Mechanics)
- Publication
International Journal of Speech Technology, 2015, Vol 18, Issue 4, p649
- ISSN
1381-2416
- Publication type
Article
- DOI
10.1007/s10772-015-9309-1