We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
Underdetermined Blind Source Separation Based on Spatial Estimation and Compressed Sensing.
- Authors
Wei, Shuang; Zhang, Rui
- Abstract
This paper proposes a dual-channel speech separation method based on spatial estimation via sparse Bayesian inference (SBI) and nonnegative matrix factorization (NMF). The spatial information estimated by traditional compressed sensing (CS) models is insufficient when two microphones receive limited columns of mixed signals. Considering the sparsity of peak values in the cross-correlation spectrum between two received signals, the proposed method builds a new CS model based on cross-correlation spectrum and applies SBI algorithm to solve this model to improve the estimation accuracy of spatial information. Combined the spatial information with the spectral features decomposed by NMF, NMF coefficient matrix masks belonging to individual source are generated for pre-separation. To mitigate retained potential interference components, a post-separation processing stage is designed using an expectation maximization (EM) algorithm based on a Gaussian mixture model (GMM). The estimated spatial information and binary time–frequency masks are used for parameter initialization of the EM algorithm. The experimental results using real-world speech data show that the proposed method can achieve better separation performance compared to various existing methods.
- Subjects
BLIND source separation; COMPRESSED sensing; GAUSSIAN mixture models; CHANNEL estimation; EXPECTATION-maximization algorithms; MATRIX decomposition; NONNEGATIVE matrices; AUTOMATIC speech recognition; BAYESIAN field theory
- Publication
Circuits, Systems & Signal Processing, 2024, Vol 43, Issue 4, p2428
- ISSN
0278-081X
- Publication type
Article
- DOI
10.1007/s00034-023-02566-1