Xiaoyan Zhao; Lin Zhou; Yue Xie; Ying Tong; Jingang Shi

doi:10.32604/iasc.2023.030180

Title: Speech Separation Algorithm Using Gated Recurrent Network Based on Microphone Array.
Authors: Xiaoyan Zhao; Lin Zhou; Yue Xie; Ying Tong; Jingang Shi
Abstract: Speech separation is an active research topic that plays an important role in numerous applications, such as speaker recognition, hearing prostheses, and autonomous robots. Many algorithms have been put forward to improve separation performance. However, speech separation in reverberant, noisy environments is still a challenging task. To address this, this paper proposes a novel speech separation algorithm that uses a gated recurrent unit (GRU) network with a microphone array. The main aim of the proposed algorithm is to improve separation performance and reduce computational cost. The algorithm extracts the sub-band steered response power-phase transform (SRP-PHAT), weighted by a gammatone filter, as the speech separation feature because of its discriminative and robust spatial position information. Since the GRU network processes time series data with faster training and fewer trainable parameters, the GRU model is adopted to process the separation features of several sequential frames in the same sub-band and estimate the ideal ratio mask (IRM). The algorithm decomposes the mixture signals into time-frequency (TF) units using a gammatone filter bank in the frequency domain, and the target speech is reconstructed in the frequency domain by masking the mixture signal according to the estimated IRM. Because both the decomposition of the mixture signal and the reconstruction of the target signal are completed in the frequency domain, the total computational cost is reduced. Experimental results demonstrate that the proposed algorithm realizes omnidirectional speech separation in noisy and reverberant environments, provides good performance in terms of speech quality and intelligibility, and generalizes to reverberant conditions.
Subjects: MICROPHONES; SPEECH; MICROPHONE arrays; INTELLIGIBILITY of speech; FILTER banks; AUTONOMOUS robots; ALGORITHMS; MULTICASTING (Computer networks)
Publication: Intelligent Automation & Soft Computing, 2023, Vol 36, Issue 3, p3087
ISSN: 1079-8587
Publication type: Article
DOI: 10.32604/iasc.2023.030180
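
Note on the masking step described in the abstract: a minimal sketch of ratio masking over TF units, under stated assumptions. The abstract does not give the paper's exact IRM definition, so the square-root energy ratio below is a commonly used form and may differ from the authors' formula. In the paper the mask is estimated by the GRU network; here it is computed directly from known target and interference energies (an oracle mask) purely for illustration. The function names and the toy values are hypothetical.

```python
import math

def ideal_ratio_mask(target_energy, noise_energy, eps=1e-12):
    """Oracle IRM per TF unit, assumed form: sqrt(S / (S + N)).

    target_energy and noise_energy are lists of sub-bands, each a list of
    per-frame energies. eps avoids division by zero in silent units.
    """
    return [
        [math.sqrt(s / (s + n + eps)) for s, n in zip(s_row, n_row)]
        for s_row, n_row in zip(target_energy, noise_energy)
    ]

def apply_mask(mixture, mask):
    """Scale each complex mixture TF unit by its mask value."""
    return [
        [m * x for m, x in zip(m_row, x_row)]
        for m_row, x_row in zip(mask, mixture)
    ]

# Toy example: 2 sub-bands x 3 frames (values are made up).
target_energy = [[4.0, 1.0, 0.0], [9.0, 0.0, 1.0]]
noise_energy = [[0.0, 1.0, 4.0], [16.0, 1.0, 0.0]]
mixture = [[2 + 0j, 1 + 1j, 2j], [5 + 0j, 1 + 0j, 1 + 0j]]

mask = ideal_ratio_mask(target_energy, noise_energy)
estimate = apply_mask(mixture, mask)
```

Units dominated by the target keep a mask value near 1, and units dominated by interference are attenuated toward 0, which is what lets the masked mixture approximate the target speech.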
