May, Avner; Garakani, Alireza Bagheri; Zhiyun Lu; Dong Guo; Kuan Liu; Bellet, Aurélien; Linxi Fan; Collins, Michael; Hsu, Daniel; Kingsbury, Brian; Picheny, Michael; Fei Sha

Back to matches

Your institution may have rights to this item. Sign in to continue.

Title: Kernel Approximation Methods for Speech Recognition.
Authors: May, Avner; Garakani, Alireza Bagheri; Zhiyun Lu; Dong Guo; Kuan Liu; Bellet, Aurélien; Linxi Fan; Collins, Michael; Hsu, Daniel; Kingsbury, Brian; Picheny, Michael; Fei Sha
Abstract: We study the performance of kernel methods on the acoustic modeling task for automatic speech recognition, and compare their performance to deep neural networks (DNNs). To scale the kernel methods to large data sets, we use the random Fourier feature method of Rahimi and Recht (2007). We propose two novel techniques for improving the performance of kernel acoustic models. First, we propose a simple but effective feature selection method which reduces the number of random features required to attain a fixed level of performance. Second, we present a number of metrics which correlate strongly with speech recognition performance when computed on the heldout set; we attain improved performance by using these metrics to decide when to stop training. Additionally, we show that the linear bottleneck method of Sainath et al. (2013a) improves the performance of our kernel models significantly, in addition to speeding up training and making the models more compact. Leveraging these three methods, the kernel methods attain token error rates between 0:5% better and 0:1% worse than fully-connected DNNs across four speech recognition data sets, including the TIMIT and Broadcast News benchmark tasks.
Subjects: SPEECH perception; AUTOMATIC speech recognition; ACOUSTIC models; BIG data; FEATURE selection; RANDOM numbers
Publication: Journal of Machine Learning Research, 2019, Vol 20, Issue 57-84, p1
ISSN: 1532-4435
Publication type: Article

We found a match

Kernel Approximation Methods for Speech Recognition.

May, Avner; Garakani, Alireza Bagheri; Zhiyun Lu; Dong Guo; Kuan Liu; Bellet, Aurélien; Linxi Fan; Collins, Michael; Hsu, Daniel; Kingsbury, Brian; Picheny, Michael; Fei Sha

SPEECH perception; AUTOMATIC speech recognition; ACOUSTIC models; BIG data; FEATURE selection; RANDOM numbers

Journal of Machine Learning Research, 2019, Vol 20, Issue 57-84, p1

1532-4435

Article