Huang, Zhengwei; Xue, Wentao; Mao, Qirong; Zhan, Yongzhao

doi:10.1007/s11042-016-3354-x

Title: Unsupervised domain adaptation for speech emotion recognition using PCANet.
Authors: Huang, Zhengwei; Xue, Wentao; Mao, Qirong; Zhan, Yongzhao
Abstract: Research in emotion recognition seeks to develop insights into the variances of features of emotion in one common domain. However, automatic emotion recognition from speech is challenging when training data and test data are drawn from different domains due to different recording conditions, languages, speakers and many other factors. In this paper, we propose a novel feature transfer approach with PCANet (a deep network), which extracts both the domain-shared and the domain-specific latent features to facilitate performance improvement. The proposal attempts to learn multiple intermediate feature representations along an interpolating path between the source and target domains using PCANet by considering the distribution shift between the source domain and the target domain, and then aligns the other feature representations on the path with the target subspace so that they change in the right direction towards the target. To exemplify the effectiveness of our approach, we select the INTERSPEECH 2009 Emotion Challenge's FAU Aibo Emotion Corpus as the target database and two public databases (ABC and Emo-DB) as the source sets. Experimental results demonstrate that the proposed feature transfer learning method outperforms conventional machine learning methods and other transfer learning methods.
Subjects: AUTOMATIC speech recognition; EMOTION recognition; MACHINE learning; FEATURE extraction; DIGITAL image processing
Publication: Multimedia Tools & Applications, 2017, Vol 76, Issue 5, p6785
ISSN: 1380-7501
Publication type: Article
DOI: 10.1007/s11042-016-3354-x
