Cross-Corpus Speech Emotion Recognition Based on Joint Transfer Subspace Learning and Regression

被引：9

作者：

Zhang, Weijian ^{[1
]}

Song, Peng ^{[1
]}

Chen, Dongliang ^{[1
]}

Sheng, Chao ^{[1
]}

Zhang, Wenjing ^{[1
]}

机构：

[1] Yantai Univ, Sch Comp & Control Engn, Shandong 264005, Peoples R China

来源：

IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS | 2022年 / 14卷 / 02期

基金：

中国国家自然科学基金;

关键词：

Regression; speech emotion recognition; subspace learning; transfer learning; DIMENSIONALITY REDUCTION; FEATURES; FRAMEWORK;

D O I：

10.1109/TCDS.2021.3055524

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Speech emotion recognition has become an attractive research topic due to various emotional states of speech signals in real-life scenarios. Most current speech emotion recognition methods are carried out on a single corpus. However, in practice, the training and testing data often come from different domains, e.g., different corpora. In this case, the model generalizability and recognition performance would decrease greatly due to the domain mismatch. To address this challenging problem, we present a transfer learning method, called joint transfer subspace learning and regression (JTSLR), for cross-corpus speech emotion recognition. Specifically, JTSLR performs transfer subspace learning and regression in a joint framework. First, we learn a latent subspace by introducing a discriminative maximum mean discrepancy (MMD) as the discrepancy metric. Then, we put forward a regression function in this latent subspace to describe the relationships between features and corresponding labels. Moreover, we present a label graph to help transfer knowledge from relevant source data to target data. Finally, we conduct extensive experiments on three popular emotional data sets. The results show that our method can outperform traditional methods and some state-of-the-art transfer learning algorithms for cross-corpus speech emotion recognition tasks.

引用

下载

页码：588 / 598

页数：11

共 50 条

[1] Cross-Corpus Speech Emotion Recognition Based on Sparse Subspace Transfer Learning
Zhao, Keke
Song, Peng
Zhang, Wenjing
Zhang, Weijian
Li, Shaokai
Chen, Dongliang
Zheng, Wenming
BIOMETRIC RECOGNITION (CCBR 2021), 2021, 12878 : 466 - 473
[2] Transfer Linear Subspace Learning for Cross-Corpus Speech Emotion Recognition
Song, Peng
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2019, 10 (02) : 265 - 275
[3] Transfer Subspace Learning for Unsupervised Cross-Corpus Speech Emotion Recognition
Liu, Na
Zhang, Baofeng
Liu, Bin
Shi, Jingang
Yang, Lei
Li, Zhiwei
Zhu, Junchao
IEEE ACCESS, 2021, 9 : 95925 - 95937
[4] Transfer Sparse Discriminant Subspace Learning for Cross-Corpus Speech Emotion Recognition
Zhang, Weijian
Song, Peng
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 (28) : 307 - 318
[5] Nonnegative Matrix Factorization Based Transfer Subspace Learning for Cross-Corpus Speech Emotion Recognition
Luo, Hui
Han, Jiqing
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 2047 - 2060
[6] Target-Adapted Subspace Learning for Cross-Corpus Speech Emotion Recognition
Chen, Xiuzhen
Zhou, Xiaoyan
Lu, Cheng
Zong, Yuan
Zheng, Wenming
Tang, Chuangao
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (12) : 2632 - 2636
[7] Cross-corpus speech emotion recognition using subspace learning and domain adaption
Cao, Xuan
Jia, Maoshen
Ru, Jiawei
Pai, Tun-wen
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2022, 2022 (01)
[8] Cross-corpus speech emotion recognition using subspace learning and domain adaption
Xuan Cao
Maoshen Jia
Jiawei Ru
Tun-wen Pai
EURASIP Journal on Audio, Speech, and Music Processing, 2022
[9] CROSS-CORPUS SPEECH EMOTION RECOGNITION USING JOINT DISTRIBUTION ADAPTIVE REGRESSION
Zhang, Jiacheng
Jiang, Lin
Zong, Yuan
Zheng, Wenming
Zhao, Li
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3790 - 3794
[10] Deep Transductive Transfer Regression Network for Cross-Corpus Speech Emotion Recognition
Zhao, Yan
Wang, Jincen
Ye, Ru
Zong, Yuan
Zheng, Wenming
Zhao, Li
INTERSPEECH 2022, 2022, : 371 - 375

← 1 2 3 4 5 →