Self-labeling with feature transfer for speech emotion recognition

被引：11

作者：

Wen, Guihua ^{[1
]}

Liao, Huiqiang ^{[1
]}

Li, Huihui ^{[2
]}

Wen, Pengchen ^{[3
]}

Zhang, Tong ^{[1
]}

Gao, Sande ^{[4
]}

Wang, Bao ^{[4
]}

机构：

[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou, Peoples R China

[2] Guangdong Polytech Normal Univ, Sch Comp Sci, Guangzhou, Peoples R China

[3] Hubei Minzu Univ, Sch Informat Engn, Enshi, Hubei, Peoples R China

[4] Affiliated TCM Hosp Guangzhou Med Univ, Guangzhou, Peoples R China

来源：

KNOWLEDGE-BASED SYSTEMS | 2022年 / 254卷

基金：

中国国家自然科学基金;

关键词：

Speech emotion recognition; Deep neural network; Self-labeled; Speech frame; Transfer learning; REPRESENTATION;

D O I：

10.1016/j.knosys.2022.109589

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Most speech emotion recognition methods based on frames have obtained good results in many applications. However, they segment each speech sample into smaller frames that are labeled with the same emotional tag as that of the speech sample. This is inconsistent with the possibility of a speech sample containing several emotional categories at the same time. Thus, this paper proposes a self-labeling (SL) learning method for speech emotion recognition, which automatically segments each speech sample into frames and then labels them with the corresponding emotional tags, where the compatibility of these tags is also checked. Then, a time-frequency deep neural network for speech emotion recognition is designed and trained. As most speech emotion datasets are very small, the feature transfer model is applied to further enhance the performance of the SL learning method, which is trained on large-scale audio data. Experimental results on various datasets demonstrate the effectiveness of the proposed method. (C) 2022 Published by Elsevier B.V.

引用

页数：10

共 50 条

[1] Self-Labeling Learning Ensemble via Deep Recurrent Neural Network and Self-Representation for Speech Emotion Recognition
Cui, Yan
Jiang, Xiaoyan
Dai, Yue
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2024, 38 (09)
[2] A Self-Labeling Feature Matching Algorithm for Instance Recognition on Multi-Sensor Images
Zhang X.
He Z.
Ma Z.
Yang Y.
Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2021, 41 (05): : 558 - 568
[3] Self-labeling methods for unsupervised transfer ranking
Li, Pengfei
Sanderson, Mark
Carman, Mark
Scholer, Falk
INFORMATION SCIENCES, 2020, 516 : 293 - 315
[4] Feature Selection Based Transfer Subspace Learning for Speech Emotion Recognition
Song, Peng
Zheng, Wenming
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2020, 11 (03) : 373 - 382
[5] Feature representation for speech emotion Recognition
Abdollahpour, Mehdi
Zamani, Lafar
Rad, Hamidreza Saligheh
2017 25TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2017, : 1465 - 1468
[6] Self-attention transfer networks for speech emotion recognition
Ziping ZHAO
Keru Wang
Zhongtian BAO
Zixing ZHANG
Nicholas CUMMINS
Shihuang SUN
Haishuai WANG
Jianhua TAO
Bj?rn W.SCHULLER
虚拟现实与智能硬件(中英文), 2021, 3 (01) : 43 - 54
[7] Decoupled Feature and Self-Knowledge Distillation for Speech Emotion Recognition
Yu, Haixiang
Ning, Yuan
IEEE ACCESS, 2025, 13 : 33275 - 33285
[8] Sparse Autoencoder-based Feature Transfer Learning for Speech Emotion Recognition
Deng, Jun
Zhang, Zixing
Marchi, Erik
Schuller, Bjoern
2013 HUMAINE ASSOCIATION CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2013, : 511 - 516
[9] Self-labeling sexual harassment
Magley, VJ
Shupe, EI
SEX ROLES, 2005, 53 (3-4) : 173 - 189
[10] Self-Labeling Sexual Harassment
Vicki J. Magley
Ellen I. Shupe
Sex Roles, 2005, 53 : 173 - 189

← 1 2 3 4 5 →