Self-labeling with feature transfer for speech emotion recognition

被引:11
|
作者
Wen, Guihua [1 ]
Liao, Huiqiang [1 ]
Li, Huihui [2 ]
Wen, Pengchen [3 ]
Zhang, Tong [1 ]
Gao, Sande [4 ]
Wang, Bao [4 ]
机构
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou, Peoples R China
[2] Guangdong Polytech Normal Univ, Sch Comp Sci, Guangzhou, Peoples R China
[3] Hubei Minzu Univ, Sch Informat Engn, Enshi, Hubei, Peoples R China
[4] Affiliated TCM Hosp Guangzhou Med Univ, Guangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Speech emotion recognition; Deep neural network; Self-labeled; Speech frame; Transfer learning; REPRESENTATION;
D O I
10.1016/j.knosys.2022.109589
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most speech emotion recognition methods based on frames have obtained good results in many applications. However, they segment each speech sample into smaller frames that are labeled with the same emotional tag as that of the speech sample. This is inconsistent with the possibility of a speech sample containing several emotional categories at the same time. Thus, this paper proposes a self-labeling (SL) learning method for speech emotion recognition, which automatically segments each speech sample into frames and then labels them with the corresponding emotional tags, where the compatibility of these tags is also checked. Then, a time-frequency deep neural network for speech emotion recognition is designed and trained. As most speech emotion datasets are very small, the feature transfer model is applied to further enhance the performance of the SL learning method, which is trained on large-scale audio data. Experimental results on various datasets demonstrate the effectiveness of the proposed method. (C) 2022 Published by Elsevier B.V.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Self-Labeling Learning Ensemble via Deep Recurrent Neural Network and Self-Representation for Speech Emotion Recognition
    Cui, Yan
    Jiang, Xiaoyan
    Dai, Yue
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2024, 38 (09)
  • [2] A Self-Labeling Feature Matching Algorithm for Instance Recognition on Multi-Sensor Images
    Zhang X.
    He Z.
    Ma Z.
    Yang Y.
    Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2021, 41 (05): : 558 - 568
  • [3] Self-labeling methods for unsupervised transfer ranking
    Li, Pengfei
    Sanderson, Mark
    Carman, Mark
    Scholer, Falk
    INFORMATION SCIENCES, 2020, 516 : 293 - 315
  • [4] Feature Selection Based Transfer Subspace Learning for Speech Emotion Recognition
    Song, Peng
    Zheng, Wenming
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2020, 11 (03) : 373 - 382
  • [5] Feature representation for speech emotion Recognition
    Abdollahpour, Mehdi
    Zamani, Lafar
    Rad, Hamidreza Saligheh
    2017 25TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2017, : 1465 - 1468
  • [6] Self-attention transfer networks for speech emotion recognition
    Ziping ZHAO
    Keru Wang
    Zhongtian BAO
    Zixing ZHANG
    Nicholas CUMMINS
    Shihuang SUN
    Haishuai WANG
    Jianhua TAO
    Bj?rn W.SCHULLER
    虚拟现实与智能硬件(中英文), 2021, 3 (01) : 43 - 54
  • [7] Decoupled Feature and Self-Knowledge Distillation for Speech Emotion Recognition
    Yu, Haixiang
    Ning, Yuan
    IEEE ACCESS, 2025, 13 : 33275 - 33285
  • [8] Sparse Autoencoder-based Feature Transfer Learning for Speech Emotion Recognition
    Deng, Jun
    Zhang, Zixing
    Marchi, Erik
    Schuller, Bjoern
    2013 HUMAINE ASSOCIATION CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2013, : 511 - 516
  • [9] Self-labeling sexual harassment
    Magley, VJ
    Shupe, EI
    SEX ROLES, 2005, 53 (3-4) : 173 - 189
  • [10] Self-Labeling Sexual Harassment
    Vicki J. Magley
    Ellen I. Shupe
    Sex Roles, 2005, 53 : 173 - 189