Relevance units machine based dimensional and continuous speech emotion prediction

被引:5
|
作者
Wang, Fengna [1 ]
Sahli, Hichem [1 ,2 ]
Gao, Junbin [3 ]
Jiang, Dongmei [4 ]
Verhelst, Werner [1 ,5 ]
机构
[1] Univ Brussel, Dept Elect & Informat ETRO, VUB NPU Joint AVSP Lab, B-1050 Brussels, Belgium
[2] Interuniv Microelect Ctr IMEC, Leuven, Belgium
[3] Charles Sturt Univ, Sch Comp & Math, Bathurst, NSW 2795, Australia
[4] Northwestern Polytech Univ, Sch Comp Sci, VUB NPU Joint AVSP Lab, Xian 710072, Peoples R China
[5] iMinds, Gaston Crommenlaan 8, B-9050 Ghent, Belgium
关键词
Relevance units machine; Continuous speech emotion regression; Dimensional emotion modeling; RECOGNITION;
D O I
10.1007/s11042-014-2319-1
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Emotion plays a significant role in human-computer interaction. The continuing improvements in speech technology have led to many new and fascinating applications in human-computer interaction, context aware computing and computer mediated communication. Such applications require reliable online recognition of the user's affect. However most emotion recognition systems are based on speech via an isolated short sentence or word. We present a framework for online emotion recognition from speech. On the front-end, a voice activity detection algorithm is used to segment the input speech, and features are estimated to model long-term properties. Then, dimensional and continuous emotion recognition is performed via a Relevance Units Machine (RUM). The advantages of the proposed system are: (i) its computational efficiency in run-time (regression outputs can be produced continuously in pseudo real-time), (ii) RUM offers superior sparsity to the well-known Support Vector Regression (SVR) and Relevance Vector Machine for regression (RVR), and (iii) RUM's predictive performance is comparable to SVR and RVR.
引用
收藏
页码:9983 / 10000
页数:18
相关论文
共 50 条
  • [21] Dimensional Emotion Prediction based on Interactive Context in Conversation
    Shi, Xiaohan
    Li, Sixia
    Dang, Jianwu
    INTERSPEECH 2020, 2020, : 4193 - 4197
  • [22] RECONSTRUCTION-ERROR-BASED LEARNING FOR CONTINUOUS EMOTION RECOGNITION IN SPEECH
    Han, Jing
    Zhang, Zixing
    Ringeval, Fabien
    Schuller, Bjoern
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2367 - 2371
  • [23] Discovering Functional Units in Continuous Speech
    Lim, Sung-Joo
    Lacerda, Francisco
    Holt, Lori L.
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 2015, 41 (04) : 1139 - 1152
  • [24] Speech Emotion Recognition with Emotion-Pair based Framework Considering Emotion Distribution Information in Dimensional Emotion Space
    Ma, Xi
    Wu, Zhiyong
    Jia, Jia
    Xu, Mingxing
    Meng, Helen
    Cai, Lianhong
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1238 - 1242
  • [25] SER: Speech Emotion Recognition Application Based on Extreme Learning Machine
    Ainurrochman
    Febriansyah, Irfanur Ilham
    Yuhana, Umi Laili
    PROCEEDINGS OF 2021 13TH INTERNATIONAL CONFERENCE ON INFORMATION & COMMUNICATION TECHNOLOGY AND SYSTEM (ICTS), 2021, : 179 - 183
  • [26] A Review on Emotion Based Harmful Speech Detection Using Machine Learning
    Tyagi, Suryakant
    Varkonyi, Annamaria R.
    Marta, Takacs
    Szenasi, Sandor
    2022 IEEE 22ND INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND INFORMATICS AND 8TH IEEE INTERNATIONAL CONFERENCE ON RECENT ACHIEVEMENTS IN MECHATRONICS, AUTOMATION, COMPUTER SCIENCE AND ROBOTICS (CINTI-MACRO), 2022, : 17 - 23
  • [27] Speech emotion recognition method in educational scene based on machine learning
    Zhang, Yanning
    Srivastava, Gautam
    EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2022, 9 (05)
  • [28] Recognizing Speech Emotion Based on Acoustic Features Using Machine Learning
    Nasim, Md Abu Saleh
    Chowdory, Md Rakibul Hassan
    Dey, Ashim
    Das, Annesha
    13TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS 2021), 2021, : 95 - +
  • [29] Factor Analysis Based Speaker Normalisation for Continuous Emotion Prediction
    Dang, Ting
    Sethu, Vidhyasaharan
    Ambikairajah, Eliathamby
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 913 - 917
  • [30] Dimensional Speech Emotion Recognition Review
    Li H.-F.
    Chen J.
    Ma L.
    Bo H.-J.
    Xu C.
    Li H.-W.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (08): : 2465 - 2491