Relevance units machine based dimensional and continuous speech emotion prediction

被引：5

作者：

Wang, Fengna ^{[1
]}

Sahli, Hichem ^{[1
,2
]}

Gao, Junbin ^{[3
]}

Jiang, Dongmei ^{[4
]}

Verhelst, Werner ^{[1
,5
]}

机构：

[1] Univ Brussel, Dept Elect & Informat ETRO, VUB NPU Joint AVSP Lab, B-1050 Brussels, Belgium

[2] Interuniv Microelect Ctr IMEC, Leuven, Belgium

[3] Charles Sturt Univ, Sch Comp & Math, Bathurst, NSW 2795, Australia

[4] Northwestern Polytech Univ, Sch Comp Sci, VUB NPU Joint AVSP Lab, Xian 710072, Peoples R China

[5] iMinds, Gaston Crommenlaan 8, B-9050 Ghent, Belgium

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2015年 / 74卷 / 22期

关键词：

Relevance units machine; Continuous speech emotion regression; Dimensional emotion modeling; RECOGNITION;

D O I：

10.1007/s11042-014-2319-1

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Emotion plays a significant role in human-computer interaction. The continuing improvements in speech technology have led to many new and fascinating applications in human-computer interaction, context aware computing and computer mediated communication. Such applications require reliable online recognition of the user's affect. However most emotion recognition systems are based on speech via an isolated short sentence or word. We present a framework for online emotion recognition from speech. On the front-end, a voice activity detection algorithm is used to segment the input speech, and features are estimated to model long-term properties. Then, dimensional and continuous emotion recognition is performed via a Relevance Units Machine (RUM). The advantages of the proposed system are: (i) its computational efficiency in run-time (regression outputs can be produced continuously in pseudo real-time), (ii) RUM offers superior sparsity to the well-known Support Vector Regression (SVR) and Relevance Vector Machine for regression (RVR), and (iii) RUM's predictive performance is comparable to SVR and RVR.

引用

页码：9983 / 10000

页数：18

共 50 条

[21] Dimensional Emotion Prediction based on Interactive Context in Conversation
Shi, Xiaohan
Li, Sixia
Dang, Jianwu
INTERSPEECH 2020, 2020, : 4193 - 4197
[22] RECONSTRUCTION-ERROR-BASED LEARNING FOR CONTINUOUS EMOTION RECOGNITION IN SPEECH
Han, Jing
Zhang, Zixing
Ringeval, Fabien
Schuller, Bjoern
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 2367 - 2371
[23] Discovering Functional Units in Continuous Speech
Lim, Sung-Joo
Lacerda, Francisco
Holt, Lori L.
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 2015, 41 (04) : 1139 - 1152
[24] Speech Emotion Recognition with Emotion-Pair based Framework Considering Emotion Distribution Information in Dimensional Emotion Space
Ma, Xi
Wu, Zhiyong
Jia, Jia
Xu, Mingxing
Meng, Helen
Cai, Lianhong
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1238 - 1242
[25] SER: Speech Emotion Recognition Application Based on Extreme Learning Machine
Ainurrochman
Febriansyah, Irfanur Ilham
Yuhana, Umi Laili
PROCEEDINGS OF 2021 13TH INTERNATIONAL CONFERENCE ON INFORMATION & COMMUNICATION TECHNOLOGY AND SYSTEM (ICTS), 2021, : 179 - 183
[26] A Review on Emotion Based Harmful Speech Detection Using Machine Learning
Tyagi, Suryakant
Varkonyi, Annamaria R.
Marta, Takacs
Szenasi, Sandor
2022 IEEE 22ND INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND INFORMATICS AND 8TH IEEE INTERNATIONAL CONFERENCE ON RECENT ACHIEVEMENTS IN MECHATRONICS, AUTOMATION, COMPUTER SCIENCE AND ROBOTICS (CINTI-MACRO), 2022, : 17 - 23
[27] Speech emotion recognition method in educational scene based on machine learning
Zhang, Yanning
Srivastava, Gautam
EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2022, 9 (05)
[28] Recognizing Speech Emotion Based on Acoustic Features Using Machine Learning
Nasim, Md Abu Saleh
Chowdory, Md Rakibul Hassan
Dey, Ashim
Das, Annesha
13TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND INFORMATION SYSTEMS (ICACSIS 2021), 2021, : 95 - +
[29] Factor Analysis Based Speaker Normalisation for Continuous Emotion Prediction
Dang, Ting
Sethu, Vidhyasaharan
Ambikairajah, Eliathamby
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 913 - 917
[30] Dimensional Speech Emotion Recognition Review
Li H.-F.
Chen J.
Ma L.
Bo H.-J.
Xu C.
Li H.-W.
Ruan Jian Xue Bao/Journal of Software, 2020, 31 (08): : 2465 - 2491

← 1 2 3 4 5 →