METHODS AND CHALLENGES FOR CREATING AN EMOTIONAL AUDIO-VISUAL DATABASE

Cited by: 0

Authors:

Pandharipande, Meghna A. [1]

Chakraborty, Rupayan [1]

Kopparapu, Sunil Kumar [1]

Affiliations:

[1] TCS Innovat Labs Mumbai, Yantra Pk, Thane West 400601, India

Source:

2017 20TH CONFERENCE OF THE ORIENTAL CHAPTER OF THE INTERNATIONAL COORDINATING COMMITTEE ON SPEECH DATABASES AND SPEECH I/O SYSTEMS AND ASSESSMENT (O-COCOSDA) | 2017

Keywords:

emotion database; speech emotion; visual expression; acted; spontaneous; induced; SPEECH;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP39 [Computer applications];

Discipline classification code:

081203 ; 0835 ;

Abstract:

Emotion plays a very important role in human communication and can be expressed either verbally through speech (e.g., pitch, intonation, prosody) or through facial expressions, gestures, etc. Most contemporary human-computer interaction systems are deficient in interpreting this information and hence suffer from a lack of emotional intelligence. In other words, these systems are unable to identify a human's emotional state and hence are not able to react properly. To overcome these limitations, machines need to be trained using annotated emotional data samples. Motivated by this fact, we have attempted to collect and create an audio-visual emotional corpus. Audio-visual signals of multiple subjects were recorded while they watched either a presentation (with background music) or emotional video clips. Post-recording, subjects were asked to express how they felt and to read out sentences that appeared on the screen. The recorded data were annotated both by the subjects themselves (self-annotation) and by others.

Citation

Pages: 183-188

Page count: 6
