METHODS AND CHALLENGES FOR CREATING AN EMOTIONAL AUDIO-VISUAL DATABASE

被引:0
|
作者
Pandharipande, Meghna A. [1 ]
Chakraborty, Rupayan [1 ]
Kopparapu, Sunil Kumar [1 ]
机构
[1] TCS Innovat Labs Mumbai, Yantra Pk, Thane West 400601, India
关键词
emotion database; speech emotion; visual expression; acted; spontaneous; induced; SPEECH;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Emotion has a very important role in human communication and can be expressed either verbally through speech (e.g. pitch, intonation, prosody etc), or by facial expressions, gestures etc. Most of the contemporary human-computer interaction are deficient in interpreting these information and hence suffers from lack of emotional intelligence. In other words, these systems are unable to identify human's emotional state and hence is not able to react properly. To overcome these inabilities, machines are required to be trained using annotated emotional data samples. Motivated from this fact, here we have attempted to collect and create an audio-visual emotional corpus. Audio-visual signals of multiple subjects were recorded when they were asked to watch either presentation (having background music) or emotional video clips. Post recording subjects were asked to express how they felt, and to read out sentences that appeared on the screen. Self annotation from the subject itself, as well as annotation from others have also been carried out to annotate the recorded data.
引用
收藏
页码:183 / 188
页数:6
相关论文
共 50 条
  • [1] A Turkish Audio-Visual Emotional Database
    Onder, Onur
    Zhalehpour, Sara
    Erdem, Cigdem Eroglu
    [J]. 2013 21ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2013,
  • [2] CHEAVD: a Chinese natural emotional audio-visual database
    Li, Ya
    Tao, Jianhua
    Chao, Linlin
    Bao, Wei
    Liu, Yazhu
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2017, 8 (06) : 913 - 924
  • [3] BUILDING A CHINESE NATURAL EMOTIONAL AUDIO-VISUAL DATABASE
    Bao, Wei
    Li, Ya
    Gu, Mingliang
    Yang, Minghao
    Li, Hao
    Chao, Linlin
    Tao, Jianhua
    [J]. 2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 583 - 587
  • [4] The Dysarthric Expressed Emotional Database (DEED): An audio-visual database in British English
    Alhinti, Lubna
    Cunningham, Stuart
    Christensen, Heidi
    [J]. PLOS ONE, 2023, 18 (08):
  • [5] An audio-visual speech recognition with a new mandarin audio-visual database
    Liao, Wen-Yuan
    Pao, Tsang-Long
    Chen, Yu-Te
    Chang, Tsun-Wei
    [J]. INT CONF ON CYBERNETICS AND INFORMATION TECHNOLOGIES, SYSTEMS AND APPLICATIONS/INT CONF ON COMPUTING, COMMUNICATIONS AND CONTROL TECHNOLOGIES, VOL 1, 2007, : 19 - +
  • [6] Audio-Visual Twins Database
    Li, Jing
    Zhang, Li
    Guo, Dong
    Zhuo, Shaojie
    Sim, Terence
    [J]. 2015 INTERNATIONAL CONFERENCE ON BIOMETRICS (ICB), 2015, : 493 - 500
  • [7] THE VERA AM MITTAG GERMAN AUDIO-VISUAL EMOTIONAL SPEECH DATABASE
    Grimm, Michael
    Kroschel, Kristian
    Narayanan, Shrikanth
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-4, 2008, : 865 - +
  • [8] SUTAV: A Turkish Audio-Visual Database
    Topkaya, Ibrahim Saygin
    Erdogan, Hakan
    [J]. LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 2334 - 2337
  • [9] AUDIO-VISUAL METHODS IN TEACHING
    Allen, William H.
    [J]. QUARTERLY JOURNAL OF SPEECH, 1955, 41 (01) : 90 - 90
  • [10] Audio-Visual Methods in Teaching
    Hart, William G.
    [J]. EDUCATIONAL RESEARCH BULLETIN, 1954, 33 (06): : 162 - 163