The Dysarthric Expressed Emotional Database (DEED): An audio-visual database in British English

被引:1
|
作者
Alhinti, Lubna [1 ,4 ]
Cunningham, Stuart [2 ,3 ]
Christensen, Heidi [1 ,3 ]
机构
[1] Univ Sheffield, Dept Comp Sci, Sheffield, England
[2] Univ Sheffield, Hlth Sci Sch, Sheffield, England
[3] Ctr Assist Technol & Connected Healthcare CATCH, Sheffield, England
[4] Saudi Co Artificial Intelligence SCAI, Riyadh, Saudi Arabia
来源
PLOS ONE | 2023年 / 18卷 / 08期
关键词
FACIAL EXPRESSIONS; PARKINSONS-DISEASE; ACOUSTIC CHARACTERISTICS; VOCAL COMMUNICATION; SPEECH; VOICE; RECOGNITION; SPEAKERS; CUES; JUDGMENTS;
D O I
10.1371/journal.pone.0287971
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The Dysarthric Expressed Emotional Database (DEED) is a novel, parallel multimodal (audio-visual) database of dysarthric and typical emotional speech in British English which is a first of its kind. It is an induced (elicited) emotional database that includes speech recorded in the six basic emotions: "happiness", "sadness", "anger", "surprise", "fear", and "disgust". A "neutral" state has also been recorded as a baseline condition. The dysarthric speech part includes recordings from 4 speakers: one female speaker with dysarthria due to cerebral palsy and 3 speakers with dysarthria due to Parkinson's disease (2 female and 1 male). The typical speech part includes recordings from 21 typical speakers (9 female and 12 male). This paper describes the collection of the database, covering its design, development, technical information related to the data capture, and description of the data files and presents the validation methodology. The database was validated subjectively (human performance) and objectively (automatic recognition). The achieved results demonstrated that this database will be a valuable resource for understanding emotion communication by people with dysarthria and useful in the research field of dysarthric emotion classification. The database is freely available for research purposes under a Creative Commons licence at:
引用
收藏
页数:31
相关论文
共 50 条
  • [1] A Turkish Audio-Visual Emotional Database
    Onder, Onur
    Zhalehpour, Sara
    Erdem, Cigdem Eroglu
    [J]. 2013 21ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2013,
  • [2] METHODS AND CHALLENGES FOR CREATING AN EMOTIONAL AUDIO-VISUAL DATABASE
    Pandharipande, Meghna A.
    Chakraborty, Rupayan
    Kopparapu, Sunil Kumar
    [J]. 2017 20TH CONFERENCE OF THE ORIENTAL CHAPTER OF THE INTERNATIONAL COORDINATING COMMITTEE ON SPEECH DATABASES AND SPEECH I/O SYSTEMS AND ASSESSMENT (O-COCOSDA), 2017, : 183 - 188
  • [3] CHEAVD: a Chinese natural emotional audio-visual database
    Li, Ya
    Tao, Jianhua
    Chao, Linlin
    Bao, Wei
    Liu, Yazhu
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2017, 8 (06) : 913 - 924
  • [4] BUILDING A CHINESE NATURAL EMOTIONAL AUDIO-VISUAL DATABASE
    Bao, Wei
    Li, Ya
    Gu, Mingliang
    Yang, Minghao
    Li, Hao
    Chao, Linlin
    Tao, Jianhua
    [J]. 2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 583 - 587
  • [5] Audio-Visual Twins Database
    Li, Jing
    Zhang, Li
    Guo, Dong
    Zhuo, Shaojie
    Sim, Terence
    [J]. 2015 INTERNATIONAL CONFERENCE ON BIOMETRICS (ICB), 2015, : 493 - 500
  • [6] An audio-visual speech recognition with a new mandarin audio-visual database
    Liao, Wen-Yuan
    Pao, Tsang-Long
    Chen, Yu-Te
    Chang, Tsun-Wei
    [J]. INT CONF ON CYBERNETICS AND INFORMATION TECHNOLOGIES, SYSTEMS AND APPLICATIONS/INT CONF ON COMPUTING, COMMUNICATIONS AND CONTROL TECHNOLOGIES, VOL 1, 2007, : 19 - +
  • [7] THE VERA AM MITTAG GERMAN AUDIO-VISUAL EMOTIONAL SPEECH DATABASE
    Grimm, Michael
    Kroschel, Kristian
    Narayanan, Shrikanth
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-4, 2008, : 865 - +
  • [8] SUTAV: A Turkish Audio-Visual Database
    Topkaya, Ibrahim Saygin
    Erdogan, Hakan
    [J]. LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 2334 - 2337
  • [9] Searching Audio-Visual Clips for Dual-mode Chinese Emotional Speech Database
    Zhang, Xudong
    Wu, Guoqing
    Ren, Fuji
    [J]. 2018 FIRST ASIAN CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII ASIA), 2018,
  • [10] Development of an audio-visual database system for human identification
    Bargale, CB
    Chaudhuri, S
    Bhattacharyya, P
    [J]. AUDIO- AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, 1997, 1206 : 345 - 352