EmoChildRu: Emotional Child Russian Speech Corpus

Cited by: 20
Authors
Lyakso, Elena [1]
Frolova, Olga [1]
Dmitrieva, Evgeniya [1]
Grigorev, Aleksey [1]
Kaya, Heysem [2]
Salah, Albert Ali [2]
Karpov, Alexey [3, 4]
Affiliations
[1] St Petersburg State Univ, Child Speech Res Grp, St Petersburg 199034, Russia
[2] Bogazici Univ, Dept Comp Engn, Istanbul, Turkey
[3] RAS, St Petersburg Inst Informat & Automat, St Petersburg, Russia
[4] ITMO Univ, St Petersburg, Russia
Keywords
Emotional child speech; Perceptual analysis; Spectrographic analysis; Emotional states; Computational paralinguistics
DOI
10.1007/978-3-319-23132-7_18
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present the first child emotional speech corpus in Russian, called "EmoChildRu", which contains audio recordings of children aged 3 to 7 years. The database includes over 20K recordings (approx. 30 h) collected from 100 children. Recordings were made in three controlled settings designed to elicit different emotional states: playing with a standard set of toys; repeating words after a toy parrot in a game-store setting; and watching a cartoon and retelling the story. The corpus is designed for studying how emotional state is reflected in the characteristics of voice and speech, and for studying the formation of emotional states in ontogenesis. A portion of the corpus is annotated for three emotional states (discomfort, neutral, comfort). Additional data include brain activity measurements (original EEG and evoked potential records), the results of adult listeners' analysis of child speech, questionnaires, and descriptions of dialogues. The paper reports two child emotional speech analysis experiments on the corpus: one by adult listeners (humans) and one by an automatic classifier (machine). Automatic classification results are very similar to human perception, although the accuracy is below 55% for both, which shows the difficulty of recognizing child emotion from speech under naturalistic conditions.
Pages: 144-152
Number of pages: 9