On the Impact of Children's Emotional Speech on Acoustic and Language Models

Authors
Stefan Steidl
Anton Batliner
Dino Seppi
Björn Schuller
Affiliations
[1] Friedrich-Alexander-Universität Erlangen-Nürnberg, Lehrstuhl für Mustererkennung
[2] Katholieke Universiteit Leuven, ESAT
[3] Technische Universität München, Institute for Human-Machine Communication
Keywords
Language Model; Automatic Speech Recognition; Acoustic Model; Baseline System; Emotional Speech
DOI
Not available
Abstract
The automatic recognition of children's speech is well known to be a challenge, and so is the influence of affect, which is believed to degrade the performance of a speech recogniser. In this contribution, we investigate the combination of both phenomena. Extensive test runs are carried out for 1k-vocabulary continuous speech recognition on spontaneous motherese, emphatic, and angry children's speech as opposed to neutral speech. The experiments address the question of how specific emotions influence word accuracy. In a first scenario, "emotional" speech recognisers are compared to a speech recogniser trained on neutral speech only. For this comparison, equal amounts of training data are used for each emotion-related state. In a second scenario, a "neutral" speech recogniser trained on large amounts of neutral speech is adapted by adding only some emotionally coloured data in the training process. The results show that emphatic and angry speech is recognised best, even better than neutral speech, and that the performance can be improved further by adaptation of the acoustic and linguistic models. In order to show the variability of emotional speech, we visualise the distribution of the four emotion-related states in the MFCC space by applying a Sammon transformation.