Children's Emotion Recognition from Spontaneous Speech Using a Reduced Set of Acoustic and Linguistic Features

被引:9
|
作者
Planet, Santiago [1 ]
Iriondo, Ignasi [1 ]
机构
[1] Univ Ramon Llull, Barcelona 08022, Spain
关键词
Emotion recognition; Spontaneous speech; Acoustic and linguistic features; Feature selection; Feature-level fusion; Speaker-independent;
D O I
10.1007/s12559-012-9174-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The aim of this article is to classify children's affective states in a real-life non-prototypical emotion recognition scenario. The framework is the same as that proposed in the Interspeech 2009 Emotion Challenge. We used a large set of acoustic features and five linguistic parameters based on the concept of emotional salience. Features were extracted from the spontaneous speech recordings of the FAU Aibo Corpus and their transcriptions. We used a wrapper method to reduce the acoustic set of features from 384 to 28 elements and feature-level fusion to merge them with the set of linguistic parameters. We studied three classification approaches: a Naive-Bayes classifier, a support vector machine and a logistic model tree. Results show that the linguistic features improve the performances of the classifiers that use only acoustic datasets. Additionally, merging the linguistic features with the reduced acoustic set is more effective than working with the full dataset. The best classifier performance is achieved with the logistic model tree and the reduced set of acoustic and linguistic features, which improves the performance obtained with the full dataset by 4.15 % absolute (10.14 % relative) and improves the performance of the Naive-Bayes classifier by 9.91 % absolute (28.18 % relative). For the same conditions proposed in the Emotion Challenge, this simple scheme slightly improves a much more complex structure involving seven classifiers and a larger number of features.
引用
收藏
页码:526 / 532
页数:7
相关论文
共 50 条
  • [1] Children’s Emotion Recognition from Spontaneous Speech Using a Reduced Set of Acoustic and Linguistic Features
    Santiago Planet
    Ignasi Iriondo
    [J]. Cognitive Computation, 2013, 5 : 526 - 532
  • [2] Emotion Classification in Children's Speech Using Fusion of Acoustic and Linguistic Features
    Polzehl, Tim
    Sundaram, Shiva
    Ketabdar, Hamed
    Wagner, Michael
    Metze, Florian
    [J]. INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 340 - +
  • [3] Emotion Recognition from Speech using Prosodic and Linguistic Features
    Pervaiz, Mahwish
    Khan, Tamim Ahmed
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (08) : 84 - 90
  • [4] Spontaneous Children's Emotion Recognition by Categorical Classification of Acoustic Features
    Planet, Santiago
    Iriondo, Ignasi
    [J]. SISTEMAS E TECNOLOGIAS DE INFORMACAO, VOL I, 2011, : 594 - +
  • [5] Emotion recognition from telephone speech using acoustic and nonlinear features
    Bedoya-Jaramillo, S.
    Orozco-Arroyave, J. R.
    Arias-Londono, J. D.
    Vargas-Bonilla, J. F.
    [J]. 2013 47TH INTERNATIONAL CARNAHAN CONFERENCE ON SECURITY TECHNOLOGY (ICCST), 2013,
  • [6] Comparison of machine learning algorithms and acoustic features in emotion recognition from spontaneous speech
    Iizuka, Takahisa
    Mori, Hiroki
    [J]. ACOUSTICAL SCIENCE AND TECHNOLOGY, 2022, 43 (04) : 228 - 231
  • [7] Speech Emotion Recognition by Late Fusion of Linguistic and Acoustic Features using Deep Learning Models
    Sato, Kiyohide
    Kishi, Keita
    Kosaka, Tetsuo
    [J]. 2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 1013 - 1018
  • [8] Fusion of Acoustic and Linguistic Speech Features for Emotion Detection
    Metze, Florian
    Polzehl, Tim
    Wagner, Michael
    [J]. 2009 IEEE THIRD INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2009), 2009, : 153 - +
  • [9] Novel acoustic features for speech emotion recognition
    Yong-Wan Roh
    Dong-Ju Kim
    Woo-Seok Lee
    Kwang-Seok Hong
    [J]. Science in China Series E: Technological Sciences, 2009, 52 : 1838 - 1848
  • [10] Novel acoustic features for speech emotion recognition
    ROH Yong-Wan
    KIM Dong-Ju
    LEE Woo-Seok
    HONG Kwang-Seok
    [J]. Science China Technological Sciences, 2009, 52 (07) : 1838 - 1848