Improving Noise Robustness of Speech Emotion Recognition System

被引:11
|
作者
Juszkiewicz, Lukasz [1 ]
机构
[1] Wroclaw Univ Technol, Inst Comp Engn Control & Robot, PL-50370 Wroclaw, Poland
来源
关键词
D O I
10.1007/978-3-319-01571-2_27
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper method of improving noise robustness of speech emotion recognition system is proposed. Such a system has been developed for use in a social robot, but its performance is highly degraded by environmental noise. To alleviate this problem, the histogram equalisation is proposed to reduce the difference between feature vectors in clean and noisy conditions. In training phase of the system the averaged histograms of pitch and MFCC are computed and then serve as reference for equalisation. System performance was evaluated using Database of Polish Emotional Speech, which was split into training and test sets. Test sets were noised with 3 different noise samples. Presented preliminary results show a significant improvement of recognition accuracy in noisy environment conditions.
引用
收藏
页码:223 / 232
页数:10
相关论文
共 50 条
  • [1] Improving Speech Emotion Recognition System for a Social Robot with Speaker Recognition
    Juszkiewicz, Lukasz
    [J]. 2014 19TH INTERNATIONAL CONFERENCE ON METHODS AND MODELS IN AUTOMATION AND ROBOTICS (MMAR), 2014, : 921 - 925
  • [2] Noise and speaker robustness in a Persian continuous speech recognition system
    Veisi, Hadi
    Sameti, Hossein
    [J]. 2007 9TH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, VOLS 1-3, 2007, : 73 - 76
  • [3] Toward noise robustness speech recognition
    Namarvar, HH
    Liaw, J
    Berger, TW
    [J]. 2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 4016 - 4016
  • [4] Head Fusion: Improving the Accuracy and Robustness of Speech Emotion Recognition on the IEMOCAP and RAVDESS Dataset
    Xu, Mingke
    Zhang, Fan
    Zhang, Wei
    [J]. IEEE ACCESS, 2021, 9 : 74539 - 74549
  • [5] Improving speech detection robustness for wireless speech recognition
    Karray, L
    Mauuary, L
    [J]. 1997 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING, PROCEEDINGS, 1997, : 428 - 435
  • [6] Adding Noise to Improve Noise Robustness in Speech Recognition
    Morales, Nicolas
    Gu, Liang
    Gao, Yuqing
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 861 - +
  • [7] Investigation of noise-reverberation-robustness of modulation spectral features for speech-emotion recognition
    Guo, Taiyang
    Li, Sixia
    Unoki, Masashi
    Okada, Shogo
    [J]. PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 39 - 46
  • [8] On the application of variable-step adaptive noise cancelling for improving the robustness of speech recognition
    Yang Jie
    Wang Zhenli
    [J]. 2009 ISECS INTERNATIONAL COLLOQUIUM ON COMPUTING, COMMUNICATION, CONTROL, AND MANAGEMENT, VOL II, 2009, : 419 - +
  • [9] Improving Robustness to Compressed Speech in Speaker Recognition
    McLaren, Mitchell
    Abrash, Victor
    Graciarena, Martin
    Lei, Yun
    Pesan, Jan
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3665 - 3669
  • [10] Improving Speech Emotion Recognition System Using Spectral and Prosodic Features
    Chakhtouna, Adil
    Sekkate, Sara
    Adib, Abdellah
    [J]. INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, ISDA 2021, 2022, 418 : 399 - 409