A new approach of audio emotion recognition

被引:84
|
作者
Ooi, Chien Shing [1 ]
Seng, Kah Phooi [2 ]
Ang, Li-Minn [2 ]
Chew, Li Wern [3 ]
机构
[1] Sunway Univ, Dept Comp Sci & Networked Syst, Petaling Jaya 46150, Malaysia
[2] Edith Cowan Univ, Sch Engn, Joondalup, WA 6027, Australia
[3] Intel Microelect M Sdn Bhd, George Town 11900, Malaysia
关键词
Audio emotion recognition; RBF neural network; Prosodic features; MFCC feature; EXPRESSION;
D O I
10.1016/j.eswa.2014.03.026
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A new architecture of intelligent audio emotion recognition is proposed in this paper. It fully utilizes both prosodic and spectral features in its design. It has two main paths in parallel and can recognize 6 emotions. Path 1 is designed based on intensive analysis of different prosodic features. Significant prosodic features are identified to differentiate emotions. Path 2 is designed based on research analysis on spectral features. Extraction of Mel-Frequency Cepstral Coefficient (MFCC) feature is then followed by Bi-directional Principle Component Analysis (BDPCA), Linear Discriminant Analysis (LDA) and Radial Basis Function (RBF) neural classification. This path has 3 parallel BDPCA + LDA + RBF sub-paths structure and each handles two emotions. Fusion modules are also proposed for weights assignment and decision making. The performance of the proposed architecture is evaluated on eNTERFACE'05 and RML databases. Simulation results and comparison have revealed good performance of the proposed recognizer. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:5858 / 5869
页数:12
相关论文
共 50 条
  • [1] Audio-visual based emotion recognition - A new approach
    Song, ML
    Bu, JJ
    Chen, C
    Li, N
    [J]. PROCEEDINGS OF THE 2004 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, 2004, : 1020 - 1025
  • [2] Audio Segmentation based Approach for Improved Emotion Recognition
    Pandharipande, Meghna Abhishek
    Kopparapu, Sunil Kumar
    [J]. TENCON 2015 - 2015 IEEE REGION 10 CONFERENCE, 2015,
  • [3] Using the Bag-of-Audio-Words approach for emotion recognition
    Vetrab, Mercedes
    Gosztolya, Gabor
    [J]. ACTA UNIVERSITATIS SAPIENTIAE INFORMATICA, 2022, 14 (01) : 1 - 21
  • [4] Emotion Recognition in Audio Records
    Pavaloi, Ioan
    Musca, Elena
    Rotaru, Florin
    [J]. 2013 INTERNATIONAL SYMPOSIUM ON SIGNALS, CIRCUITS AND SYSTEMS (ISSCS), 2013,
  • [5] Emotion Recognition and Emotion Based Classification of Audio using Genetic Algorithm - An Optimized Approach
    Bargaje, Mahesh
    [J]. 2015 INTERNATIONAL CONFERENCE ON INDUSTRIAL INSTRUMENTATION AND CONTROL (ICIC), 2015, : 562 - 567
  • [6] A multimodal hierarchical approach to speech emotion recognition from audio and text
    Singh, Prabhav
    Srivastava, Ridam
    Rana, K. P. S.
    Kumar, Vineet
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 229
  • [7] The CASIA Audio Emotion Recognition Method for Audio/Visual Emotion Challenge 2011
    Pan, Shifeng
    Tao, Jianhua
    Li, Ya
    [J]. AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PT II, 2011, 6975 : 388 - 395
  • [8] Speech Emotion Recognition Using Audio Matching
    Chaturvedi, Iti
    Noel, Tim
    Satapathy, Ranjan
    [J]. ELECTRONICS, 2022, 11 (23)
  • [9] Study on CNN in the Recognition of Emotion in Audio and Images
    Zhang, Bin
    Quan, Changqin
    Ren, Fuji
    [J]. 2016 IEEE/ACIS 15TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE (ICIS), 2016, : 319 - 323
  • [10] Deep Learning and Audio Based Emotion Recognition
    Demir, Asli
    Atila, Orhan
    Sengur, Abdulkadir
    [J]. 2019 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP 2019), 2019,