Emotion Recognition Using Prosodic and Spectral Features of Speech and Naive Bayes Classifier

被引:0
|
作者
Khan, Atreyee [1 ]
Roy, Uttam Kumar [1 ]
机构
[1] Jadavpur Univ, Dept Informat Technol, Kolkata, India
关键词
Pitch; MFCC; Cepstrum; Naive Bayes; Emo-db;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Emotion plays an important role in human communication as we can convey our feelings through it. In recent years this emotion detection from speech has become one of the major and challenging research areas in speech processing domain. Extracting suitable features from speech that influence emotions and the categorization of several emotional classes make this task more challenging. This paper proposes emotion detection using both prosodic and spectral features of speech using Naive Bayes Classifier. Although many works consider only spectral features or prosodic features, this paper considers both. We have used pitch and Mel-Frequency Cepstral Coefficients (MFCC) as the prosodic and spectral features respectively. The classification is performed using Naive Bayes Classifier. We have developed both gender dependent and gender independent system considering seven emotional classes. After performing the classification the accuracy of the system is tested using speech samples of popular speech database Berlin Emo-db and the results are considerably better than many existing systems.
引用
收藏
页码:1017 / 1021
页数:5
相关论文
共 50 条
  • [1] Emotion recognition using a Cauchy naive Bayes classifier
    Sebe, N
    Lew, MS
    Cohen, I
    Garg, A
    Huang, TS
    [J]. 16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL I, PROCEEDINGS, 2002, : 17 - 20
  • [2] Improving Speech Emotion Recognition System Using Spectral and Prosodic Features
    Chakhtouna, Adil
    Sekkate, Sara
    Adib, Abdellah
    [J]. INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, ISDA 2021, 2022, 418 : 399 - 409
  • [3] Egyptian Arabic speech emotion recognition using prosodic, spectral and wavelet features
    Abdel-Hamid, Lamiaa
    [J]. SPEECH COMMUNICATION, 2020, 122 : 19 - 30
  • [4] Hierarchical emotion recognition from speech using source, power spectral and prosodic features
    Arijul Haque
    K. Sreenivasa Rao
    [J]. Multimedia Tools and Applications, 2024, 83 : 19629 - 19661
  • [5] Hierarchical emotion recognition from speech using source, power spectral and prosodic features
    Haque, Arijul
    Rao, K. Sreenivasa
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (07) : 19629 - 19661
  • [6] PERFORMANCE ANALYSIS OF SPECTRAL AND PROSODIC FEATURES AND THEIR FUSION FOR EMOTION RECOGNITION IN SPEECH
    Gaurav, Manish
    [J]. 2008 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY: SLT 2008, PROCEEDINGS, 2008, : 313 - 316
  • [7] A Hybrid Speech Emotion Recognition System Based on Spectral and Prosodic Features
    Zhou, Yu
    Li, Junfeng
    Sun, Yanqing
    Zhang, Jianping
    Yan, Yonghong
    Akagi, Masato
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2010, E93D (10) : 2813 - 2821
  • [8] Emotion Recognition on The Basis of Audio Signal Using Naive Bayes Classifier
    Bhakre, Sagar K.
    Bang, Arti
    [J]. 2016 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2016, : 2363 - 2367
  • [9] Emotion Recognition from Speech using Prosodic and Linguistic Features
    Pervaiz, Mahwish
    Khan, Tamim Ahmed
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (08) : 84 - 90
  • [10] Emotion recognition from speech using global and local prosodic features
    Rao K.S.
    Koolagudi S.G.
    Vempada R.R.
    [J]. International Journal of Speech Technology, 2013, 16 (2) : 143 - 160