Characterization and Classification of Speech Emotion with Spectrograms

被引:0
|
作者
Palo, Hemanta Kumar [1 ]
Sagar, Sangeet [2 ]
机构
[1] Siksha O Anusandhan, Inst Tech Educ & Res, Dept Elect & Commun Engn, Bhubaneswar, Odisha, India
[2] LNM Inst Informat Technol, Dept Elect & Commun Engn, Jaipur, Rajasthan, India
关键词
Speech emotion; Spectrogram; Characterization; Classification; Multilayer Perceptron;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The work attempts to characterize and classify speech emotions using the spectrogram. Initially, it extracts the individual Red, Green, and Blue parameters from the raw speech spectrogram image of every individual emotional utterance. Further, it computes the statistical parameters of individual RGB components to characterize the chosen emotional states. The utterances of anger, happiness, neutral, and sad emotional states from the standard Berlin (EMO-DB) database has been used for this purpose. The individual statistical R, G, and B spectrogram parameters are found to be different within an emotion as well as across emotional states. Thus, these values have been used as different feature sets to classify the designated emotional states using the popular Multilayer Perceptron Neural Network (MLPNN).
引用
收藏
页码:309 / 313
页数:5
相关论文
共 50 条
  • [1] Emotion recognition from speech using deep learning on spectrograms
    Li, Xingguang
    Song, Wenjun
    Liang, Zonglin
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (03) : 2791 - 2796
  • [2] Speech Emotion Recognition from Spectrograms with Deep Convolutional Neural Network
    Badshah, Abdul Malik
    Ahmad, Jamil
    Rahim, Nasir
    Baik, Sung Wook
    [J]. 2017 INTERNATIONAL CONFERENCE ON PLATFORM TECHNOLOGY AND SERVICE (PLATCON), 2017, : 125 - 129
  • [3] Efficient Emotion Recognition from Speech Using Deep Learning on Spectrograms
    Satt, Aharon
    Rozenberg, Shai
    Hoory, Ron
    [J]. 18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1089 - 1093
  • [4] Speech Emotion Recognition using Convolutional Recurrent Neural Networks and Spectrograms
    Qamhan, Mustafa A.
    Meftah, Ali H.
    Selouani, Sid-Ahmed
    Alotaibi, Yousef A.
    Zakariah, Mohammed
    Seddiq, Yasser Mohammad
    [J]. 2020 IEEE CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING (CCECE), 2020,
  • [5] Emotion Recognition from Speech using Spectrograms and Shallow Neural Networks
    Slimi, Anwer
    Hamroun, Mohamed
    Zrigui, Mounir
    Nicolas, Henri
    [J]. MOMM 2020: THE 18TH INTERNATIONAL CONFERENCE ON ADVANCES IN MOBILE COMPUTING & MULTIMEDIA, 2020, : 35 - 39
  • [6] Emotion Profile Refinery for Speech Emotion Classification
    Mao, Shuiyang
    Ching, P. C.
    Lee, Tan
    [J]. INTERSPEECH 2020, 2020, : 531 - 535
  • [7] Speech based emotion classification
    Nwe, TL
    Wei, FS
    De Silva, LC
    [J]. IEEE REGION 10 INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONIC TECHNOLOGY, VOLS 1 AND 2, 2001, : 297 - 301
  • [8] Leveraged Mel Spectrograms Using Harmonic and Percussive Components in Speech Emotion Recognition
    Rudd, David Hason
    Huo, Huan
    Xu, Guandong
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2022, PT II, 2022, 13281 : 392 - 404
  • [9] DYNAMIC SPECTROGRAMS OF SPEECH
    KOCK, WE
    MILLER, RL
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1952, 24 (06): : 783 - 784
  • [10] Feature Analysis for Speech Emotion Classification
    Kingeski, R.
    Schueda, L. A. P.
    Paterno, A. S.
    [J]. XXVII BRAZILIAN CONGRESS ON BIOMEDICAL ENGINEERING, CBEB 2020, 2022, : 2359 - 2365