Emotional Speech Recognition Using Rhythm Metrics and a New Arabic Corpus

被引:0
|
作者
Meftah, Ali H. [1 ]
Qamhan, Mustafa [1 ]
Alotaibi, Yousef [1 ]
Selouani, Sid-Ahmed [2 ]
机构
[1] King Saud Univ, Coll Comp & Informat Sci, Riyadh, Saudi Arabia
[2] Univ Moncton, 218 Bvd JD Gauthier, Shippegan, NB E8S 1P6, Canada
关键词
Emotion; rhythm metrics; acoustic features; MLP; SVM;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This study aims to investigate the possible use of speech rhythm metrics as a new feature for speech emotion recognition, gender identification, and regional accent identification. Further, it aims to evaluate a new Arabic speech emotion corpus. The King Saud University Emotions (KSUEmotions) speech corpus contains five emotions: neutral, sadness, happiness, surprise, and anger. For this study, speech acoustic features are extracted and used to classify the speakers' emotions. All classification results were obtained using the multilayer perceptron (MLP) neural networks and support vector machine (SVM) classifiers. Results demonstrate that the rhythm metrics are not sufficient for speech emotion classification. Nevertheless, they can improve the classifier accuracy when combined with other speech acoustic features. These results also demonstrate that the average performance accuracy of the KSUEmotions Phase 1 is 54.07% and 84.14% for Phase 2 and that the emotion of sadness achieves the best emotions' classification accuracy.
引用
收藏
页码:57 / 62
页数:6
相关论文
共 50 条
  • [1] Investigating Arabic Speakers' Emotions Using Speech Rhythm Metrics
    Meftah, Ali H.
    Alotaibi, Yousef
    Selouani, Sid-Ahmed
    [J]. UKSIM-AMSS 11TH EUROPEAN MODELLING SYMPOSIUM ON COMPUTER MODELLING AND SIMULATION (EMS 2017), 2017, : 73 - 77
  • [2] Arabic corpus Implementation: Application to Speech Recognition
    Helali, Wafa
    Hajaiej, Zied
    Cherif, Adnane
    [J]. 2018 INTERNATIONAL CONFERENCE ON ADVANCED SYSTEMS AND ELECTRICAL TECHNOLOGIES (IC_ASET), 2017, : 50 - 53
  • [3] Designing, Building, and Analyzing an Arabic Speech Emotional Corpus
    Meftah, Ali
    Alotaibi, Yousef
    Selouani, Sid-Ahmed
    [J]. LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014,
  • [4] Arabic Speech Rhythm Corpus: Read and Spontaneous Speaking Styles
    Ibrahim, Omnia
    Asadi, Homa
    Kassem, Eman
    Dellwo, Volker
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 5337 - 5342
  • [5] Speech Recognition System of Arabic Alphabet Based on a Telephony Arabic Corpus
    Alotaibi, Yousef Ajami
    Alghamdi, Mansour
    Alotaiby, Fabad
    [J]. IMAGE AND SIGNAL PROCESSING, PROCEEDINGS, 2010, 6134 : 122 - +
  • [6] Acoustic Model Adaptation for Emotional Speech Recognition Using Twitter-Based Emotional Speech Corpus
    Kosaka, Tetsuo
    Aizawa, Yoshitaka
    Kato, Masaharu
    Nose, Takashi
    [J]. 2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 1747 - 1751
  • [7] A Corpus and Phonetic Dictionary for Tunisian Arabic Speech Recognition
    Masmoudi, Abir
    Khemakhem, Mariem Ellouze
    Esteve, Yannick
    Belguith, Lamia Hadrich
    Habash, Nizar
    [J]. LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014,
  • [8] A Cross-Corpus Recognition of Emotional Speech
    Xiao, Zhongzhe
    Wu, Di
    Zhang, Xiaojun
    Tao, Zhi
    [J]. PROCEEDINGS OF 2016 9TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2016, : 42 - 46
  • [9] Arabic Speech Emotion Recognition From Saudi Dialect Corpus
    Aljuhani, Reem Hamed
    Alshutayri, Areej
    Alahdal, Shahd
    [J]. IEEE ACCESS, 2021, 9 : 127081 - 127085
  • [10] ALGERIAN ARABIC SPEECH DATABASE (ALGASD): CORPUS DESIGN AND AUTOMATIC SPEECH RECOGNITION APPLICATION
    Droua-Hamdani, Ghania
    Selouani, Sid Ahmed
    Boudraa, Malika
    [J]. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2010, 35 (2C): : 157 - 166