Voice Pathology Detection Using a Two-Level Classifier Based on Combined CNN-RNN Architecture

被引:9
|
作者
Ksibi, Amel [1 ]
Hakami, Nada Ali [2 ]
Alturki, Nazik [1 ]
Asiri, Mashael M. M. [3 ]
Zakariah, Mohammed [4 ]
Ayadi, Manel [1 ]
机构
[1] Princess Nourah bint Abdulrahman Univ, Coll Comp & Informat Sci, Dept Informat Syst, Riyadh 11671, Saudi Arabia
[2] Jazan Univ, Coll Comp Sci & Informat Technol, Comp Sci Dept, Jazan 45142, Saudi Arabia
[3] King Khalid Univ, Coll Sci & Art Mahayil, Dept Comp Sci, Abha 62529, Saudi Arabia
[4] King Saud Univ, Coll Comp & Informat Sci, Riyadh 11543, Saudi Arabia
关键词
recurrent neural networks (RNNs); deep learning; audio feature extraction; Mel-frequency cepstral coefficients; SIGNAL; DISORDERS; FEATURES; MACHINE;
D O I
10.3390/su15043204
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
The construction of an automatic voice pathology detection system employing machine learning algorithms to study voice abnormalities is crucial for the early detection of voice pathologies and identifying the specific type of pathology from which patients suffer. This paper's primary objective is to construct a deep learning model for accurate speech pathology identification. Manual audio feature extraction was employed as a foundation for the categorization process. Incorporating an additional piece of information, i.e., voice gender, via a two-level classifier model was the most critical aspect of this work. The first level determines whether the audio input is a male or female voice, and the second level determines whether the agent is pathological or healthy. Similar to the bulk of earlier efforts, the current study analyzed the audio signal by focusing solely on a single vowel, such as /a/, and ignoring phrases and other vowels. The analysis was performed on the Saarbruecken Voice Database,. The two-level cascaded model attained an accuracy and F1 score of 88.84% and 87.39%, respectively, which was superior to earlier attempts on the same dataset and provides a steppingstone towards a more precise early diagnosis of voice complications.
引用
下载
收藏
页数:18
相关论文
共 50 条
  • [1] Characters Recognition based on CNN-RNN architecture and Metaheuristic
    Keddous, F.
    Nguyen, H-N
    Nakib, A.
    2021 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2021, : 500 - 507
  • [2] Efficient Approach for Epileptic Seizure Classification and Detection based on Genetic Algorithm with CNN-RNN Classifier
    Priyanka, R.
    Sathesh, M.
    Selvi, Tamil T.
    Saikumar, P. Janardhan
    Venkatachalam, K.
    Raja, M.
    2024 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND APPLIED INFORMATICS, ACCAI 2024, 2024,
  • [3] Categorization of actions in soccer videos using a CNN-RNN architecture
    Macedo, Matheus de Sousa
    Adamatti, Diana Francisca
    REVISTA BRASILEIRA DE COMPUTACAO APLICADA, 2023, 15 (03): : 1 - 14
  • [4] A Robust Voice Pathology Detection System Based on the Combined BiLSTM–CNN Architecture
    Amami R.
    Amami R.
    Trabelsi C.
    Mabrouk S.H.
    Khalil H.A.
    Mendel, 2023, 29 (02) : 202 - 210
  • [5] Human Abnormality Classification Using Combined CNN-RNN Approach
    Kabir, Mohsin
    Safir, Farisa Benta
    Shahen, Saifullah
    Maua, Jannatul
    Awlad, Iffat Ara Binte
    Mridha, M. F.
    2020 IEEE 17TH INTERNATIONAL CONFERENCE ON SMART COMMUNITIES: IMPROVING QUALITY OF LIFE USING ICT, IOT AND AI (IEEEHONET 2020), 2020, : 204 - 208
  • [6] Comparative Analysis of CNN and RNN for Voice Pathology Detection
    Syed, Sidra Abid
    Rashid, Munaf
    Hussain, Samreen
    Zahid, Hira
    BIOMED RESEARCH INTERNATIONAL, 2021, 2021
  • [7] Handwritten Odia numeral recognition using combined CNN-RNN
    Das, Abhishek
    Mohanty, Mihir Narayan
    INTERNATIONAL JOURNAL OF GRID AND UTILITY COMPUTING, 2023, 14 (04) : 382 - 388
  • [8] Network security based combined CNN-RNN models for IoT intrusion detection system
    Rahma Jablaoui
    Noureddine Liouane
    Peer-to-Peer Networking and Applications, 2025, 18 (3)
  • [9] Detection of fake news using deep learning CNN-RNN based methods
    Sastrawan, I. Kadek
    Bayupati, I. P. A.
    Arsa, Dewa Made Sri
    ICT EXPRESS, 2022, 8 (03): : 396 - 408
  • [10] Emergency Detection Method in Social Media Based on CNN-RNN
    Li X.
    Bai C.
    Tiedao Xuebao/Journal of the China Railway Society, 2021, 43 (08): : 97 - 105