A Hybrid Deep Ensemble for Speech Disfluency Classification

Cited by: 9
Authors
Pravin, Sheena Christabel [1]
Palanivelan, M. [1]
Affiliations
[1] Rajalakshmi Engn Coll, Dept ECE, Chennai, Tamil Nadu, India
Keywords
Hybrid Deep Ensemble; Speech disfluency classification; Sparse speech dataset; Deep autoencoder; Latent features; RECOGNITION; ALGORITHM;
DOI
10.1007/s00034-021-01657-1
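Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Subject classification code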
0808 ; 0809 ;
Abstract
This paper proposes a novel Hybrid Deep Ensemble (HDE) for automatic speech disfluency classification on a sparse speech dataset. Categorization of speech disfluencies for the diagnosis of speech disorders has long relied on sophisticated deep learning models. The proposed model accomplishes this task with a more straightforward approach and high accuracy: it is an optimal combination of diverse machine learning and deep learning algorithms in a hierarchical arrangement, including a deep autoencoder that yields compressed latent features. The model considerably reduces processing time and overcomes the cumbersome hyper-parameter tuning and large data demands of deep learning algorithms while maintaining high classification accuracy. Experimental results show that the proposed Hybrid Deep Ensemble outperforms both the individual base learners and the deep neural network. The proposed model and the baseline models were evaluated in terms of Cohen's kappa coefficient, Hamming loss, Jaccard score, F-score and classification accuracy.
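The five evaluation measures named in the abstract can be illustrated with a minimal pure-Python sketch. This is not the authors' evaluation code: the function names, the three class labels and the toy predictions are hypothetical, and macro averaging for the F-score and Jaccard score is an assumption, since the abstract does not state which averaging was used.

```python
from collections import Counter

def accuracy(y_true, y_pred):
    # Fraction of samples whose predicted label matches the true label.
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def hamming_loss(y_true, y_pred):
    # Fraction of wrongly predicted labels; for single-label
    # multiclass data this equals 1 - accuracy.
    return sum(t != p for t, p in zip(y_true, y_pred)) / len(y_true)

def cohen_kappa(y_true, y_pred):
    # Agreement corrected for chance: (p_o - p_e) / (1 - p_e).
    n = len(y_true)
    p_o = accuracy(y_true, y_pred)
    true_counts, pred_counts = Counter(y_true), Counter(y_pred)
    p_e = sum(true_counts[c] * pred_counts[c] for c in true_counts) / (n * n)
    if p_e == 1:
        # Degenerate case (a single class everywhere): kappa is undefined;
        # returning 0.0 is a choice made for this sketch only.
        return 0.0
    return (p_o - p_e) / (1 - p_e)

def _counts(y_true, y_pred, label):
    # True positives, false positives and false negatives for one class.
    tp = sum(t == label and p == label for t, p in zip(y_true, y_pred))
    fp = sum(t != label and p == label for t, p in zip(y_true, y_pred))
    fn = sum(t == label and p != label for t, p in zip(y_true, y_pred))
    return tp, fp, fn

def macro_f1(y_true, y_pred):
    # Unweighted mean of per-class F1 = 2TP / (2TP + FP + FN).
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for c in labels:
        tp, fp, fn = _counts(y_true, y_pred, c)
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)

def macro_jaccard(y_true, y_pred):
    # Unweighted mean of per-class Jaccard = TP / (TP + FP + FN).
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for c in labels:
        tp, fp, fn = _counts(y_true, y_pred, c)
        denom = tp + fp + fn
        scores.append(tp / denom if denom else 0.0)
    return sum(scores) / len(scores)

# Hypothetical labels for three disfluency classes (illustrative only).
y_true = ["rep", "rep", "pro", "pro", "blk", "blk"]
y_pred = ["rep", "pro", "pro", "pro", "blk", "rep"]

for name, fn in [("accuracy", accuracy), ("hamming_loss", hamming_loss),
                 ("cohen_kappa", cohen_kappa), ("macro_f1", macro_f1),
                 ("macro_jaccard", macro_jaccard)]:
    print(f"{name}: {fn(y_true, y_pred):.4f}")
```

On this toy data, four of six predictions are correct, so accuracy is 2/3 and Hamming loss is 1/3; Cohen's kappa is lower (0.5) because one third of the agreement is expected by chance.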
Pages: 3968-3995
Page count: 28
Related papers
50 results in total
  • [31] Speaker independent speech emotion recognition by ensemble classification
    Schuller, B
    Reiter, S
    Müller, R
    Al-Hames, M
    Lang, M
    Rigoll, G
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1 AND 2, 2005, : 865 - 868
  • [32] Hybrid optimization enabled deep learning-based ensemble classification for heart disease detection
    R. Jayasudha
    Chanti Suragali
    J. T. Thirukrishna
    B. Santhosh Kumar
    [J]. Signal, Image and Video Processing, 2023, 17 : 4235 - 4244
  • [33] Retinal diseases classification based on hybrid ensemble deep learning and optical coherence tomography images
    Pin, Kuntha
    Han, Jung Woo
    Nam, Yunyoung
    [J]. ELECTRONIC RESEARCH ARCHIVE, 2023, 31 (08): : 4843 - 4861
  • [34] Measure and Comparison of Speech Pause Duration in Subjects with Disfluency Speech
    Teixeira, Joao Paulo
    Fernandes, Maria Goreti
    Costa, Rita Alexandra
    [J]. 4TH CONFERENCE OF ENTERPRISE INFORMATION SYSTEMS - ALIGNING TECHNOLOGY, ORGANIZATIONS AND PEOPLE (CENTERIS 2012), 2012, 5 : 812 - 819
  • [35] A Deep Ensemble Learning Method for Monaural Speech Separation
    Zhang, Xiao-Lei
    Wang, DeLiang
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (05) : 967 - 977
  • [36] Ensemble deep learning with HuBERT for speech emotion recognition
    Yang, Janghoon
    [J]. 2023 IEEE 17TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, ICSC, 2023, : 153 - 154
  • [37] Deep Ensemble Learning for Retinal Image Classification
    Ho, Edward
    Wang, Edward
    Youn, Saerom
    Sivajohan, Asaanth
    Lane, Kevin
    Chun, Jin
    Hutnik, Cindy M. L.
    [J]. TRANSLATIONAL VISION SCIENCE & TECHNOLOGY, 2022, 11 (10):
  • [38] Ensemble deep learning in speech signal tasks: A review
    Tanveer, M.
    Rastogi, Aryan
    Paliwal, Vardhan
    Ganaie, M. A.
    Malik, A. K.
    Del Ser, Javier
    Lin, Chin-Teng
    [J]. NEUROCOMPUTING, 2023, 550
  • [39] A deep ensemble learning method for cherry classification
    Kiyas Kayaalp
    [J]. European Food Research and Technology, 2024, 250 : 1513 - 1528
  • [40] Deep Learning Ensemble for Hyperspectral Image Classification
    Chen, Yushi
    Wang, Ying
    Gu, Yanfeng
    He, Xin
    Ghamisi, Pedram
    Jia, Xiuping
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2019, 12 (06) : 1882 - 1897