A depthwise separable CNN-based interpretable feature extraction network for automatic pathological voice detection

被引:8
|
作者
Zhao, Denghuang [1 ]
Qiu, Zhixin [1 ]
Jiang, Yujie [1 ]
Zhu, Xincheng [1 ]
Zhang, Xiaojun [1 ]
Tao, Zhi [1 ]
机构
[1] Soochow Univ, 1 Shizi St, Suzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Pathological voice detection; Deep learning; Interpretability; Depthwise separable CNN; CLASSIFICATION; INFORMATION; CEPSTRUM; VOWEL;
D O I
10.1016/j.bspc.2023.105624
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
In recent years, deep learning methods in automatic pathological voice detection (APVD) have gained satisfying results. However, most deep learning methods in APVD cannot explain their performance. Interpretability is crucial in deep learning methods applied to the medical field. A lack of interpretability makes it hard for existing methods to give better generalization performance than meaningful feature-based methods in practical appli-cations. This paper proposed an interpretable neural network architecture called the Interpretable Multi-band Feature Extraction Network (IMBFN) based on clear feature extraction logic and a comprehensive result judg-ment method to improve the effectiveness and generalization performance of APVD. An amplitude-trainable SincNet (AT-SincNet) filter bank was put forward in IMBFN and applied as the front-end frequency division network. In addition, IMBFN used a designed two-path one-dimensional depthwise separatable convolutional neural network (CNN)-based feature extractor to extract meaningful voice features. The classification results of each voice frame were used to judge whether the voice was pathological synthetically. Comparative experiments were conducted using data from the MEEI, SVD, and HUPA databases. The best improvement of accuracy, F1-score, and Matthews correlation coefficient (MCC) reached 0.1705, 0.1977, and 0.4463, respectively. Also, blind tests were carried out in participants from the First Affiliated Hospital of Soochow University, and an accuracy, F1-score, and MCC of 0.7594, 0.8491, and 0.2981, respectively, were obtained. Results demonstrated that IMBFN provided meaningful explanations, good APVD effect, and better generalization performance than existing methods.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Epileptic Seizure Detection Based on Feature Extraction and CNN-BiGRU Network with Attention Mechanism
    Xu, Jie
    Wang, Juan
    Liu, Jin-Xing
    Shang, Junliang
    Dai, Lingyun
    Yan, Kuiting
    Yuan, Shasha
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, ICIC 2023, PT II, 2023, 14087 : 308 - 319
  • [42] Exploring the Influence of Input Feature Space on CNN-Based Geomorphic Feature Extraction From Digital Terrain Data
    Maxwell, Aaron E.
    Odom, William E.
    Shobe, Charles M.
    Doctor, Daniel H.
    Bester, Michelle S.
    Ore, Tobi
    EARTH AND SPACE SCIENCE, 2023, 10 (05)
  • [43] Automatic Layout Feature Extraction for Lithography Hotspot Detection Based on Deep Neural Network
    Matsunawa, Tetsuaki
    Nojima, Shigeki
    Kotani, Toshiya
    DESIGN-PROCESS-TECHNOLOGY CO-OPTIMIZATION FOR MANUFACTURABILITY X, 2016, 9781
  • [44] A Light Weight Depthwise Separable Layer Optimized CNN Architecture for Object-Based Forgery Detection in Surveillance Videos
    Sandhya
    Kashyap, Abhishek
    COMPUTER JOURNAL, 2024, 67 (06): : 2270 - 2285
  • [45] DeepDFML-NILM: A New CNN-Based Architecture for Detection, Feature Extraction and Multi-Label Classification in NILM Signals
    Nolasco, Lucas da Silva
    Lazzaretti, Andre Eugenio
    Mulinari, Bruna Machado
    IEEE SENSORS JOURNAL, 2022, 22 (01) : 501 - 509
  • [46] CNN-based automated approach to crack-feature detection in steam cycle components
    Fei, Zhouxiang
    West, Graeme M.
    Murray, Paul
    Dobie, Gordon
    INTERNATIONAL JOURNAL OF PRESSURE VESSELS AND PIPING, 2024, 207
  • [47] Joint motion boundary detection and CNN-based feature visualization for video object segmentation
    Zahra Kamranian
    Ahmad Reza Naghsh Nilchi
    Hamid Sadeghian
    Federico Tombari
    Nassir Navab
    Neural Computing and Applications, 2020, 32 : 4073 - 4091
  • [48] Joint motion boundary detection and CNN-based feature visualization for video object segmentation
    Kamranian, Zahra
    Nilchi, Ahmad Reza Naghsh
    Sadeghian, Hamid
    Tombari, Federico
    Navab, Nassir
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (08): : 4073 - 4091
  • [49] Towards Automatic Depression Detection: A BiLSTM/1D CNN-Based Model
    Lin, Lin
    Chen, Xuri
    Shen, Ying
    Zhang, Lin
    APPLIED SCIENCES-BASEL, 2020, 10 (23): : 1 - 20
  • [50] Tomato Disease Detection from Tomato Leaf Images Using CNN-Based Feature Extraction, Feature Selection with Whale Optimization Algorithm, and SVM Classifier
    Le Thi Thu Hong
    Nguyen Sinh Huy
    Doan Quang Tu
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS-ICCSA 2024, PT I, 2024, 14813 : 192 - 205