A Deep Learning Method for Pathological Voice Detection using Convolutional Deep Belief Network

被引:0
|
作者
Wu, Huiyi [1 ]
Soraghan, John [1 ]
Lowit, Anja [2 ]
Di Caterina, Gaetano [1 ]
机构
[1] Univ Strathclyde, Ctr Signal & Image Proc, Dept Elect & Elect Engn, Glasgow, Lanark, Scotland
[2] Univ Strathclyde, Speech & Language Therapy, Sch Psychol Sci & Hlth, Glasgow, Lanark, Scotland
关键词
pathological voice detection; convolutional neural network (CNN); Convolutional deep belief network (CDBN); deep learning; NEURAL-NETWORKS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automatically detecting pathological voice disorders such as vocal cord paralysis or Reinke's edema is a challenging and important medical classification problem. While deep learning techniques have achieved significant progress in the speech recognition field there has been less research work in the area of pathological voice disorders detection. A novel system for pathological voice detection using convolutional neural network (CNN) as the basic architecture is presented in this work. The novel system uses spectrograms of normal and pathological speech recordings as the input to the network. Initially Convolutional deep belief network (CDBN) are used to pre-train the weights of CNN system. This acts as a generative model to explore the structure of the input data using statistical methods. Then a CNN is trained using supervised back-propagation learning algorithm to fine tune the weights. It will be shown that a small amount of data can be used to achieve good results in classification with this deep learning approach. A performance analysis of the novel method is provided using real data from the Saarbrucken Voice database.
引用
收藏
页码:446 / 450
页数:5
相关论文
共 50 条
  • [1] Deep Learning Approaches for Pathological Voice Detection Using Heterogeneous Parameters
    Lee, JiYeoun
    Choi, Hee-Jin
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2020, E103D (08) : 1920 - 1923
  • [2] Detection of Pathological Voice Using Cepstrum Vectors: A Deep Learning Approach
    Fang, Shih-Hau
    Tsao, Yu
    Hsiao, Min-Jing
    Chen, Ji-Ying
    Lai, Ying-Hui
    Lin, Feng-Chuan
    Wang, Chi-Te
    [J]. JOURNAL OF VOICE, 2019, 33 (05) : 634 - 641
  • [3] Deep Convolutional Neural Network for Voice Liveness Detection
    Gupta, Siddhant
    Khoria, Kuldeep
    Patil, Ankur T.
    Patil, Hemant A.
    [J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 775 - 779
  • [4] Deep convolutional neural network for detection of pathological speech
    Vavrek, Lukas
    Hires, Mate
    Kumar, Dinesh
    Drotar, Peter
    [J]. 2021 IEEE 19TH WORLD SYMPOSIUM ON APPLIED MACHINE INTELLIGENCE AND INFORMATICS (SAMI 2021), 2021, : 245 - 249
  • [5] An Object Detection by using Adaptive Structural Learning of Deep Belief Network
    Kamada, Shin
    Ichimura, Takumi
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [6] Intrusion Detection using Deep Belief Network
    Raza, Kamran
    Adil, Syed Hasan
    [J]. MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2014, 33 (04) : 485 - 491
  • [7] Brain Hemorrhage Detection Using Deep Learning: Convolutional Neural Network
    Navadia, Nipun R.
    Kaur, Gurleen
    Bhardwaj, Harshit
    [J]. INFORMATION SYSTEMS AND MANAGEMENT SCIENCE, ISMS 2021, 2023, 521 : 565 - 570
  • [8] Voice disorder classification using convolutional neural network based on deep transfer learning
    Peng, Xiangyu
    Xu, Huoyao
    Liu, Jie
    Wang, Junlang
    He, Chaoming
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01)
  • [9] Voice disorder classification using convolutional neural network based on deep transfer learning
    Xiangyu Peng
    Huoyao Xu
    Jie Liu
    Junlang Wang
    Chaoming He
    [J]. Scientific Reports, 13
  • [10] Face Detection Algorithm Based on Convolutional Pooling Deep Belief Network
    Wang, Dandan
    Li, Ming
    Li, Xiaoxu
    [J]. PROCEEDINGS OF THE 2017 2ND INTERNATIONAL CONFERENCE ON ELECTRICAL, AUTOMATION AND MECHANICAL ENGINEERING (EAME 2017), 2017, 86 : 273 - 276