Convolutional Neural Networks for Pathological Voice Detection

被引:0
|
作者
Wu, Huiyi [1 ]
Soraghan, John [1 ]
Lowit, Anja [2 ]
Di Caterina, Gaetano [1 ]
机构
[1] Univ Strathclyde, Ctr Signal & Image Proc, Dept Elect & Elect Engn, Glasgow G1 1XW, Lanark, Scotland
[2] Univ Strathclyde, Sch Psychol Sci & Hlth, Speech & Language Therapy, Glasgow G1 1QE, Lanark, Scotland
关键词
AUTOMATIC DETECTION;
D O I
暂无
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Acoustic analysis using signal processing tools can be used to extract voice features to distinguish whether a voice is pathological or healthy. The proposed work uses spectrogram of voice recordings from a voice database as the input to a Convolutional Neural Network (CNN) for automatic feature extraction and classification of disordered and normal voice. The novel classifier achieved 88.5%, 66.2% and 77.0% accuracy on training, validation and testing data set respectively on 482 normal and 482 organic dysphonia speech files. It reveals that the proposed novel algorithm on the Saarbruecken Voice Database can effectively been used for screening pathological voice recordings.
引用
收藏
页码:4784 / 4787
页数:4
相关论文
共 50 条
  • [1] Singing Voice Detection Based on Convolutional Neural Networks
    Huang, Hong-Ming
    Chen, Woei-Kae
    Liu, Chien-Hung
    You, Shingchern D.
    2018 7TH IEEE INTERNATIONAL SYMPOSIUM ON NEXT-GENERATION ELECTRONICS (ISNE), 2018, : 223 - 226
  • [2] Low Frequency Ultrasonic Voice Activity Detection using Convolutional Neural Networks
    McLoughlin, Ian
    Song, Yan
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2400 - 2404
  • [3] Exploring Channel Properties to Improve Singing Voice Detection with Convolutional Neural Networks
    Gui, Wenming
    Li, Yukun
    Zang, Xian
    Zhang, Jinglan
    APPLIED SCIENCES-BASEL, 2021, 11 (24):
  • [4] Detection of Pathological Myopia and Optic Disc Segmentation with Deep Convolutional Neural Networks
    Baid, Ujjwal
    Baheti, Bhakti
    Dutande, Prasad
    Talbar, Sanjay
    PROCEEDINGS OF THE 2019 IEEE REGION 10 CONFERENCE (TENCON 2019): TECHNOLOGY, KNOWLEDGE, AND SOCIETY, 2019, : 1345 - 1350
  • [5] Joint Detection and Classification of Singing Voice Melody Using Convolutional Recurrent Neural Networks
    Kum, Sangeun
    Nam, Juhan
    APPLIED SCIENCES-BASEL, 2019, 9 (07):
  • [6] Detection of pathological voice using convolutional neural network (CNN) and mel frequency cepstral coefficient ( MFCC)
    Lee, S. H.
    BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2020, 127 : 23 - 23
  • [7] RECOGNITION OF SPOOFED VOICE USING CONVOLUTIONAL NEURAL NETWORKS
    Liang, Huixin
    Lin, Xiaodan
    Zhang, Qiong
    Kang, Xiangui
    2017 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2017), 2017, : 293 - 297
  • [8] Deep Convolutional Neural Network for Voice Liveness Detection
    Gupta, Siddhant
    Khoria, Kuldeep
    Patil, Ankur T.
    Patil, Hemant A.
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 775 - 779
  • [9] Convolutional neural networks for radar detection
    López-Risueño, G
    Grajal, J
    Haykin, S
    Díaz-Oliver, R
    ARTIFICIAL NEURAL NETWORKS - ICANN 2002, 2002, 2415 : 1150 - 1155
  • [10] Pedestrian detection with convolutional neural networks
    Szarvas, M
    Yoshizawa, A
    Yamamoto, M
    Ogata, J
    2005 IEEE INTELLIGENT VEHICLES SYMPOSIUM PROCEEDINGS, 2005, : 224 - 229