Assessment of Amplified Parkinsonian Speech Quality Using Deep Learning

被引:0
|
作者
Gaballah, Amr [1 ]
Parsa, Vijay [1 ]
Andreetta, Monika [1 ]
Adams, Scott [1 ]
机构
[1] Western Univ, London, ON, Canada
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, deep neural networks (DNNs) are applied to features extracted from Parkinsonian speech recordings to predict their perceived quality. This procedure was also used to benchmark the electroacoustic characteristics of speech amplifiers used by people afflicted with Parkinson Disease (PD). Speech recordings were obtained from 11 PD subjects and 10 normal controls, with and without the assistance of 7 different speech amplifiers, and their quality was assessed subjectively by normal hearing listeners. Mel-frequency and Gammatone frequency cepstral coefficients (MFCCs and GFCCs respectively) and their first order derivatives were extracted as features, and given as input to the DNN. Two optimizers were used to train the neural network, namely stochastic gradient descent (SGD) and Adam optimizers. The paper also shows the effect of feature reduction in enhancing the performance of the objective predictors. Experimental results showed that a trained DNN with reduced set of GFCC features outperforms other objective metrics in terms of correlation with the subjective measures.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] Objective and Subjective Assessment of Amplified Parkinsonian Speech Quality
    Gaballah, Amr
    Parsa, Vijay
    Andreetta, Monika
    Adams, Scott
    [J]. 2018 40TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2018, : 2084 - 2087
  • [2] Interference Quality Assessment of Speech Communication Based on Deep Learning
    Wang, Sen
    Lin, Yun
    Hao, Ming
    Xu, Huaitao
    Tian, Qiao
    [J]. IEEE TRANSACTIONS ON RELIABILITY, 2022, 71 (02) : 1011 - 1021
  • [3] Deep Learning Based Speech Quality Assessment Focusing on Noise Effects
    Jaiswal, Rahul
    Priya, Anu
    [J]. SPEECH AND COMPUTER, SPECOM 2023, PT II, 2023, 14339 : 271 - 282
  • [4] Electrocardiogram Quality Assessment Using Unsupervised Deep Learning
    Seeuws, Nick
    De Vos, Maarten
    Bertrand, Alexander
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2022, 69 (02) : 882 - 893
  • [5] Retinal image quality assessment using deep learning
    Zago, Gabriel Tozatto
    Andreao, Rodrigo Varejao
    Dorizzi, Bernadette
    Teatini Salles, Evandro Ottoni
    [J]. COMPUTERS IN BIOLOGY AND MEDICINE, 2018, 103 : 64 - 70
  • [6] English Speech Recognition and Evaluation of Pronunciation Quality Using Deep Learning
    Xu, Yushu
    [J]. MOBILE INFORMATION SYSTEMS, 2022, 2022
  • [7] Quality assessment of retargeted images using deep learning capabilities
    Absetan, Ahmad
    Fathi, Abdolhossein
    [J]. COMPUTERS & GRAPHICS-UK, 2024, 120
  • [8] Latent Fingerprint Image Quality Assessment Using Deep Learning
    Ezeobiejesi, Jude
    Bhanu, Bir
    [J]. PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, : 621 - 629
  • [9] Speech Recognition using Deep Learning
    Lakkhanawannakun, Phoemporn
    Noyunsan, Chaluemwut
    [J]. 2019 34TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC 2019), 2019, : 514 - 517
  • [10] Speech Separation Using Deep Learning
    Nandal, P.
    [J]. SUSTAINABLE COMMUNICATION NETWORKS AND APPLICATION, ICSCN 2019, 2020, 39 : 319 - 326