A CNN-based approach to identification of degradations in speech signals

被引:0
|
作者
Yuki Saishu
Amir Hossein Poorjam
Mads Græsbøll Christensen
机构
[1] Audio Analysis Lab,
[2] CREATE,undefined
[3] Aalborg University,undefined
[4] Verisk Analytics,undefined
关键词
Signal enhancement; Convolutional neural network; Identification of degradation; Quality control; Visualization;
D O I
暂无
中图分类号
学科分类号
摘要
The presence of degradations in speech signals, which causes acoustic mismatch between training and operating conditions, deteriorates the performance of many speech-based systems. A variety of enhancement techniques have been developed to compensate the acoustic mismatch in speech-based applications. To apply these signal enhancement techniques, however, it is necessary to know prior information about the presence and the type of degradations in speech signals. In this paper, we propose a new convolutional neural network (CNN)-based approach to automatically identify the major types of degradations commonly encountered in speech-based applications, namely additive noise, nonlinear distortion, and reverberation. In this approach, a set of parallel CNNs, each detecting a certain degradation type, is applied to the log-mel spectrogram of audio signals. Experimental results using two different speech types, namely pathological voice and normal running speech, show the effectiveness of the proposed method in detecting the presence and the type of degradations in speech signals which outperforms the state-of-the-art method. Using the score weighted class activation mapping, we provide a visual analysis of how the network makes decision for identifying different types of degradation in speech signals by highlighting the regions of the log-mel spectrogram which are more influential to the target degradation.
引用
收藏
相关论文
共 50 条
  • [31] Blood Pressure Estimation with Phonocardiogram on CNN-Based Approach
    Kokkhunthod, Kasidit
    Phapatanaburi, Khomdet
    Pathonsuwan, Wongsathon
    Jumphoo, Talit
    Anchuen, Patikorn
    Nimkuntod, Porntip
    Uthansakul, Monthippa
    Uthansakul, Peerapong
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 79 (02): : 1775 - 1794
  • [32] A Novel CNN-Based Approach for Recognizing Facial Emotion
    Huu Hiep Nguyen
    Luong Anh Tuan Nguyen
    Anh Quan Tran
    Thi Ngoc Thanh Nguyen
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2019, 19 (05): : 237 - 243
  • [33] Hajj Crowd Management Using CNN-Based Approach
    Albattah, Waleed
    Khel, Muhammad Haris Kaka
    Habib, Shabana
    Islam, Muhammad
    Khan, Sheroz
    Kadir, Kushsairy Abdul
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 66 (02): : 2183 - 2197
  • [34] A CNN-based temperature prediction approach for grain storage
    School of Computer Science and Technology, University of Science and Technology of China, Hefei, Anhui, China
    不详
    [J]. Int. J. Internet Manuf. Serv, 2020, 4 (345-357): : 345 - 357
  • [35] Deep CNN-based Inductive Transfer Learning for Sarcasm Detection in Speech
    Gao, Xiyuan
    Nayak, Shekhar
    Coler, Matt
    [J]. INTERSPEECH 2022, 2022, : 2323 - 2327
  • [36] Signals hierarchical feature enhancement method for CNN-based fault diagnosis
    Zhang, Huang
    Zhang, Shuyou
    Wang, Zili
    Qiu, Lemiao
    Zhang, Yiming
    [J]. ADVANCES IN MECHANICAL ENGINEERING, 2022, 14 (09)
  • [37] CNN-based fault classification considered fault location of vibration signals
    Lee, Jeong Jun
    Cheong, Deok Young
    Min, Tae Hong
    Park, Dong Hee
    Choi, Byeong Keun
    [J]. JOURNAL OF MECHANICAL SCIENCE AND TECHNOLOGY, 2023, 37 (10) : 5021 - 5029
  • [38] A Speech Quality Classifier based on Tree-CNN Algorithm that Considers Network Degradations
    Vieira, Samuel Terra
    Rosa, Renata Lopes
    Rodriguez, Demostenes Zegarra
    [J]. JOURNAL OF COMMUNICATIONS SOFTWARE AND SYSTEMS, 2020, 16 (02) : 180 - 187
  • [39] A CNN-Based Approach for Driver Drowsiness Detection by Real-Time Eye State Identification
    Florez, Ruben
    Palomino-Quispe, Facundo
    Coaquira-Castillo, Roger Jesus
    Herrera-Levano, Julio Cesar
    Paixao, Thuanne
    Alvarez, Ana Beatriz
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (13):
  • [40] CNN-based classification of fNIRS signals in motor imagery BCI system
    Ma, Tengfei
    Wang, Shasha
    Xia, Yuting
    Zhu, Xinhua
    Evans, Julian
    Sun, Yaoran
    He, Sailing
    [J]. Journal of Neural Engineering, 2021, 18 (05)