A CNN-based approach to identification of degradations in speech signals

被引:0
|
作者
Yuki Saishu
Amir Hossein Poorjam
Mads Græsbøll Christensen
机构
[1] Audio Analysis Lab,
[2] CREATE,undefined
[3] Aalborg University,undefined
[4] Verisk Analytics,undefined
关键词
Signal enhancement; Convolutional neural network; Identification of degradation; Quality control; Visualization;
D O I
暂无
中图分类号
学科分类号
摘要
The presence of degradations in speech signals, which causes acoustic mismatch between training and operating conditions, deteriorates the performance of many speech-based systems. A variety of enhancement techniques have been developed to compensate the acoustic mismatch in speech-based applications. To apply these signal enhancement techniques, however, it is necessary to know prior information about the presence and the type of degradations in speech signals. In this paper, we propose a new convolutional neural network (CNN)-based approach to automatically identify the major types of degradations commonly encountered in speech-based applications, namely additive noise, nonlinear distortion, and reverberation. In this approach, a set of parallel CNNs, each detecting a certain degradation type, is applied to the log-mel spectrogram of audio signals. Experimental results using two different speech types, namely pathological voice and normal running speech, show the effectiveness of the proposed method in detecting the presence and the type of degradations in speech signals which outperforms the state-of-the-art method. Using the score weighted class activation mapping, we provide a visual analysis of how the network makes decision for identifying different types of degradation in speech signals by highlighting the regions of the log-mel spectrogram which are more influential to the target degradation.
引用
收藏
相关论文
共 50 条
  • [1] A CNN-based approach to identification of degradations in speech signals
    Saishu, Yuki
    Poorjam, Amir Hossein
    Christensen, Mads Graesboll
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2021, 2021 (01)
  • [2] How Image Degradations Affect Deep CNN-based Face Recognition?
    Karahan, Samil
    Yildirm, Merve Kilinc
    Kirtac, Kadir
    Rende, Ferhat Sukru
    Butun, Gultekin
    Ekenel, Hazim Kemal
    [J]. PROCEEDINGS OF THE 15TH INTERNATIONAL CONFERENCE OF THE BIOMETRICS SPECIAL INTEREST GROUP (BIOSIG 2016), 2016, P-260
  • [3] A Generic Approach CNN-Based Camera Identification for Manipulated Images
    El-Yamany, Ahmed
    Fouad, Hossam
    Raffat, Youssef
    Alghoniemy, Masoud
    [J]. 2018 IEEE 3RD INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP), 2018, : 43 - 48
  • [4] A Generic Approach CNN-Based Camera Identification for Manipulated Images
    El-Yamany, Ahmed
    Fouad, Hossam
    Raffat, Youssef
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ELECTRO/INFORMATION TECHNOLOGY (EIT), 2018, : 165 - +
  • [5] CNN-based algorithm for drusen identification
    Checco, Paolo
    Corinto, Fernando
    [J]. 2006 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, PROCEEDINGS, 2006, : 2181 - +
  • [6] A CNN-based vortex identification method
    Liang Deng
    Yueqing Wang
    Yang Liu
    Fang Wang
    Sikun Li
    Jie Liu
    [J]. Journal of Visualization, 2019, 22 : 65 - 78
  • [7] CNN-based fish iris identification
    Schraml, Rudolf
    Wimmer, Georg
    Hofbauer, Heinz
    Jalilian, Ehsaneddin
    Bekkozhayeva, Dinara
    Cisar, Petr
    Uhl, Andreas
    [J]. 2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 628 - 632
  • [8] A CNN-based vortex identification method
    Deng, Liang
    Wang, Yueqing
    Liu, Yang
    Wang, Fang
    Li, Sikun
    Liu, Jie
    [J]. JOURNAL OF VISUALIZATION, 2019, 22 (01) : 65 - 78
  • [9] CNN-Based Identification of Parkinson's Disease from Continuous Speech in Noisy Environments
    Farago, Paul
    Stefaniga, Sebastian-Aurelian
    Cordos, Claudia-Georgiana
    Mihaila, Laura-Ioana
    Hintea, Sorin
    Pestean, Ana-Sorina
    Beyer, Michel
    Perju-Dumbrava, Lacramioara
    Ilesan, Robert Radu
    [J]. BIOENGINEERING-BASEL, 2023, 10 (05):
  • [10] CNN-Based Fast Source Device Identification
    Mandelli, Sara
    Cozzolino, Davide
    Bestagini, Paolo
    Verdoliva, Luisa
    Tubaro, Stefano
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2020, 27 : 1285 - 1289