A CNN-based approach to identification of degradations in speech signals

Cited by: 0
Authors
Yuki Saishu
Amir Hossein Poorjam
Mads Græsbøll Christensen
Affiliations
[1] Audio Analysis Lab
[2] CREATE
[3] Aalborg University
[4] Verisk Analytics
Keywords
Signal enhancement; Convolutional neural network; Identification of degradation; Quality control; Visualization
DOI
Not available
CLC number
Discipline classification code
Abstract
The presence of degradations in speech signals causes an acoustic mismatch between training and operating conditions and degrades the performance of many speech-based systems. A variety of enhancement techniques have been developed to compensate for this mismatch. To apply these techniques, however, one needs prior information about the presence and type of degradations in the speech signals. In this paper, we propose a new convolutional neural network (CNN)-based approach that automatically identifies the major types of degradation commonly encountered in speech-based applications, namely additive noise, nonlinear distortion, and reverberation. In this approach, a set of parallel CNNs, each detecting a certain degradation type, is applied to the log-mel spectrogram of the audio signal. Experimental results on two different speech types, namely pathological voice and normal running speech, show that the proposed method is effective in detecting the presence and type of degradations in speech signals and that it outperforms the state-of-the-art method. Using score-weighted class activation mapping, we provide a visual analysis of how the network makes its decisions when identifying different types of degradation by highlighting the regions of the log-mel spectrogram that are most influential for the target degradation.
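The structure described in the abstract (one log-mel spectrogram fed to several independent per-degradation detectors) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' implementation: the sampling rate, FFT size, hop length, and number of mel bands are assumed values, and `TinyDetector` is a hypothetical, untrained stand-in (one small convolution, global average pooling, and a sigmoid) for the CNNs used in the paper.

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_mels, n_fft, sr):
    # Triangular filters with centres spaced evenly on the mel scale.
    hz_pts = mel_to_hz(np.linspace(hz_to_mel(0.0), hz_to_mel(sr / 2.0), n_mels + 2))
    fft_freqs = np.linspace(0.0, sr / 2.0, n_fft // 2 + 1)
    fb = np.zeros((n_mels, n_fft // 2 + 1))
    for i in range(n_mels):
        lo, mid, hi = hz_pts[i], hz_pts[i + 1], hz_pts[i + 2]
        rising = (fft_freqs - lo) / (mid - lo)
        falling = (hi - fft_freqs) / (hi - mid)
        fb[i] = np.maximum(0.0, np.minimum(rising, falling))
    return fb

def log_mel_spectrogram(x, sr=16000, n_fft=512, hop=160, n_mels=40):
    # All parameter defaults are assumptions chosen for illustration.
    window = np.hanning(n_fft)
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack([x[i * hop:i * hop + n_fft] * window for i in range(n_frames)])
    power = np.abs(np.fft.rfft(frames, axis=1)) ** 2      # (n_frames, n_fft // 2 + 1)
    mel = power @ mel_filterbank(n_mels, n_fft, sr).T      # (n_frames, n_mels)
    return np.log(mel + 1e-10).T                           # (n_mels, n_frames)

class TinyDetector:
    """Hypothetical placeholder for one binary CNN: 3x3 conv, ReLU, global pooling, sigmoid."""

    def __init__(self, rng, n_kernels=4):
        # Random, untrained weights; a real detector would be trained per degradation.
        self.kernels = rng.normal(scale=0.1, size=(n_kernels, 3, 3))
        self.w = rng.normal(scale=0.1, size=n_kernels)
        self.b = 0.0

    def __call__(self, spec):
        # Valid 2-D cross-correlation over all 3x3 patches of the spectrogram.
        patches = np.lib.stride_tricks.sliding_window_view(spec, (3, 3))
        feats = np.einsum("hwij,kij->khw", patches, self.kernels)
        pooled = np.maximum(feats, 0.0).mean(axis=(1, 2))
        return 1.0 / (1.0 + np.exp(-(pooled @ self.w + self.b)))

DEGRADATIONS = ("additive_noise", "nonlinear_distortion", "reverberation")

def identify(x, detectors, threshold=0.5):
    # Each detector scores the same spectrogram independently, so several
    # degradations can be flagged at once.
    spec = log_mel_spectrogram(x)
    scores = {name: float(det(spec)) for name, det in detectors.items()}
    flags = {name: score >= threshold for name, score in scores.items()}
    return flags, scores

rng = np.random.default_rng(0)
detectors = {name: TinyDetector(rng) for name in DEGRADATIONS}
signal = rng.normal(size=16000)  # one second of synthetic noise at the assumed 16 kHz
flags, scores = identify(signal, detectors)
```

Because the detectors are independent binary classifiers rather than one multi-class classifier, a signal that is both noisy and reverberant can trigger two flags simultaneously. With the random weights above the scores carry no meaning; only the data flow is illustrated.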
Related papers
50 in total
  • [41] A CNN-Based Approach for Driver Drowsiness Detection by Real-Time Eye State Identification
    Florez, Ruben
    Palomino-Quispe, Facundo
    Coaquira-Castillo, Roger Jesus
    Herrera-Levano, Julio Cesar
    Paixao, Thuanne
    Alvarez, Ana Beatriz
    APPLIED SCIENCES-BASEL, 2023, 13 (13):
  • [42] CNN-based classification of fNIRS signals in motor imagery BCI system
    Ma T.
    Wang S.
    Xia Y.
    Zhu X.
    Evans J.
    Sun Y.
    He S.
    Journal of Neural Engineering, 2021, 18 (05)
  • [43] CNN-Based Identification of Hyperspectral Bacterial Signatures for Digital Microbiology
    Turra, Giovanni
    Arrigoni, Simone
    Signoroni, Alberto
    IMAGE ANALYSIS AND PROCESSING (ICIAP 2017), PT II, 2017, 10485 : 500 - 510
  • [44] A CNN-Based Electromagnetic Interference Source Identification and Location System
    Xiao, Yingchun
    Yang, Yang
    Zhu, Feng
    IEEE INSTRUMENTATION & MEASUREMENT MAGAZINE, 2024, 27 (09) : 63 - 70
  • [45] CNN-Based Tree Species Identification from Bark Image
    Ido, Junya
    Saitoh, Takeshi
    TENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2018), 2019, 11069
  • [46] A CNN-Based Identification Method for Products Appearing in Panoramic Images
    Pan, Siqiang
    Shibata, Kazuki
    Ohta, Masaya
    2018 IEEE 7TH GLOBAL CONFERENCE ON CONSUMER ELECTRONICS (GCCE 2018), 2018, : 656 - 657
  • [47] MULTI-SPEAKER EMOTIONAL ACOUSTIC MODELING FOR CNN-BASED SPEECH SYNTHESIS
    Choi, Heejin
    Park, Sangjun
    Park, Jinuk
    Hahn, Minsoo
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6950 - 6954
  • [48] A CNN-based Path Trajectory Prediction Approach with Safety Constraints
    Zaman, Mostafa
    Zohrabi, Nasibeh
    Abdelwahed, Sherif
    2020 IEEE TRANSPORTATION ELECTRIFICATION CONFERENCE & EXPO (ITEC), 2020, : 267 - 272
  • [49] A novel CNN-based approach for detection and classification of DDoS attacks
    Najar, Ashfaq Ahmad
    Sugali, Manohar Naik
    Lone, Faisal Rasheed
    Nazir, Azra
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2024, 36 (19):
  • [50] A CNN-based approach for upscaling multiphase flow in digital sandstones
    Siavashi, Javad
    Najafi, Arman
    Ebadi, Mohammad
    Sharifi, Mohammad
    FUEL, 2022, 308