Deep neural network based speech enhancement using mono channel mask

Cited by: 4
Authors
Ingale, Pallavi P. [1 ]
Nalbalwar, Sanjay L. [1 ]
Affiliations
[1] Dr Babasaheb Ambedkar Technol Univ, Lonere, India
Keywords
Speech enhancement; Mono channel mask; Binary mask; Modified sub-harmonic summation; Classification-based approach; Noise
DOI
10.1007/s10772-019-09627-4
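Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification codes
0808; 0809
Abstract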
Recovering enhanced speech from a noisy speech signal is an important task in speech processing. Here we propose a deep neural network (DNN) based speech enhancement method that uses a mono channel mask. The proposed method uses a cochleagram to find an initial binary mask. A modified sub-harmonic summation algorithm is then applied to the initial binary mask to obtain an intermediate mask. The spectro-temporal features of this intermediate mask are fed to the DNN, which identifies the correct spectral structure in the frames associated with the target speech; this information is then used to build the mono channel mask. The speech signal is reconstructed using the mono channel mask, which avoids unnecessary interference from noisy time-frequency (T-F) units. Objective evaluations using perceptual evaluation of speech quality (PESQ) and normalized source-to-distortion ratio indicate that the proposed method outperforms state-of-the-art speech enhancement methods. The PESQ values obtained show that the proposed method improves speech quality in noisy conditions, and the experimental results demonstrate the effectiveness of the mono channel mask for speech enhancement.
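As a rough illustration of the binary-masking idea the abstract builds on, the sketch below marks each time-frequency (T-F) unit as speech-dominant or noise-dominant and zeroes out the noise-dominant units. It is not the authors' mono channel mask: it assumes the clean speech and noise energies are known separately (an oracle "ideal binary mask"), uses a plain energy matrix in place of a cochleagram, and the names `binary_mask`, `apply_mask`, and `lc_db` are hypothetical.

```python
import numpy as np

def binary_mask(speech_energy, noise_energy, lc_db=0.0):
    """Oracle binary mask: 1 where the local SNR (in dB) exceeds lc_db, else 0.

    Assumption: speech and noise energies per T-F unit are known separately,
    which is only true for synthetic mixtures, not for real noisy recordings.
    """
    eps = 1e-12  # guards against division by zero and log of zero
    snr_db = 10.0 * np.log10((speech_energy + eps) / (noise_energy + eps))
    return (snr_db > lc_db).astype(np.float64)

def apply_mask(mixture_tf, mask):
    """Keep speech-dominant T-F units of the mixture and zero out the rest."""
    return mixture_tf * mask

# Toy 2x2 "T-F representation": rows are frequency channels, columns are frames.
speech = np.array([[4.0, 0.1],
                   [0.2, 9.0]])
noise = np.ones((2, 2))

mask = binary_mask(speech, noise)
enhanced = apply_mask(speech + noise, mask)
```

In this toy case only the two units where speech energy exceeds noise energy survive the mask. A learned system such as the one described in the abstract has to estimate the mask from the noisy signal alone, which is the part this sketch leaves out.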
Pages: 841-850
Page count: 10
Related papers
50 in total
  • [1] Deep neural network based speech enhancement using mono channel mask
    Pallavi P. Ingale
    Sanjay L. Nalbalwar
    [J]. International Journal of Speech Technology, 2019, 22 : 841 - 850
  • [2] Monaural speech enhancement combining accurate ratio mask and deep neural network
    BAI Haojun
    ZHANG Tianqi
    LIU Jianxing
    YE Shaopeng
    [J]. Chinese Journal of Acoustics, 2022, 41 (04) : 373 - 389
  • [3] Ideal neighbourhood mask for speech enhancement using deep neural networks
    Arcos, Christian
    Vellasco, Marley
    Alcaim, Abraham
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [4] Speech Enhancement based on Deep Convolutional Neural Network
    Nuthakki, Ramesh
    Masanta, Payel
    Yukta, T. N.
    [J]. PROCEEDINGS OF THE 2021 FIFTH INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC 2021), 2021, : 770 - 775
  • [5] Supervised speech enhancement based on deep neural network
    Saleem, Nasir
    Khattak, Muhammad Irfan
    Qazi, Abdul Baser
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 37 (04) : 5187 - 5201
  • [6] Deep Neural Network for Supervised Single-Channel Speech Enhancement
    Saleem, Nasir
    Irfan Khattak, Muhammad
    Ali, Muhammad Yousaf
    Shafi, Muhammad
    [J]. ARCHIVES OF ACOUSTICS, 2019, 44 (01) : 3 - 12
  • [7] Time-frequency mask estimation-based speech enhancement using deep encoder-decoder neural network
    SHI Wenhua
    ZHANG Xiongwei
    ZOU Xia
    SUN Meng
    LI Li
    REN Zhengbing
    [J]. Chinese Journal of Acoustics, 2021, 40 (01) : 141 - 154
  • [8] Kernel Machines Beat Deep Neural Networks on Mask-based Single-channel Speech Enhancement
    Hui, Like
    Ma, Siyuan
    Belkin, Mikhail
    [J]. INTERSPEECH 2019, 2019, : 2748 - 2752
  • [9] Single channel speech enhancement using convolutional neural network
    Kounovsky, Tomas
    Malek, Jiri
    [J]. 2017 IEEE INTERNATIONAL WORKSHOP OF ELECTRONICS, CONTROL, MEASUREMENT, SIGNALS AND THEIR APPLICATION TO MECHATRONICS (ECMSM), 2017,
  • [10] A Single-channel Speech Enhancement Approach Based on Perceptual Masking Deep Neural Network
    [J]. Zhang, Xiong-Wei (xwzhang9898@163.com), 2017, Science Press (43):