The Application of Deep Neural Network in Speech Enhancement Processing

被引:0
|
作者
Chen Jian-ming [1 ]
Liang Zhi-cheng [1 ]
机构
[1] Army Acad Armored Forces, Dept Informat & Commun, Beijing, Peoples R China
关键词
Time-frequency analysis; Speech enhancement algorithm; Ensemble Empirical Mode Decomposition; Deep Neural Network; EMPIRICAL MODE DECOMPOSITION;
D O I
10.1109/ICISCE.2018.00257
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To solve the problem that Non-stationary noise is difficult to remove during speech enhancement process when using Fourier transform, this essay will put forward a speech enhancement algorithm based on the combination of Ensemble Empirical Mode Decomposition (EEMD) and Deep Neural Network (DNN). Firstly, preprocessing the original signal by EEMD, and decomposing a series of time-frequency information of the IMF component to meet the time-variation requirement better; Secondly, adjusting the weight of the IMF component by DNN and then synthesize it to enhanced the speech; Finally, comparing the differences of speech enhancement performance between using EEMD alone, using Fourier transform and EEMD as a preprocessing. The results show that the enhanced algorithm using EEMD as a preprocessing improves the scores of PESQ and STOI by 0.745 and 0.169 respectively, effectively improving the speech quality and intelligibility.
引用
收藏
页码:1263 / 1266
页数:4
相关论文
共 50 条
  • [1] Speech Enhancement based on Deep Convolutional Neural Network
    Nuthakki, Ramesh
    Masanta, Payel
    Yukta, T. N.
    [J]. PROCEEDINGS OF THE 2021 FIFTH INTERNATIONAL CONFERENCE ON I-SMAC (IOT IN SOCIAL, MOBILE, ANALYTICS AND CLOUD) (I-SMAC 2021), 2021, : 770 - 775
  • [2] Binaural Deep Neural Network for Robust Speech Enhancement
    Jiang, Yi
    Liu, Runsheng
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (ICSPCC), 2014, : 692 - 695
  • [3] Supervised speech enhancement based on deep neural network
    Saleem, Nasir
    Khattak, Muhammad Irfan
    Qazi, Abdul Baser
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 37 (04) : 5187 - 5201
  • [4] Application of Deep Neural Network Algorithm in Speech Enhancement of Online English Learning Platform
    Peng, Haiyan
    Zhang, Min
    [J]. EAI ENDORSED TRANSACTIONS ON SCALABLE INFORMATION SYSTEMS, 2023, 10 (02)
  • [5] Utilizing Neural Network and Critical Band Processing for Speech Enhancement
    Yong, Pei Chee
    Chan, Kit Yan
    Nordholm, Sven
    [J]. 2017 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC 2017), 2017, : 1300 - 1303
  • [6] An optimization method for speech enhancement based on deep neural network
    Sun, Haixia
    Li, Sikun
    [J]. 3RD INTERNATIONAL CONFERENCE ON ADVANCES IN ENERGY, ENVIRONMENT AND CHEMICAL ENGINEERING, 2017, 69
  • [7] Convolutional Deep Neural Network and Full Connectivity for Speech Enhancement
    Alameri, Ban M.
    Kadhim, Inas Jawad
    Hadi, Suha Qasim
    Hassoon, Ali F.
    Abd, Mustafa M.
    Premaratne, Prashan
    [J]. INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2023, 19 (04) : 140 - 154
  • [8] Speech enhancement based on noise classification and deep neural network
    Wang, Wenbo
    Liu, Houguang
    Yang, Jianhua
    Cao, Guohua
    Hua, Chunli
    [J]. MODERN PHYSICS LETTERS B, 2019, 33 (17):
  • [9] Speech Enhancement for Optical Laser Microphone With Deep Neural Network
    Cai, Chengkai
    Iwai, Kenta
    Nishiura, Takanobu
    Yamashita, Yoichi
    [J]. 2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 449 - 454
  • [10] Speech Enhancement Based on Two-Stage Processing with Deep Neural Network for Laser Doppler Vibrometer
    Cai, Chengkai
    Iwai, Kenta
    Nishiura, Takanobu
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (03):